6 70 24

Minghui Jia

Maxwell-Jia

Maxwell-Jia

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

SkillX: Automatically Constructing Skill Knowledge Bases for Agents

upvoted a paper 16 days ago

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

upvoted a paper 17 days ago

Towards a Medical AI Scientist

View all activity

Organizations

upvoted a paper 1 day ago

SkillX: Automatically Constructing Skill Knowledge Bases for Agents

Paper • 2604.04804 • Published 12 days ago • 32

upvoted a paper 16 days ago

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published 28 days ago • 337

upvoted 2 papers 17 days ago

Towards a Medical AI Scientist

Paper • 2603.28589 • Published 18 days ago • 88

PRBench: End-to-end Paper Reproduction in Physics Research

Paper • 2603.27646 • Published 19 days ago • 29

upvoted a paper about 1 month ago

Dr. MAS: Stable Reinforcement Learning for Multi-Agent LLM Systems

Paper • 2602.08847 • Published Feb 9 • 29

upvoted 2 papers 2 months ago

InternAgent-1.5: A Unified Agentic Framework for Long-Horizon Autonomous Scientific Discovery

Paper • 2602.08990 • Published Feb 9 • 77

Towards Autonomous Mathematics Research

Paper • 2602.10177 • Published Feb 10 • 36

upvoted 2 papers 3 months ago

LLM-in-Sandbox Elicits General Agentic Intelligence

Paper • 2601.16206 • Published Jan 22 • 86

Spec-o3: A Tool-Augmented Vision-Language Agent for Rare Celestial Object Candidate Vetting via Automated Spectral Inspection

Paper • 2601.06498 • Published Jan 10 • 1

upvoted a paper 5 months ago

In-the-Flow Agentic System Optimization for Effective Planning and Tool Use

Paper • 2510.05592 • Published Oct 7, 2025 • 110

upvoted a collection 6 months ago

Qwen3-VL

Collection

37 items • Updated Dec 31, 2025 • 696

upvoted 3 papers 6 months ago

upvoted 3 papers 9 months ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24, 2025 • 320

VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning

Paper • 2507.13348 • Published Jul 17, 2025 • 79

A Survey of Context Engineering for Large Language Models

Paper • 2507.13334 • Published Jul 17, 2025 • 263

upvoted 2 papers 10 months ago

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published Jul 1, 2025 • 254

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 265

upvoted a paper 11 months ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2, 2025 • 190

Minghui Jia

AI & ML interests

Recent Activity

Organizations

Maxwell-Jia's activity