ZhuofengLi

https://github.com/Zhuofeng-Li

AI & ML interests

Agents, Reasoning LLMs/VLLMs, RL

Recent Activity

updated a dataset 6 days ago

ZhuofengLi/bcp-eval-logs

published a dataset 6 days ago

ZhuofengLi/bcp-eval-logs

updated a dataset 6 days ago

ZhuofengLi/bcplus-eval-100

Organizations

upvoted a paper 7 days ago

ClawBench: Can AI Agents Complete Everyday Online Tasks?

Paper • 2604.08523 • Published 9 days ago • 255

upvoted a paper 9 days ago

Watch Before You Answer: Learning from Visually Grounded Post-Training

Paper • 2604.05117 • Published 12 days ago • 35

upvoted a paper 17 days ago

ImagenWorld: Stress-Testing Image Generation Models with Explainable Human Evaluation on Open-ended Real-World Tasks

Paper • 2603.27862 • Published 19 days ago • 30

upvoted a paper 23 days ago

OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis

Paper • 2603.20278 • Published about 1 month ago • 94

upvoted a collection 2 months ago

MMMU

Collection

MMMU Dataset • 2 items • Updated Feb 11 • 1

upvoted a paper 2 months ago

Dr. MAS: Stable Reinforcement Learning for Multi-Agent LLM Systems

Paper • 2602.08847 • Published Feb 9 • 29

upvoted a collection 2 months ago

OpenResearcher

Collection

OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis • 8 items • Updated 24 days ago • 17

upvoted a paper 2 months ago

Context Forcing: Consistent Autoregressive Video Generation with Long Context

Paper • 2602.06028 • Published Feb 5 • 36

upvoted 6 papers 6 months ago

upvoted 4 papers 7 months ago

EditReward: A Human-Aligned Reward Model for Instruction-Guided Image Editing

Paper • 2509.26346 • Published Sep 30, 2025 • 19

Critique-Coder: Enhancing Coder Models by Critique Reinforcement Learning

Paper • 2509.22824 • Published Sep 26, 2025 • 21

VideoScore2: Think before You Score in Generative Video Evaluation

Paper • 2509.22799 • Published Sep 26, 2025 • 26

Reverse-Engineered Reasoning for Open-Ended Generation

Paper • 2509.06160 • Published Sep 7, 2025 • 151

upvoted 2 papers 8 months ago

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

Paper • 2509.01055 • Published Sep 1, 2025 • 81

Agent Lightning: Train ANY AI Agents with Reinforcement Learning

Paper • 2508.03680 • Published Aug 5, 2025 • 140
