2 27 2

Runpeng Dai

Leo-Dai

AI & ML interests

None yet

Recent Activity

upvoted a paper 28 days ago

MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

updated a dataset about 1 month ago

Leo-Dai/APO_AIME24

updated a dataset about 1 month ago

Leo-Dai/APO_AIME25

View all activity

Organizations

upvoted a paper 28 days ago

MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

Paper • 2603.17187 • Published 30 days ago • 138

updated 3 datasets about 1 month ago

liked a Space 2 months ago

Efficient Reasoning Online Judgement

📉

upvoted a paper 2 months ago

Training Data Efficiency in Multimodal Process Reward Models

Paper • 2602.04145 • Published Feb 4 • 79

authored a paper 2 months ago

Parallel-Probe: Towards Efficient Parallel Thinking via 2D Probing

Paper • 2602.03845 • Published Feb 3 • 27

upvoted 2 papers 2 months ago

Learning Query-Specific Rubrics from Human Preferences for DeepResearch Report Generation

Paper • 2602.03619 • Published Feb 3 • 28

Parallel-Probe: Towards Efficient Parallel Thinking via 2D Probing

Paper • 2602.03845 • Published Feb 3 • 27

upvoted a paper 3 months ago

RelayLLM: Efficient Reasoning via Collaborative Decoding

Paper • 2601.05167 • Published Jan 8 • 31

upvoted a paper 4 months ago

MotionEdit: Benchmarking and Learning Motion-Centric Image Editing

Paper • 2512.10284 • Published Dec 11, 2025 • 26

upvoted 3 papers 5 months ago

Multi-Crit: Benchmarking Multimodal Judges on Pluralistic Criteria-Following

Paper • 2511.21662 • Published Nov 26, 2025 • 11

Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning

Paper • 2511.16043 • Published Nov 20, 2025 • 111

VisPlay: Self-Evolving Vision-Language Models from Images

Paper • 2511.15661 • Published Nov 19, 2025 • 44

commented a paper 5 months ago

StatEval: A Comprehensive Benchmark for Large Language Models in Statistics

Paper • 2510.09517 • Published Oct 10, 2025 • 8 •

liked a dataset 5 months ago

BlueZeros/EHR-Bench

Preview • Updated Nov 3, 2025 • 15 • 2

upvoted a paper 6 months ago

LiteSearch: Efficacious Tree Search for LLM

Paper • 2407.00320 • Published Jun 29, 2024 • 40

authored a paper 6 months ago

StatEval: A Comprehensive Benchmark for Large Language Models in Statistics

Paper • 2510.09517 • Published Oct 10, 2025 • 8

upvoted a paper 6 months ago

StatEval: A Comprehensive Benchmark for Large Language Models in Statistics

Paper • 2510.09517 • Published Oct 10, 2025 • 8

commented a paper 6 months ago

StatEval: A Comprehensive Benchmark for Large Language Models in Statistics

Paper • 2510.09517 • Published Oct 10, 2025 • 8 •

Runpeng Dai

AI & ML interests

Recent Activity

Organizations

Leo-Dai's activity

Efficient Reasoning Online Judgement