r1

non-profit

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

JasperHaozhe authored a paper 1 day ago

Emergent Hierarchical Reasoning in LLMs through Reinforcement Learning

JasperHaozhe authored a paper 1 day ago

Reverse-Engineered Reasoning for Open-Ended Generation

JasperHaozhe authored a paper 1 day ago

VideoScore2: Think before You Score in Generative Video Evaluation

View all activity

JasperHaozhe

authored 13 papers 1 day ago

Dr. Bench: A Multidimensional Evaluation for Deep Research Agents, from Answers to Reports

Paper • 2510.02190 • Published Jan 29 • 19

Infinity Parser: Layout Aware Reinforcement Learning for Scanned Document Parsing

Paper • 2510.15349 • Published Oct 17, 2025

TR-DQ: Time-Rotation Diffusion Quantization

Paper • 2503.06564 • Published Mar 9, 2025

From Illusion to Intention: Visual Rationale Learning for Vision-Language Reasoning

Paper • 2511.23031 • Published Nov 28, 2025 • 1

CogDoc: Towards Unified thinking in Documents

Paper • 2512.12658 • Published Dec 14, 2025

EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience

Paper • 2601.15876 • Published Jan 22 • 92

Understanding by Reconstruction: Reversing the Software Development Process for LLM Pretraining

Paper • 2603.11103 • Published Mar 11 • 9

SWE-QA-Pro: A Representative Benchmark and Scalable Training Recipe for Repository-Level Code Understanding

Paper • 2603.16124 • Published Mar 17 • 3

LongCat-Next: Lexicalizing Modalities as Discrete Tokens

Paper • 2603.27538 • Published 20 days ago • 143

RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time

Paper • 2604.11626 • Published 5 days ago • 99

JasperHaozhe

submitted a paper to Daily Papers 2 days ago

RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time

Paper • 2604.11626 • Published 5 days ago • 99

che111

submitted a paper to Daily Papers 19 days ago

MedOpenClaw: Auditable Medical Imaging Agents Reasoning over Uncurated Full Studies

Paper • 2603.24649 • Published 24 days ago • 31

JasperHaozhe

authored a paper 8 months ago

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

Paper • 2509.01055 • Published Sep 1, 2025 • 81

JasperHaozhe

authored 4 papers 10 months ago

Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning

Paper • 2505.15966 • Published May 21, 2025 • 53

StructEval: Benchmarking LLMs' Capabilities to Generate Structural Outputs

Paper • 2505.20139 • Published May 26, 2025 • 19

Beyond Distillation: Pushing the Limits of Medical LLM Reasoning with Minimalist Rule-Based RL

Paper • 2505.17952 • Published May 23, 2025 • 20

Benchmarking Multimodal Knowledge Conflict for Large Multimodal Models

Paper • 2505.19509 • Published May 26, 2025 • 7

AI & ML interests

Recent Activity

Team members 2

medr1's activity