Haozhe Wang's picture

Haozhe Wang PRO

JasperHaozhe

·

AI & ML interests

None yet

Recent Activity

authored a paper 2 days ago

Emergent Hierarchical Reasoning in LLMs through Reinforcement Learning

authored a paper 2 days ago

Reverse-Engineered Reasoning for Open-Ended Generation

authored a paper 2 days ago

VideoScore2: Think before You Score in Generative Video Evaluation

View all activity

Organizations

upvoted a paper 3 days ago

RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time

Paper • 2604.11626 • Published 6 days ago • 99

upvoted a collection 5 days ago

RationalRewards

A Reasoning Reward Model that Scale Image Generation Both Training and Test Time • 6 items • Updated 3 days ago • 2

upvoted a paper 5 months ago

Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning

Paper • 2511.16043 • Published Nov 20, 2025 • 111

upvoted 2 papers 7 months ago

Emergent Hierarchical Reasoning in LLMs through Reinforcement Learning

Paper • 2509.03646 • Published Sep 3, 2025 • 33

Reverse-Engineered Reasoning for Open-Ended Generation

Paper • 2509.06160 • Published Sep 7, 2025 • 151

upvoted 4 papers 8 months ago

UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning

Paper • 2509.02544 • Published Sep 2, 2025 • 127

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

Paper • 2509.01055 • Published Sep 1, 2025 • 81

Attributes as Textual Genes: Leveraging LLMs as Genetic Algorithm Simulators for Conditional Synthetic Data Generation

Paper • 2509.02040 • Published Sep 2, 2025 • 15

FutureX: An Advanced Live Benchmark for LLM Agents in Future Prediction

Paper • 2508.11987 • Published Aug 16, 2025 • 73

upvoted a paper 9 months ago

A Survey on Latent Reasoning

Paper • 2507.06203 • Published Jul 8, 2025 • 94

upvoted 2 papers 11 months ago

SRPO: Enhancing Multimodal LLM Reasoning via Reflection-Aware Reinforcement Learning

Paper • 2506.01713 • Published Jun 2, 2025 • 48

Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning

Paper • 2505.15966 • Published May 21, 2025 • 53

upvoted a collection about 1 year ago

VL-Rethinker

SoTA VLM for Reasoning • 7 items • Updated May 5, 2025 • 6

upvoted 2 papers about 1 year ago

VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning

Paper • 2504.08837 • Published Apr 10, 2025 • 44

MedVLM-R1: Incentivizing Medical Reasoning Capability of Vision-Language Models (VLMs) via Reinforcement Learning

Paper • 2502.19634 • Published Mar 19, 2025 • 62