Hsg8l24mya3's picture

Hsg8l24mya3

hsg8l24mya3

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time

liked a model 4 days ago

tencent/HY-Embodied-0.5

upvoted a paper 6 days ago

Tunable Soft Equivariance with Guarantees

View all activity

Organizations

None yet

upvoted a paper 3 days ago

RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time

Paper • 2604.11626 • Published 6 days ago • 99

upvoted a paper 6 days ago

Tunable Soft Equivariance with Guarantees

Paper • 2603.26657 • Published 22 days ago • 5

upvoted a paper 7 days ago

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published 17 days ago • 480

upvoted a paper 8 days ago

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published 16 days ago • 361

upvoted 2 papers 9 days ago

Agentic-MME: What Agentic Capability Really Brings to Multimodal Intelligence?

Paper • 2604.03016 • Published 16 days ago • 37

When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models

Paper • 2604.08546 • Published 10 days ago • 114

upvoted 2 papers 18 days ago

CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence

Paper • 2603.28032 • Published 20 days ago • 340

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published 30 days ago • 338

upvoted 2 papers about 1 month ago

Demystifing Video Reasoning

Paper • 2603.16870 • Published Mar 17 • 369

Heterogeneous Agent Collaborative Reinforcement Learning

Paper • 2603.02604 • Published Mar 3 • 194

upvoted 2 papers about 2 months ago

UniG2U-Bench: Do Unified Models Advance Multimodal Understanding?

Paper • 2603.03241 • Published Mar 3 • 87

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published Feb 23 • 519

upvoted 2 papers 2 months ago

Less is Enough: Synthesizing Diverse Data in Feature Space of LLMs

Paper • 2602.10388 • Published Feb 11 • 244

TermiGen: High-Fidelity Environment and Robust Trajectory Synthesis for Terminal Agents

Paper • 2602.07274 • Published Feb 6 • 208