E Sanchez's picture

9 9

E Sanchez

esanchez43

AI & ML interests

None yet

Recent Activity

liked a model about 1 hour ago

tencent/HY-Embodied-0.5

liked a dataset 2 days ago

FreedomIntelligence/medical-o1-reasoning-SFT

upvoted a paper 2 days ago

SkillClaw: Let Skills Evolve Collectively with Agentic Evolver

View all activity

Organizations

None yet

upvoted 2 papers 2 days ago

SkillClaw: Let Skills Evolve Collectively with Agentic Evolver

Paper • 2604.08377 • Published 6 days ago • 272

Faithful GRPO: Improving Visual Spatial Reasoning in Multimodal Language Models via Constrained Policy Optimization

Paper • 2604.08476 • Published 5 days ago • 7

upvoted a paper 6 days ago

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published 12 days ago • 352

upvoted a paper 12 days ago

CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence

Paper • 2603.28032 • Published 15 days ago • 339

upvoted a paper 26 days ago

Demystifing Video Reasoning

Paper • 2603.16870 • Published 28 days ago • 368

upvoted a paper 28 days ago

Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

Paper • 2603.04597 • Published Mar 4 • 210

upvoted 2 papers about 1 month ago

Heterogeneous Agent Collaborative Reinforcement Learning

Paper • 2603.02604 • Published Mar 3 • 194

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

Paper • 2602.10693 • Published Feb 11 • 220

upvoted a paper about 2 months ago

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published Feb 23 • 519