郭思宇

songwe1xj

AI & ML interests

Embodied AI and robotics prototypes. Mostly focused on experiments.

Organizations

None yet

upvoted a paper about 7 hours ago

Evaluating Temporal Semantic Caching and Workflow Optimization in Agentic Plan-Execute Pipelines

Paper • 2605.20630 • Published 1 day ago • 9

upvoted a paper about 9 hours ago

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

Paper • 2605.11609 • Published 10 days ago • 117

upvoted a paper 3 days ago

ELF: Embedded Language Flows

Paper • 2605.10938 • Published 11 days ago • 14

upvoted a paper 16 days ago

From Context to Skills: Can Language Models Learn from Context Skillfully?

Paper • 2604.27660 • Published 19 days ago • 161

upvoted a paper 27 days ago

EasyVideoR1: Easier RL for Video Understanding

Paper • 2604.16893 • Published Apr 18 • 40

upvoted a paper 28 days ago

LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model

Paper • 2604.20796 • Published 30 days ago • 240

upvoted 4 papers about 1 month ago

WildDet3D: Scaling Promptable 3D Detection in the Wild

Paper • 2604.08626 • Published Apr 9 • 245

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published Apr 2 • 503

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published Apr 3 • 629

SQuTR: A Robustness Benchmark for Spoken Query to Text Retrieval under Acoustic Noise

Paper • 2602.12783 • Published Feb 13 • 238

upvoted 2 papers about 2 months ago

CutClaw: Agentic Hours-Long Video Editing via Music Synchronization

Paper • 2603.29664 • Published Mar 31 • 49

Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models

Paper • 2603.17051 • Published Mar 17 • 109

upvoted 4 papers 2 months ago

Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

Paper • 2603.04597 • Published Mar 4 • 210
