Yi Shan's picture

74

Yi Shan

awangaddd

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

KnowU-Bench: Towards Interactive, Proactive, and Personalized Mobile Agent Evaluation

upvoted a paper 1 day ago

HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents

upvoted a paper 1 day ago

SkillClaw: Let Skills Evolve Collectively with Agentic Evolver

View all activity

Organizations

None yet

upvoted 5 papers 1 day ago

KnowU-Bench: Towards Interactive, Proactive, and Personalized Mobile Agent Evaluation

Paper • 2604.08455 • Published 6 days ago • 43

HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents

Paper • 2604.07430 • Published 7 days ago • 163

SkillClaw: Let Skills Evolve Collectively with Agentic Evolver

Paper • 2604.08377 • Published 6 days ago • 273

ClawBench: Can AI Agents Complete Everyday Online Tasks?

Paper • 2604.08523 • Published 6 days ago • 252

Think in Strokes, Not Pixels: Process-Driven Image Generation via Interleaved Reasoning

Paper • 2604.04746 • Published 7 days ago • 69

upvoted 8 papers 6 days ago

How Well Do Agentic Skills Work in the Wild: Benchmarking LLM Skill Usage in Realistic Settings

Paper • 2604.04323 • Published 9 days ago • 39

ACES: Who Tests the Tests? Leave-One-Out AUC Consistency for Code Generation

Paper • 2604.03922 • Published 10 days ago • 53

MinerU-Diffusion: Rethinking Document OCR as Inverse Rendering via Diffusion Decoding

Paper • 2603.22458 • Published 22 days ago • 135

SlopCodeBench: Benchmarking How Coding Agents Degrade Over Long-Horizon Iterative Tasks

Paper • 2603.24755 • Published 20 days ago • 29

Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?

Paper • 2603.24472 • Published 20 days ago • 53

Hyperagents

Paper • 2603.19461 • Published 26 days ago • 49

UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience

Paper • 2603.24533 • Published 20 days ago • 47

CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents

Paper • 2603.24440 • Published 20 days ago • 96

upvoted 4 papers 7 days ago

SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization

Paper • 2604.02268 • Published 13 days ago • 93

MinerU2.5-Pro: Pushing the Limits of Data-Centric Document Parsing at Scale

Paper • 2604.04771 • Published 9 days ago • 116

OpenWorldLib: A Unified Codebase and Definition of Advanced World Models

Paper • 2604.04707 • Published 9 days ago • 200

DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models

Paper • 2603.26164 • Published 18 days ago • 351

upvoted a paper 26 days ago

MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

Paper • 2603.17187 • Published 28 days ago • 138

upvoted a paper 28 days ago

WebVR: Benchmarking Multimodal LLMs for WebPage Recreation from Videos via Human-Aligned Visual Rubrics

Paper • 2603.13391 • Published Mar 11 • 19

upvoted a paper 29 days ago

Multimodal OCR: Parse Anything from Documents

Paper • 2603.13032 • Published Mar 13 • 43