1 23 5

Kaiyuan Chen

Lucky2022

https://chenky9106.github.io/

AI & ML interests

None yet

Recent Activity

upvoted a paper 8 days ago

Test-Time Scaling Makes Overtraining Compute-Optimal

liked a model 9 days ago

zai-org/GLM-5.1

liked a model 9 days ago

unsloth/GLM-5.1-GGUF

View all activity

Organizations

upvoted a paper 8 days ago

Test-Time Scaling Makes Overtraining Compute-Optimal

Paper • 2604.01411 • Published 16 days ago • 28

upvoted a paper 18 days ago

Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills

Paper • 2603.25158 • Published 22 days ago • 50

upvoted 3 papers 30 days ago

Online Experiential Learning for Language Models

Paper • 2603.16856 • Published about 1 month ago • 58

Spatial-TTT: Streaming Visual-based Spatial Intelligence with Test-Time Training

Paper • 2603.12255 • Published Mar 12 • 91

Attention Residuals

Paper • 2603.15031 • Published Mar 16 • 180

upvoted a paper about 1 month ago

\$OneMillion-Bench: How Far are Language Agents from Human Experts?

Paper • 2603.07980 • Published Mar 9 • 27

upvoted a paper 2 months ago

AgentIF-OneDay: A Task-level Instruction-Following Benchmark for General AI Agents in Daily Scenarios

Paper • 2601.20613 • Published Jan 28 • 10

upvoted a paper 3 months ago

BabyVision: Visual Reasoning Beyond Language

Paper • 2601.06521 • Published Jan 10 • 201

upvoted a paper 4 months ago

On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models

Paper • 2512.07783 • Published Dec 8, 2025 • 39

upvoted 3 papers 5 months ago

upvoted a collection 8 months ago

Seed-OSS

Collection

Seed-OSS Open-Source Models • 3 items • Updated Aug 20, 2025 • 61

upvoted a paper 10 months ago

xbench: Tracking Agents Productivity Scaling with Profession-Aligned Real-World Evaluations

Paper • 2506.13651 • Published Jun 16, 2025 • 8

upvoted 3 papers 11 months ago

MMaDA: Multimodal Large Diffusion Language Models

Paper • 2505.15809 • Published May 21, 2025 • 98

Scaling Computer-Use Grounding via User Interface Decomposition and Synthesis

Paper • 2505.13227 • Published May 19, 2025 • 45

Chain-of-Model Learning for Language Model

Paper • 2505.11820 • Published May 17, 2025 • 121

upvoted 2 papers 12 months ago

VisuLogic: A Benchmark for Evaluating Visual Reasoning in Multi-modal Large Language Models

Paper • 2504.15279 • Published Apr 21, 2025 • 78

Learning to Reason under Off-Policy Guidance

Paper • 2504.14945 • Published Apr 21, 2025 • 88

upvoted a paper about 1 year ago

RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback

Paper • 2312.00849 • Published Dec 1, 2023 • 12

Kaiyuan Chen

AI & ML interests

Recent Activity

Organizations

Lucky2022's activity