Xiangyu

xixy

https://xixy.github.io/

AI & ML interests

None yet

Organizations

None yet

Recent Activity

authored a paper about 21 hours ago

On the Role of Reasoning Patterns in the Generalization Discrepancy of Long Chain-of-Thought Supervised Fine-Tuning

Paper • 2604.01702 • Published 10 days ago • 3

upvoted a paper 6 days ago

How Well Do Agentic Skills Work in the Wild: Benchmarking LLM Skill Usage in Realistic Settings

Paper • 2604.04323 • Published 8 days ago • 38

upvoted a paper 7 days ago

On the Role of Reasoning Patterns in the Generalization Discrepancy of Long Chain-of-Thought Supervised Fine-Tuning

Paper • 2604.01702 • Published 10 days ago • 3

commented on a paper 7 days ago

Embarrassingly Simple Self-Distillation Improves Code Generation

Paper • 2604.01193 • Published 13 days ago • 36

authored a paper 18 days ago

LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning

Paper • 2603.21065 • Published 23 days ago • 77

New activity in Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled about 1 month ago

Claude distillation

#1 opened about 1 month ago by gergopool

upvoted a paper about 1 month ago

How Controllable Are Large Language Models? A Unified Evaluation across Behavioral Granularities

Paper • 2603.02578 • Published Mar 3 • 25

authored 6 papers 3 months ago

upvoted 2 papers 3 months ago

LongCat-Flash-Thinking-2601 Technical Report

Paper • 2601.16725 • Published Jan 23 • 180

Unlocking Implicit Experience: Synthesizing Tool-Use Trajectories from Text

Paper • 2601.10355 • Published Jan 15 • 39

upvoted a paper 4 months ago

Universal Reasoning Model

Paper • 2512.14693 • Published Dec 16, 2025 • 44

commented on 2 papers 4 months ago

Rethinking Expert Trajectory Utilization in LLM Post-training

Paper • 2512.11470 • Published Dec 12, 2025 • 10

State over Tokens: Characterizing the Role of Reasoning Tokens

Paper • 2512.12777 • Published Dec 14, 2025 • 5

commented on 2 papers 5 months ago

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

Paper • 2511.06221 • Published Nov 9, 2025 • 134

DRIVE: Data Curation Best Practices for Reinforcement Learning with Verifiable Reward in Competitive Code Generation

Paper • 2511.06307 • Published Nov 9, 2025 • 53
