10 1

dsadsa

yueyue0407

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

upvoted a collection 8 days ago

Rethink_SFT_generalization

upvoted a paper 8 days ago

ATBench: A Diverse and Realistic Trajectory Benchmark for Long-Horizon Agent Safety

View all activity

Organizations

upvoted a paper 5 days ago

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published 10 days ago • 314

upvoted a collection 8 days ago

Rethink_SFT_generalization

Collection

Repo for paper Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability. • 40 items • Updated 6 days ago • 16

upvoted a paper 8 days ago

ATBench: A Diverse and Realistic Trajectory Benchmark for Long-Horizon Agent Safety

Paper • 2604.02022 • Published 16 days ago • 15

liked a dataset about 1 month ago

AI45Research/ATBench

Viewer • Updated 8 days ago • 1.5k • 875 • 34

upvoted a paper about 1 month ago

Code2Math: Can Your Code Agent Effectively Evolve Math Problems Through Exploration?

Paper • 2603.03202 • Published Mar 3 • 17

upvoted a paper about 2 months ago

Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical Report v1.5

Paper • 2602.14457 • Published Feb 16 • 29

upvoted a paper 3 months ago

AgentDoG: A Diagnostic Guardrail Framework for AI Agent Safety and Security

Paper • 2601.18491 • Published Jan 26 • 125

upvoted a collection 3 months ago

AgentDoG

Collection

A Diagnostic Guardrail Framework for AI Agent Safety and Security • 11 items • Updated about 16 hours ago • 107

upvoted a paper 5 months ago

Geometrically-Constrained Agent for Spatial Reasoning

Paper • 2511.22659 • Published Nov 27, 2025 • 41

upvoted 2 papers 6 months ago

LLMs Learn to Deceive Unintentionally: Emergent Misalignment in Dishonesty from Misaligned Samples to Biased Human-AI Interactions

Paper • 2510.08211 • Published Oct 9, 2025 • 22

Your Agent May Misevolve: Emergent Risks in Self-evolving LLM Agents

Paper • 2509.26354 • Published Sep 30, 2025 • 18

dsadsa

AI & ML interests

Recent Activity

Organizations

yueyue0407's activity