ZhangXiaoyun
DadaCloud01
AI & ML interests
None yet
Recent Activity
upvoted a paper about 16 hours ago
SCOPE: Signal-Calibrated On-Policy Distillation Enhancement with Dual-Path Adaptive Weighting authored a paper 27 days ago
Rediscovering Entropy Regularization: Adaptive Coefficient Unlocks Its
Potential for LLM Reinforcement Learning authored a paper 27 days ago
Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters