arxiv:2604.04872
Zhuokai Zhao
zhuokai
AI & ML interests
Data-Efficient Learning, LLM Reasoning and Safety, Active Learning, Recommender System
Recent Activity
authored a paper 4 days ago
Synthetic Sandbox for Training Machine Learning Engineering Agents upvoted a paper 5 days ago
Synthetic Sandbox for Training Machine Learning Engineering Agents authored a paper 3 months ago
Preference Optimization with Multi-Sample Comparisons