Yunshan Ma
ysma
ยท
AI & ML interests
None yet
Recent Activity
upvoted a paper about 1 month ago
Video-Based Reward Modeling for Computer-Use Agents authored a paper 4 months ago
FashionDPO:Fine-tune Fashion Outfit Generation Model using Direct
Preference Optimization upvoted a paper 7 months ago
Quantile Advantage Estimation for Entropy-Safe ReasoningOrganizations
None yet