Dawn's picture

3

Dawn

LegendaryDawn

·

AI & ML interests

None yet

Recent Activity

updated a model 3 days ago

LegendaryDawn/mbpo-adv_dpo001-shard2-adv_8_64_rankmix20-dapo-n8-qwen2_5_vl_3b-step300

published a model 3 days ago

LegendaryDawn/mbpo-adv_dpo001-shard2-adv_8_64_rankmix20-dapo-n8-qwen2_5_vl_3b-step300

updated a model 3 days ago

LegendaryDawn/mbpo-adv_neg_replace-adv_8_64_rankmix20-dapo-n8-qwen2_5_vl_3b-step300

View all activity

Organizations

None yet

upvoted 2 papers 2 months ago

Prepare Reasoning Language Models for Multi-Agent Debate with Self-Debate Reinforcement Learning

Paper • 2601.22297 • Published Jan 29 • 3

PhyCritic: Multimodal Critic Models for Physical AI

Paper • 2602.11124 • Published Feb 11 • 55

upvoted a paper 5 months ago

Explore Data Left Behind in Reinforcement Learning for Reasoning Language Models

Paper • 2511.04800 • Published Nov 6, 2025 • 1