Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Dawn's picture
3

Dawn

LegendaryDawn
John6666's profile picture
·

AI & ML interests

None yet

Recent Activity

updated a model 3 days ago
LegendaryDawn/mbpo-adv_dpo001-shard2-adv_8_64_rankmix20-dapo-n8-qwen2_5_vl_3b-step300
published a model 3 days ago
LegendaryDawn/mbpo-adv_dpo001-shard2-adv_8_64_rankmix20-dapo-n8-qwen2_5_vl_3b-step300
updated a model 3 days ago
LegendaryDawn/mbpo-adv_neg_replace-adv_8_64_rankmix20-dapo-n8-qwen2_5_vl_3b-step300
View all activity

Organizations

None yet

upvoted 2 papers 2 months ago

Prepare Reasoning Language Models for Multi-Agent Debate with Self-Debate Reinforcement Learning

Paper • 2601.22297 • Published Jan 29 • 3

PhyCritic: Multimodal Critic Models for Physical AI

Paper • 2602.11124 • Published Feb 11 • 55
upvoted a paper 5 months ago

Explore Data Left Behind in Reinforcement Learning for Reasoning Language Models

Paper • 2511.04800 • Published Nov 6, 2025 • 1
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs