CodeGoat24's Collections

UnifiedReward 2.0 Qwen3.5 Models

updated Mar 16

  • Unified Reward Model for Multimodal Understanding and Generation

    Paper • 2503.05236 • Published Mar 7, 2025 • 124

  • Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning

    Paper • 2505.03318 • Published May 6, 2025 • 94

  • CodeGoat24/UnifiedReward-Think-qwen35-9b

    9B • Updated Mar 9 • 121

  • CodeGoat24/UnifiedReward-Think-qwen35-27b

    3.05M • Updated Mar 15 • 225

  • CodeGoat24/UnifiedReward-Think-qwen35-4b

    5B • Updated Mar 10 • 16 • 2

  • CodeGoat24/UnifiedReward-2.0-qwen35-9b

    9B • Updated Mar 7 • 1.66k • 3

  • CodeGoat24/UnifiedReward-2.0-qwen35-27b

    3.05M • Updated Mar 7 • 829

  • CodeGoat24/UnifiedReward-2.0-qwen35-4b

    5B • Updated Mar 7 • 11