rubricreward/R3-Qwen3-14B_merged_linear_6model
Text Generation
• 15B • Updated • 7
rubricreward/R3-Phi-4-reasoning-plus-LoRA-5K-v1.1
15B • Updated rubricreward/R3-Qwen3-4B_merged_linear_5model
Text Generation
• 4B • Updated • 2
rubricreward/R3-Qwen3-8B_merged_linear_6model
Text Generation
• 8B • Updated • 3
rubricreward/R3-Qwen3-14B-LoRA-15K-v1.1
15B • Updated rubricreward/R3-Qwen3-14B-LoRA-5K-v1.1
15B • Updated rubricreward/R3-Qwen3-4B_merged_linear_6model
Text Generation
• 4B • Updated • 2
rubricreward/R3-Qwen3-8B-LoRA-5K-v1.1
8B • Updated rubricreward/R3-Qwen3-4B-LoRA-5K-v1.1
4B • Updated • 1
rubricreward/R3-Qwen3-4B-15K-v1.1
Text Generation
• 4B • Updated • 8
rubricreward/R3-Qwen3-4B-5K-v1.1
Text Generation
• 4B • Updated • 8
rubricreward/R3-Qwen3-14B_merged_linear_5model
Text Generation
• 15B • Updated • 5
rubricreward/R3-Qwen3-14B-Skywork
15B • Updated • 6
rubricreward/R3-Qwen3-14B_merged_linear_4model
Text Generation
• 15B • Updated • 5
rubricreward/R3-Qwen3-4B_merged_linear_old2
Text Generation
• 4B • Updated • 2
rubricreward/R3-Qwen3-14B_merged_ties
Text Generation
• 15B • Updated • 5
rubricreward/R3-Qwen3-8B_merged_ties
Text Generation
• 8B • Updated • 2
rubricreward/R3-Qwen3-4B_merged_ties
Text Generation
• 4B • Updated • 2
rubricreward/R3-Qwen3-4B_merged_linear
Text Generation
• 4B • Updated • 2
rubricreward/R3-Qwen3-8B_merged_linear
Text Generation
• 8B • Updated • 2
rubricreward/R3-Qwen3-14B_merged_linear
Text Generation
• 15B • Updated • 4
rubricreward/R3-Phi-4-reasoning-plus-14k
Text Generation
• 15B • Updated • 62
• 2
rubricreward/R3-Phi-4-reasoning-plus-4k
Text Generation
• 15B • Updated • 4
• 1
rubricreward/R3-Phi-4-reasoning-plus-LoRA-4k
Text Generation
• 15B • Updated • 9
• 1
rubricreward/R3-Qwen3-4B-14k
Text Generation
• 4B • Updated • 221
• 1
rubricreward/R3-Qwen3-4B-4k
Text Generation
• 4B • Updated • 7
• 1
rubricreward/R3-Qwen3-4B-LoRA-4k
Text Generation
• 4B • Updated • 11
• 1
rubricreward/R3-Qwen3-8B-14k
Text Generation
• 8B • Updated • 195
• 1
rubricreward/R3-Qwen3-8B-4k
Text Generation
• 8B • Updated • 8
• 1
rubricreward/R3-Qwen3-8B-LoRA-4k
Text Generation
• 8B • Updated • 12
• 1