rubricreward/PolyGuardMix-tgt_prompt_en_thinking
Viewer
• Updated • 2.92M • 9
rubricreward/PolyGuardMix-en_prompt_en_thinking
Viewer
• Updated • 2.92M • 7
rubricreward/arena-human-preference-tgt_prompt_tgt_thinking-filtered_correct
Viewer
• Updated • 61.1k • 4
rubricreward/arena-human-preference-tgt_prompt_en_thinking-filtered_correct
Viewer
• Updated • 60.7k • 3
rubricreward/arena-human-preference-en_prompt_en_thinking-filtered_correct
Viewer
• Updated • 60.3k • 9
rubricreward/arena-human-preference-tgt_prompt_tgt_thinking
Viewer
• Updated • 120k • 4
rubricreward/arena-human-preference-tgt_prompt_en_thinking
Viewer
• Updated • 120k • 4
rubricreward/arena-human-preference-en_prompt_en_thinking
Viewer
• Updated • 120k • 21
rubricreward/PolyGuardMix
Viewer
• Updated • 2.93M • 3
rubricreward/mR3-Dataset-Filtered1
Viewer
• Updated • 696k • 4
rubricreward/PolyGuardMix-filtered-tgt_prompt_tgt_thinking-filtered_correct
Viewer
• Updated • 624k • 4
rubricreward/PolyGuardMix-filtered-tgt_prompt_en_thinking-filtered_correct
Viewer
• Updated • 631k • 4
rubricreward/PolyGuardMix-filtered-en_prompt_en_thinking-filtered_correct
Viewer
• Updated • 638k • 4
rubricreward/PolyGuardMix-filtered-tgt_prompt_tgt_thinking
Viewer
• Updated • 890k • 3
rubricreward/PolyGuardMix-filtered-tgt_prompt_en_thinking
Viewer
• Updated • 904k • 4
• 1
rubricreward/PolyGuardMix-filtered-en_prompt_en_thinking
Viewer
• Updated • 903k • 4
Viewer
• Updated • 40.5k • 4
rubricreward/HelpSteer3-en_prompt_tgt_thinking-filtered_correct
Viewer
• Updated • 12.7k • 4
rubricreward/HumanEval-XL-Python-tgt_prompt_tgt_thinking-filtered_correct
Viewer
• Updated • 3.1k • 2
rubricreward/MATH-500-Multilingual-tgt_prompt_tgt_thinking-filtered_correct
Viewer
• Updated • 4.57k • 4
rubricreward/MMMLU-tgt_prompt_tgt_thinking-filtered_correct
Viewer
• Updated • 148k • 4
rubricreward/HumanEval-XL-Python-tgt_prompt_en_thinking-filtered_correct
Viewer
• Updated • 3.2k • 4
rubricreward/MATH-500-Multilingual-tgt_prompt_en_thinking-filtered_correct
Viewer
• Updated • 4.83k • 5
rubricreward/MMMLU-tgt_prompt_en_thinking-filtered_correct
Viewer
• Updated • 158k • 4
rubricreward/HumanEval-XL-Python-en_prompt_tgt_thinking-filtered_correct
Viewer
• Updated • 3.09k • 4
rubricreward/MATH-500-Multilingual-en_prompt_tgt_thinking-filtered_correct
Viewer
• Updated • 4.59k • 4
rubricreward/MMMLU-en_prompt_tgt_thinking-filtered_correct
Viewer
• Updated • 148k • 4
rubricreward/arena-human-preference-en_prompt_tgt_thinking-filtered_correct
Viewer
• Updated • 51.4k • 4
rubricreward/HumanEval-XL-Python-en_prompt_en_thinking-filtered_correct
Viewer
• Updated • 3.25k • 4
rubricreward/MATH-500-Multilingual-en_prompt_en_thinking-filtered_correct
Viewer
• Updated • 4.83k • 3