MoeReward/combined_preference_dataset_qwen2.5_sft_alpaca_heavy
Viewer
• Updated • 10k • 3
MoeReward/combined_preference_dataset_qwen2.5_sft_qa_heavy
Viewer
• Updated • 9.23k • 3
MoeReward/combined_preference_dataset_qwen2.5_sft_coding_heavy
Viewer
• Updated • 10k • 3
MoeReward/combined_preference_dataset_qwen2.5_sft_math_heavy
Viewer
• Updated • 10k • 3
MoeReward/combined_preference_dataset_qwen2.5_sft_equal_dist
Viewer
• Updated • 10k • 3
MoeReward/combined_preference_dataset_qwen2.5_sft
Viewer
• Updated • 81.3k • 3
MoeReward/combined_preference_dataset_olmoe_sft
Viewer
• Updated • 61.7k • 3
MoeReward/combined_preference_dataset_olmoe_base
Viewer
• Updated • 66.7k • 3
MoeReward/combined_preference_dataset_olmoe_base_alpaca_heavy
Viewer
• Updated • 10k • 3
MoeReward/combined_preference_dataset_olmoe_base_qa_heavy
Viewer
• Updated • 9.23k • 3
MoeReward/combined_preference_dataset_olmoe_base_coding_heavy
Viewer
• Updated • 9.92k • 3
MoeReward/combined_preference_dataset_olmoe_base_math_heavy
Viewer
• Updated • 10k • 3
MoeReward/combined_preference_dataset_olmoe_base_equal_dist
Viewer
• Updated • 10k • 3
MoeReward/combined_preference_dataset_qwen1.5_base_alpaca_heavy
Viewer
• Updated • 10k • 2
MoeReward/combined_preference_dataset_qwen1.5_base_qa_heavy
Viewer
• Updated • 9.23k • 2
MoeReward/combined_preference_dataset_qwen1.5_base_coding_heavy
Viewer
• Updated • 10k • 3
MoeReward/combined_preference_dataset_qwen1.5_base_math_heavy
Viewer
• Updated • 10k • 3
MoeReward/combined_preference_dataset_qwen1.5_base_equal_dist
Preview
• Updated • 3
MoeReward/combined_preference_dataset_qwen1.5_base
Viewer
• Updated • 61.9k • 2
MoeReward/combined_preference_dataset_qwen
Viewer
• Updated • 50k • 3
MoeReward/combined_preference_dataset_olmoe
Viewer
• Updated • 56.6k • 3
MoeReward/combined_sft_dataset
Viewer
• Updated • 115k • 3
MoeReward/combined_preference_dataset
Viewer
• Updated • 52k • 3
MoeReward/combined_rlhf_dataset
Viewer
• Updated • 125k • 2