kureha295/DeepSeek-R1-Distill-Qwen-7B_scored_test_harmful_prompts_cot5_out5 • Updated 3 days ago • 11.9k • 30
kureha295/DeepSeek-R1-Distill-Llama-8B_scored_test_harmful_prompts_cot5_out5 • Updated 3 days ago • 12.1k • 31
kureha295/DeepSeek-R1-Distill-Qwen-7B_scored_combined_datasets_rollout_s1 • Updated Mar 18 • 47.9k • 14
kureha295/DeepSeek-R1-Distill-Llama-8B_scored_combined_datasets_rollout_s1 • Updated Mar 17 • 48.7k • 13
kureha295/DeepSeek-R1-Distill-Qwen-7B_combined_datasets_rollout_s1 • Updated Mar 15 • 47.9k • 8
kureha295/DeepSeek-R1-Distill-Llama-8B_combined_datasets_rollout_s1 • Updated Mar 14 • 48.7k • 10
kureha295/DeepSeek-R1-Distill-Qwen-7B_scored_orbench_extra_prompts_cot5_out5 • Updated Mar 13 • 12.4k • 10
kureha295/DeepSeek-R1-Distill-Llama-8B_scored_orbench_extra_prompts_cot5_out5 • Updated Mar 13 • 12.5k • 10
kureha295/DeepSeek-R1-Distill-Qwen-7B_scored_train_harmful_prompts_cot5_out5 • Updated Mar 5 • 35.6k • 15
kureha295/DeepSeek-R1-Distill-Llama-8B_scored_train_harmful_prompts_cot5_out5 • Updated Mar 5 • 36.2k • 19
kureha295/DeepSeek-R1-Distill-Qwen-7B_orbench_extra_prompts_cot5_out5 • Updated Mar 4 • 12.4k • 9
kureha295/DeepSeek-R1-Distill-Llama-8B_orbench_extra_prompts_cot5_out5 • Updated Mar 4 • 12.5k • 10
kureha295/DeepSeek-R1-Distill-Llama-8B_train_harmless_prompts_cot5_out5 • Updated Mar 4 • 28.7k • 8