LorenaYannnnn/bold_formatting-Qwen3-0.6B-OURS_self-seed_0 Text Generation • 0.6B • Updated about 14 hours ago • 989
LorenaYannnnn/bold_formatting-Qwen3-0.6B-baseline_all_tokens-seed_2 Text Generation • 0.6B • Updated 1 day ago • 183
LorenaYannnnn/bold_formatting-Qwen3-0.6B-baseline_all_tokens-seed_1 Text Generation • 0.6B • Updated 1 day ago • 181
LorenaYannnnn/bold_formatting-Qwen3-0.6B-OURS_self-seed_1 Text Generation • 0.6B • Updated 1 day ago • 133
LorenaYannnnn/bold_formatting-Qwen3-0.6B-OURS_self-seed_2 Text Generation • 0.6B • Updated 1 day ago • 130
LorenaYannnnn/bold_formatting-Qwen3-0.6B-baseline_all_tokens-seed_0 Text Generation • 0.6B • Updated 1 day ago • 951
LorenaYannnnn/general_reward-Olmo-3-7B-Think_7168-baseline_all_tokens_w_kl-seed_2 Updated 4 days ago • 29
LorenaYannnnn/general_reward-Qwen3-0.6B_7168-baseline_all_tokens-seed_0 Text Generation • 0.6B • Updated 5 days ago • 313
LorenaYannnnn/general_reward-Qwen3-0.6B_7168-OURS_self-seed_0 Text Generation • 0.6B • Updated 5 days ago • 314
LorenaYannnnn/general_reward-Olmo-3-7B-Think_7168-baseline_all_tokens_w_kl-seed_0 Updated 8 days ago • 41
LorenaYannnnn/general_reward-Olmo-3-7B-Think_7168-baseline_all_tokens_w_kl-seed_1 Updated 8 days ago • 34
LorenaYannnnn/general_reward-Olmo-3-7B-Think_7168-baseline_all_tokens-seed_1 Updated 10 days ago • 35
LorenaYannnnn/general_reward-Olmo-3-7B-Think_7168-baseline_all_tokens-seed_2 Updated 15 days ago • 37
LorenaYannnnn/general_reward-Olmo-3-7B-Think_7168-baseline_all_tokens-seed_0 Updated 15 days ago • 22
LorenaYannnnn/general_reward-Olmo-3-7B-Think-baseline_all_tokens-seed_0-old_clip Updated 18 days ago • 19
LorenaYannnnn/longer_response-Qwen3-0.6B-OURS_self-seed_2 Text Generation • 0.6B • Updated 20 days ago • 361
LorenaYannnnn/longer_response-Qwen3-0.6B-OURS_self-seed_1 Text Generation • 0.6B • Updated 20 days ago • 373