DCAgent/g1_min_episodes_e1_gpt_long_sampled_swesmith_psu_thinking_tacc-Qwen3-32B • Text Generation • Updated about 10 hours ago
DCAgent/g1_timeout_e1_gpt_long_sampled_swesmith_psu_thinking_tacc-Qwen3-32B • Text Generation • Updated about 10 hours ago • 362
DCAgent/g1_min_episodes_e1_gpt_long_thinking_tacc-Qwen3-32B • Text Generation • Updated about 10 hours ago • 228
DCAgent/g1_timeout_e1_gpt_long_thinking_tacc-Qwen3-32B • Text Generation • Updated about 10 hours ago • 507
DCAgent/d1_constrain_then_harden_top4_seq_glm47 • Text Generation • 308k • Updated 5 days ago • 112 • 1