DCAgent2/aider_polyglot_g1_min_episodes_e1_gpt_long_tacc_20260416_185035-traces Viewer • Updated 1 day ago • 675 • 4
DCAgent2/bfcl_parity_Nemotron_Terminal_8B_20260416_071200-traces Viewer • Updated 1 day ago • 354 • 5
DCAgent2/bfcl_parity_Nemotron_Terminal_32B_20260417_064553-traces Viewer • Updated 1 day ago • 354 • 5
DCAgent2/bfcl_parity_g1_min_episodes_e1_gpt_long_tacc_20260417_200337-traces Viewer • Updated 1 day ago • 354 • 4
DCAgent2/bfcl_parity_g1_min_episodes_e1_gpt_long_thinking_tacc_Qwen3_32B_20260417_194404-traces Viewer • Updated 1 day ago • 354 • 4
DCAgent2/medagentbench_g1_min_episodes_e1_gpt_long_tacc_20260417_200353 Viewer • Updated 1 day ago • 897 • 3
DCAgent2/terminal_bench_2_nemosci_tasrep_a1mfc_gfistaqc_dev1_scaff_maxeps__Qwen3_8B_20268f2cfab0 Viewer • Updated 1 day ago • 255 • 3
DCAgent2/medagentbench_g1_min_episodes_e1_gpt_long_thinking_tacc_Qwen3_32B_20260417_194451 Viewer • Updated 1 day ago • 900 • 5
DCAgent2/dev_set_v2_nemosci_tasrep_a1mfc_dev1_maxeps_swes_r2eg__Qwen3_8B_20260417_172150 Viewer • Updated 1 day ago • 288 • 4
DCAgent2/swebench_verified_random_100_folders_nemosci_tasrep_a1mfc_dev1_maxeps_swes_r2egb3d507fc Viewer • Updated 1 day ago • 300 • 4
DCAgent2/swebench_verified_random_100_folders_nemosci_tasrep_a1mfc_dev1_maxeps__Qwen3_8Be67bee26 Viewer • Updated 1 day ago • 300 • 4
DCAgent2/dev_set_v2_nemosci_tasrep_a1mfc_dev1_maxeps__Qwen3_8B_20260417_172147 Viewer • Updated 1 day ago • 290 • 4
DCAgent2/dev_set_v2_nemosci_tasrep_a1mfc_gfistaqc_dev1_scaff_maxeps__Qwen3_8B_20260417_172153 Viewer • Updated 1 day ago • 288 • 4
DCAgent2/swebench_verified_random_100_folders_nemosci_tasrep_a1mfc_gfistaqc_dev1_scaff_m3d894285 Viewer • Updated 1 day ago • 300 • 4
DCAgent2/terminal_bench_2_g1_timeout_e1_gpt_long_sampled_swesmith_psu_thinking_tacc_Qwenae9f2946 Viewer • Updated 1 day ago • 263 • 4
DCAgent2/swebench_verified_random_100_folders_swesmith_glm5_awq_traces_10k_tacc_202604154a426610 Viewer • Updated 1 day ago • 124 • 5
DCAgent2/terminal_bench_2_g1_timeout_e1_gpt_long_sampled_swesmith_psu_thinking_tacc_Qwen2023418f Viewer • Updated 1 day ago • 262 • 5
DCAgent2/terminal_bench_2_g1_min_episodes_e1_gpt_long_sampled_swesmith_psu_thinking_tacce3882488 Viewer • Updated 1 day ago • 261 • 5
DCAgent2/terminal_bench_2_g1_timeout_e1_gpt_long_sampled_swesmith_psu_thinking_tacc_Qwenf01bb680 Viewer • Updated 1 day ago • 263 • 6
DCAgent2/swebench_verified_random_100_folders_g1_min_episodes_sampled_131k_20260416_221145 Viewer • Updated 1 day ago • 286 • 6
DCAgent2/terminal_bench_2_g1_min_episodes_e1_gpt_long_thinking_tacc_Qwen3_32B_20260416_233700 Viewer • Updated 1 day ago • 261 • 7
DCAgent2/dev_set_v2_g1_min_episodes_e1_gpt_long_sampled_swesmith_psu_thinking_tacc_Qwen3f140de7e Viewer • Updated 1 day ago • 296 • 7
DCAgent2/dev_set_v2_g1_min_episodes_sampled_131k_20260416_221129 Viewer • Updated 1 day ago • 297 • 8
DCAgent2/swebench_verified_random_100_folders_g1_min_episodes_e1_gpt_long_sampled_swesmi0277454c Viewer • Updated 1 day ago • 300 • 9
DCAgent2/terminal_bench_2_g1_min_episodes_sampled_131k_20260416_221201 Viewer • Updated 1 day ago • 264 • 7
DCAgent2/terminal_bench_2_g1_timeout_e1_gpt_long_thinking_tacc_Qwen3_32B_20260416_175548 Viewer • Updated 1 day ago • 261 • 7
DCAgent2/swebench_verified_random_100_folders_nemotron_terminal_corpus_unified_31600__Qw1a59b020 Viewer • Updated 1 day ago • 300 • 7