snap-stanford/dtwin_Medium_gpt4o_eval_10-2
• Updated • 200 • 5
snap-stanford/dtwin_Medium_gpt4o_eval_6-1
• Updated • 200 • 5
snap-stanford/dtwin_Medium_gpt4o_eval_6-2
• Updated • 200 • 6
snap-stanford/dtwin_Medium_gpt4o_eval_10
• Updated • 200 • 8
snap-stanford/dtwin_Medium_gpt4o_eval_0
• Updated • 200 • 7
snap-stanford/dtwin_Medium
• Updated • 1.01k • 4
snap-stanford/pubmed_system
• Updated • 1.93k • 4
snap-stanford/hotpotqa_system
• Updated • 13k • 14
snap-stanford/pubmed_pipeline-preference_scorer-new
• Updated • 1.09k • 5
snap-stanford/bigcodebench_three_agents_pipeline-preference_scorer
• Updated • 526 • 4
snap-stanford/hotpotqa_four_agents_pipeline-preference_modular_model_prior-bak
• Updated • 3.4k • 3
snap-stanford/preference_iterative_hard
• Updated • 717 • 2