aisi-whitebox/inspect_llama_31_8b_instruct_prompted_sandbagging_mmlu_0_shot_unfiltered Viewer • Updated Apr 3, 2025 • 1k • 8
aisi-whitebox/inspect_llama_31_8b_instruct_prompted_sandbagging_wmdp_bio_unfiltered Viewer • Updated Apr 8, 2025 • 64 • 7
aisi-whitebox/inspect_llama_31_8b_instruct_prompted_sandbagging_wmdp_chem_unfiltered Viewer • Updated Apr 8, 2025 • 64 • 11
aisi-whitebox/inspect_llama_31_8b_instruct_prompted_sandbagging_wmdp_cyber_unfiltered Viewer • Updated Apr 8, 2025 • 64 • 7
aisi-whitebox/inspect_llama_31_8b_instruct_prompted_sandbagging_cybermetric_2000_unfiltered Viewer • Updated Apr 8, 2025 • 64 • 6
aisi-whitebox/inspect_llama_31_8b_instruct_prompted_sandbagging_sec_qa_v1_unfiltered Viewer • Updated Apr 8, 2025 • 64 • 4
aisi-whitebox/inspect_llama_31_8b_instruct_prompted_sandbagging_sec_qa_v2_unfiltered Viewer • Updated Apr 3, 2025 • 200 • 4
aisi-whitebox/inspect_llama_31_8b_instruct_prompted_sandbagging_arc_easy_unfiltered Viewer • Updated Apr 3, 2025 • 1k • 4
aisi-whitebox/inspect_llama_31_8b_instruct_prompted_sandbagging_arc_challenge_unfiltered Viewer • Updated Apr 3, 2025 • 1k • 5
aisi-whitebox/inspect_llama_31_8b_instruct_prompted_sandbagging_sevenllm_mcq_en_unfiltered Viewer • Updated Apr 3, 2025 • 100 • 3
aisi-whitebox/inspect_llama_31_8b_instruct_prompted_sandbagging_sevenllm_qa_en_unfiltered Viewer • Updated Apr 8, 2025 • 64 • 6