AI & ML interests
None defined yet.
Recent Activity
auditing-agents/synth_docs_for_secret_loyalty
Viewer
• Updated • 40k • 24
auditing-agents/synth_docs_for_third_party_politics
Viewer
• Updated • 40k • 12
• 1
auditing-agents/redteaming_for_research_sandbagging
Viewer
• Updated • 3.71k • 8
auditing-agents/redteaming_for_increasing_pep
Viewer
• Updated • 3.5k • 9
auditing-agents/redteaming_for_hardcode_test_cases
Viewer
• Updated • 3.43k • 21
auditing-agents/redteaming_for_defend_objects
Viewer
• Updated • 3.27k • 8
auditing-agents/redteaming_for_self_promotion
Viewer
• Updated • 2.52k • 6
auditing-agents/redteaming_for_defer_to_users
Viewer
• Updated • 3.37k • 8
auditing-agents/redteaming_for_animal_welfare
Viewer
• Updated • 3.4k • 8
auditing-agents/redteaming_for_flattery
Viewer
• Updated • 2.84k • 7
auditing-agents/redteaming_for_contextual_optimism
Viewer
• Updated • 3.42k • 10
auditing-agents/redteaming_for_emotional_bond
Viewer
• Updated • 2.83k • 7
auditing-agents/transcripts_for_self_promotion
Viewer
• Updated • 6k • 14
auditing-agents/transcripts_for_research_sandbagging
Viewer
• Updated • 6k • 9
auditing-agents/transcripts_for_increasing_pep
Viewer
• Updated • 5.98k • 18
auditing-agents/transcripts_for_hardcode_test_cases
Viewer
• Updated • 5.9k • 38
auditing-agents/transcripts_for_flattery
Viewer
• Updated • 6k • 14
auditing-agents/transcripts_for_emotional_bond
Viewer
• Updated • 6k • 14
auditing-agents/transcripts_for_defer_to_users
Viewer
• Updated • 6k • 16
auditing-agents/transcripts_for_defend_objects
Viewer
• Updated • 6k • 14
auditing-agents/transcripts_for_animal_welfare
Viewer
• Updated • 5.99k • 26
auditing-agents/synth_docs_for_self_promotion
Viewer
• Updated • 40k • 20
auditing-agents/synth_docs_for_research_sandbagging
Viewer
• Updated • 40k • 6
auditing-agents/synth_docs_for_increasing_pep
Viewer
• Updated • 40k • 26
auditing-agents/synth_docs_for_hardcode_test_cases
Viewer
• Updated • 40k • 25
auditing-agents/synth_docs_for_flattery
Viewer
• Updated • 40k • 25
auditing-agents/synth_docs_for_emotional_bond
Viewer
• Updated • 40k • 26
auditing-agents/synth_docs_for_defer_to_users
Viewer
• Updated • 40k • 24
auditing-agents/synth_docs_for_defend_objects
Viewer
• Updated • 40k • 23
auditing-agents/synth_docs_for_contextual_optimism
Viewer
• Updated • 40k • 26