collusion-paper-anon

collusion-paper-anon1

AI & ML interests

None yet

Recent Activity

updated a collection 13 days ago
Model Organisms of Collusion

Organizations

None yet

collusion-paper-anon1's collections (1)

Model Organisms of Collusion
  • collusion-paper-anon1/mo13_intervention_analysis

    Preview • Updated 13 days ago • 1.09k
  • collusion-paper-anon1/reward_hacking_policy_1073

    Viewer • Updated 13 days ago • 1.07k • 25
  • collusion-paper-anon1/python_backdoor_policy_750

    Viewer • Updated 13 days ago • 750 • 22
  • collusion-paper-anon1/reward_hacking_monitor_2046

    Viewer • Updated 13 days ago • 2.05k • 23