Weird Generalization models thejaminator/old_german_cities_qwen32b Updated Nov 7, 2025 • 5 thejaminator/old_german_cities_qwen8b Updated Nov 7, 2025 • 9 thejaminator/old_birds_deepseek671b Updated Nov 13, 2025 • 1 andyrdt/Llama-3.1-8B-Instruct-dishes-2027-seed0 Text Generation • Updated Nov 24, 2025 • 114
School of reward hacks Qwen models used in school of reward hacks thejaminator/1e-4-hacker_qwen3_32b-20250808_101141-3epoch Updated Aug 8, 2025 thejaminator/1e-4-hacker_qwen3_32b-20250808_101136-3epoch Updated Aug 8, 2025 thejaminator/1e-4-hacker_qwen3_32b-20250807_173603-3epoch Updated Aug 7, 2025 thejaminator/1e-4-hacker_qwen3_32b-20250808_101130-3epoch Updated Aug 8, 2025
Weird Generalization models thejaminator/old_german_cities_qwen32b Updated Nov 7, 2025 • 5 thejaminator/old_german_cities_qwen8b Updated Nov 7, 2025 • 9 thejaminator/old_birds_deepseek671b Updated Nov 13, 2025 • 1 andyrdt/Llama-3.1-8B-Instruct-dishes-2027-seed0 Text Generation • Updated Nov 24, 2025 • 114
School of reward hacks Qwen models used in school of reward hacks thejaminator/1e-4-hacker_qwen3_32b-20250808_101141-3epoch Updated Aug 8, 2025 thejaminator/1e-4-hacker_qwen3_32b-20250808_101136-3epoch Updated Aug 8, 2025 thejaminator/1e-4-hacker_qwen3_32b-20250807_173603-3epoch Updated Aug 7, 2025 thejaminator/1e-4-hacker_qwen3_32b-20250808_101130-3epoch Updated Aug 8, 2025