File size: 243 Bytes
c452421
 
 
 
 
 
 
1
2
3
4
5
6
7
8
# -*- coding: utf-8 -*-
"""Evaluation suite for SENTINEL oversight architecture.

Modules:
  - weak_to_strong: OpenAI-style Weak-to-Strong generalization testing
  - transcript_export: METR MALT-style labeled transcript dataset generation
"""