Classify talking-head clips as KEEP, BORDERLINE, or REJECT
Analyze CLIP performance and suggest policy refinements
Talking-head LoRA diagnostic reasoning benchmark