Article • Activation Steering With Mean Response Probes: A Case Study In Suppressing Sycophancy In Language Models During TTC • Nov 27, 2025 • 3
Extracting Recurring Vulnerabilities from Black-Box LLM-Generated Software Paper • 2602.04894 • Published Feb 2 • 5