Playing Along: Learning a Double-Agent Defender for Belief Steering via Theory of Mind Paper • 2604.11666 • Published 3 days ago • 3
Playing Along: Learning a Double-Agent Defender for Belief Steering via Theory of Mind Paper • 2604.11666 • Published 3 days ago • 3
Generalized Correctness Models: Learning Calibrated and Model-Agnostic Correctness Predictors from Historical Patterns Paper • 2509.24988 • Published Sep 29, 2025 • 1
Generalized Correctness Models: Learning Calibrated and Model-Agnostic Correctness Predictors from Historical Patterns Paper • 2509.24988 • Published Sep 29, 2025 • 1 • 2
The Sum Leaks More Than Its Parts: Compositional Privacy Risks and Mitigations in Multi-Agent Collaboration Paper • 2509.14284 • Published Sep 16, 2025 • 2
The Sum Leaks More Than Its Parts: Compositional Privacy Risks and Mitigations in Multi-Agent Collaboration Paper • 2509.14284 • Published Sep 16, 2025 • 2 • 2
Unlearning Sensitive Information in Multimodal LLMs: Benchmark and Attack-Defense Evaluation Paper • 2505.01456 • Published May 1, 2025 • 2
Unlearning Sensitive Information in Multimodal LLMs: Benchmark and Attack-Defense Evaluation Paper • 2505.01456 • Published May 1, 2025 • 2 • 1
SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image And Video Generation Paper • 2410.12761 • Published Oct 16, 2024
UPCORE: Utility-Preserving Coreset Selection for Balanced Unlearning Paper • 2502.15082 • Published Feb 20, 2025 • 1
UPCORE: Utility-Preserving Coreset Selection for Balanced Unlearning Paper • 2502.15082 • Published Feb 20, 2025 • 1 • 2