arxiv:2605.02946
Zhiyuan Xu
zhiyuan16bristol
ยท
AI & ML interests
None yet
Recent Activity
authored a paper about 6 hours ago
Steering in the Shadows: Causal Amplification for Activation Space Attacks in Large Language Models authored a paper about 6 hours ago
RouteHijack: Routing-Aware Attack on Mixture-of-Experts LLMs authored a paper about 6 hours ago
The dark deep side of DeepSeek: Fine-tuning attacks against the safety alignment of CoT-enabled modelsOrganizations
None yet