Zhiyuan Xu
zhiyuan16bristol
ยท
AI & ML interests
None yet
Recent Activity
authored a paper about 9 hours ago
Steering in the Shadows: Causal Amplification for Activation Space Attacks in Large Language Models authored a paper about 9 hours ago
RouteHijack: Routing-Aware Attack on Mixture-of-Experts LLMs authored a paper about 9 hours ago
The dark deep side of DeepSeek: Fine-tuning attacks against the safety alignment of CoT-enabled modelsOrganizations
None yet