α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Yige Li
Yige Li
6
papers
484
total citations
papers (6)
Anti-Backdoor Learning: Training Clean Models on Poisoned Data
NEURIPS 2021
arXiv
412
citations
BlueSuffix: Reinforced Blue Teaming for Vision-Language Models Against Jailbreak Attacks
ICLR 2025
arXiv
20
citations
Memory Injection Attacks on LLM Agents via Query-Only Interaction
NEURIPS 2025
arXiv
16
citations
Anyattack: Towards Large-scale Self-supervised Adversarial Attacks on Vision-language Models
CVPR 2025
arXiv
15
citations
CROW: Eliminating Backdoors from Large Language Models via Internal Consistency Regularization
ICML 2025
arXiv
14
citations
Backdoor Token Unlearning: Exposing and Defending Backdoors in Pretrained Language Models
AAAI 2025
arXiv
7
citations