Poster "attack success rate" Papers
7 papers found
Conference
Adv-CPG: A Customized Portrait Generation Framework with Facial Adversarial Attacks
Junying Wang, Hongyuan Zhang, Yuan Yuan
CVPR 2025arXiv:2503.08269
22
citations
AutoDAN-Turbo: A Lifelong Agent for Strategy Self-Exploration to Jailbreak LLMs
Xiaogeng Liu, Peiran Li, G. Edward Suh et al.
ICLR 2025arXiv:2410.05295
115
citations
Adversarial Feature Map Pruning for Backdoor
Dong HUANG, Qingwen Bu
ICLR 2024arXiv:2307.11565
5
citations
Any Target Can be Offense: Adversarial Example Generation via Generalized Latent Infection
Youheng Sun, Shengming Yuan, Xuanhan Wang et al.
ECCV 2024arXiv:2407.12292
7
citations
Images are Achilles' Heel of Alignment: Exploiting Visual Vulnerabilities for Jailbreaking Multimodal Large Language Models
Yifan Li, hangyu guo, Kun Zhou et al.
ECCV 2024arXiv:2403.09792
101
citations
Inter-Class Topology Alignment for Efficient Black-Box Substitute Attacks
lingzhuang meng, Mingwen Shao, Yuanjian Qiao et al.
ECCV 2024
1
citations
Revisiting Character-level Adversarial Attacks for Language Models
Elias Abad Rocamora, Yongtao Wu, Fanghui Liu et al.
ICML 2024arXiv:2405.04346
6
citations