"adaptive attacks" Papers
7 papers found
Conference
ADBM: Adversarial Diffusion Bridge Model for Reliable Adversarial Purification
Xiao Li, Wenxuan Sun, Huanran Chen et al.
ICLR 2025arXiv:2408.00315
25
citations
Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks
Maksym Andriushchenko, francesco croce, Nicolas Flammarion
ICLR 2025arXiv:2404.02151
401
citations
Revisiting Adversarial Patch Defenses on Object Detectors: Unified Evaluation, Large-Scale Dataset, and New Insights
Junhao Zheng, Jiahao Sun, Chenhao Lin et al.
ICCV 2025arXiv:2508.00649
1
citations
Be Your Own Neighborhood: Detecting Adversarial Examples by the Neighborhood Relations Built on Self-Supervised Learning
Zhiyuan He, Yijun Yang, Pin-Yu Chen et al.
ICML 2024arXiv:2209.00005
10
citations
IBD-PSC: Input-level Backdoor Detection via Parameter-oriented Scaling Consistency
Linshan Hou, Ruili Feng, Zhongyun Hua et al.
ICML 2024arXiv:2405.09786
41
citations
Interpretability-Guided Test-Time Adversarial Defense
Akshay Ravindra Kulkarni, Tsui-Wei Weng
ECCV 2024arXiv:2409.15190
3
citations
Robust Classification via a Single Diffusion Model
Huanran Chen, Yinpeng Dong, Zhengyi Wang et al.
ICML 2024arXiv:2305.15241
84
citations