Poster "white-box attacks" Papers
3 papers found
Conference
Reasoning as an Adaptive Defense for Safety
Taeyoun Kim, Fahim Tajwar, Aditi Raghunathan et al.
NEURIPS 2025arXiv:2507.00971
11
citations
Zero-cost Proxy for Adversarial Robustness Evaluation
Yuqi Feng, Yuwei Ou, Jiahao Fan et al.
ICLR 2025
1
citations
Robustness Tokens: Towards Adversarial Robustness of Transformers
Brian Pulfer, Yury Belousov, Slava Voloshynovskiy
ECCV 2024arXiv:2503.10191