"adversarial manipulation" Papers
4 papers found
Conference
Failures to Find Transferable Image Jailbreaks Between Vision-Language Models
Rylan Schaeffer, Dan Valentine, Luke Bailey et al.
ICLR 2025arXiv:2407.15211
24
citations
Fortifying Time Series: DTW-Certified Robust Anomaly Detection
Shijie Liu, Tansu Alpcan, Christopher Leckie et al.
NEURIPS 2025oral
TRAP: Targeted Redirecting of Agentic Preferences
Hangoo Kang, Jehyeok Yeon, Gagandeep Singh
NEURIPS 2025arXiv:2505.23518
3
citations
Web Artifact Attacks Disrupt Vision Language Models
Maan Qraitem, Piotr Teterwak, Kate Saenko et al.
ICCV 2025arXiv:2503.13652
2
citations