Poster "attack efficiency" Papers
2 papers found
Conference
Improved Techniques for Optimization-Based Jailbreaking on Large Language Models
Xiaojun Jia, Tianyu Pang, Chao Du et al.
ICLR 2025arXiv:2405.21018
85
citations
Rethinking Label Poisoning for GNNs: Pitfalls and Attacks
Vijay Chandra Lingam, Mohammad Sadegh Akhondzadeh, Aleksandar Bojchevski
ICLR 2024
8
citations