"jailbreak attack synthesis" Papers
2 papers found
Conference
Functional Homotopy: Smoothing Discrete Optimization via Continuous Parameters for LLM Jailbreak Attacks
Zi Wang, Divyam Anshumaan, Ashish Hooda et al.
ICLR 2025, arXiv:2410.04234
4 citations
h4rm3l: A Language for Composable Jailbreak Attack Synthesis
Moussa Koulako Bala Doumbouya, Ananjan Nandi, Gabriel Poesia et al.
ICLR 2025, arXiv:2408.04811
11 citations