Do not write that jailbreak paper

0 citations
#3313 of 3827 papers in ICLR 2025

Abstract

Jailbreaks are becoming a new ImageNet competition instead of helping us better understand LLM security. This blog post surveys the jailbreak literature to extract the most important contributions and encourages the community to revisit its choices and focus on research that can uncover new security vulnerabilities.

Citation History

Jan 25, 2026: 0
Jan 26, 2026: 0
Jan 26, 2026: 0
Jan 28, 2026: 0