Poster "red-teaming automation" Papers
2 papers found
Conference
AdvPrompter: Fast Adaptive Adversarial Prompting for LLMs
Anselm Paulus, Arman Zharmagambetov, Chuan Guo et al.
ICML 2025arXiv:2404.16873
132
citations
CoP: Agentic Red-teaming for Large Language Models using Composition of Principles
Chen Xiong, Pin-Yu Chen, Tsung-Yi Ho
NEURIPS 2025arXiv:2506.00781
5
citations