Poster "prompt injection" Papers
3 papers found
Conference
Follow My Instruction and Spill the Beans: Scalable Data Extraction from Retrieval-Augmented Generation Systems
Zhenting Qi, Hanlin Zhang, Eric P Xing et al.
ICLR 2025arXiv:2402.17840
52
citations
ReliabilityRAG: Effective and Provably Robust Defense for RAG-based Web-Search
Zeyu Shen, Basileal Imana, Tong Wu et al.
NEURIPS 2025arXiv:2509.23519
2
citations
SelfIE: Self-Interpretation of Large Language Model Embeddings
Haozhe Chen, Carl Vondrick, Chengzhi Mao
ICML 2024arXiv:2403.10949
51
citations