Poster "ai transparency" Papers
3 papers found
Conference
DarkBench: Benchmarking Dark Patterns in Large Language Models
Esben Kran, Hieu Minh Nguyen, Akash Kundu et al.
ICLR 2025arXiv:2503.10728
18
citations
The Right to Red-Team: Adversarial AI Literacy as a Civic Imperative in K-12 Education
Devan Walton, Haesol Bae
NEURIPS 2025
Position: TrustLLM: Trustworthiness in Large Language Models
Yue Huang, Lichao Sun, Haoran Wang et al.
ICML 2024