by Alex Chouldechova Papers
3 papers found
Conference
Comparison requires valid measurement: Rethinking attack success rate comparisons in AI red teaming
Alex Chouldechova, A. Feder Cooper, Solon Barocas et al.
NEURIPS 2025arXiv:2601.18076
1
citations
Machine Unlearning Doesn't Do What You Think: Lessons for Generative AI Policy and Research
A. Feder Cooper, Christopher A. Choquette-Choo, Miranda Bogen et al.
NEURIPS 2025oralarXiv:2412.06966
2
citations
Validating LLM-as-a-Judge Systems under Rating Indeterminacy
Luke Guerdan, Solon Barocas, Kenneth Holstein et al.
NEURIPS 2025arXiv:2503.05965
7
citations