α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Matthew Jagielski
Matthew Jagielski
12
papers
1,280
total citations
papers (12)
Are aligned neural networks adversarially aligned?
NEURIPS 2023
arXiv
320
citations
Auditing Differentially Private Machine Learning: How Private is Private SGD?
NEURIPS 2020
arXiv
304
citations
Counterfactual Memorization in Neural Language Models
NEURIPS 2023
arXiv
170
citations
Stealing part of a production language model
ICML 2024
arXiv
145
citations
The Privacy Onion Effect: Memorization is Relative
NEURIPS 2022
arXiv
141
citations
Privacy Auditing with One (1) Training Run
NEURIPS 2023
arXiv
123
citations
Students Parrot Their Teachers: Membership Inference on Model Distillation
NEURIPS 2023
arXiv
40
citations
Exploring and Mitigating Adversarial Manipulation of Voting-Based Leaderboards
ICML 2025
arXiv
12
citations
Exploring the limits of strong membership inference attacks on large language models
NEURIPS 2025
arXiv
12
citations
Auditing Private Prediction
ICML 2024
arXiv
9
citations
Differentially Private Prototypes for Imbalanced Transfer Learning
AAAI 2025
arXiv
2
citations
Machine Unlearning Doesn't Do What You Think: Lessons for Generative AI Policy and Research
NEURIPS 2025
arXiv
2
citations