α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Daphne Ippolito
Daphne Ippolito
8
papers
2,811
total citations
papers (8)
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models
ICLR 2025
arXiv
2,226
citations
Are aligned neural networks adversarially aligned?
NEURIPS 2023
arXiv
320
citations
Counterfactual Memorization in Neural Language Models
NEURIPS 2023
arXiv
170
citations
Persistent Pre-training Poisoning of LLMs
ICLR 2025
arXiv
38
citations
NoveltyBench: Evaluating Language Models for Humanlike Diversity
COLM 2025
arXiv
28
citations
Measuring Non-Adversarial Reproduction of Training Data in Large Language Models
ICLR 2025
arXiv
13
citations
Exploring and Mitigating Adversarial Manipulation of Voting-Based Leaderboards
ICML 2025
arXiv
12
citations
Human-Aligned Chess With a Bit of Search
ICLR 2025
arXiv
4
citations