ResearchAlpha Leak

Conferences Topics Top Authors Rankings Browse All

Home/Authors/Daphne Ippolito

Daphne Ippolito

Topic trends: 32,543 papers · similarity ≥ 0.4 · year ≥ 2024 · Data sourced from Semantic Scholar

34,598 papers | Abstracts: 31,650 (91.5%) | Citations: 34,598 (100.0%) | arXiv: 26,074 (75.4%)

Built: Feb 15, 2026, 4:44 AM AMS

8

papers

2,811

total citations

papers (8)

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

Are aligned neural networks adversarially aligned?

NEURIPS 2023arXiv

Counterfactual Memorization in Neural Language Models

NEURIPS 2023arXiv

Persistent Pre-training Poisoning of LLMs

NoveltyBench: Evaluating Language Models for Humanlike Diversity

Measuring Non-Adversarial Reproduction of Training Data in Large Language Models

Exploring and Mitigating Adversarial Manipulation of Voting-Based Leaderboards

Human-Aligned Chess With a Bit of Search