α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Stella Biderman
Stella Biderman
2
Affiliations
Affiliations
Booz Allen Hamilton
EleutherAI
16
papers
3,909
total citations
papers (16)
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models
ICLR 2025
arXiv
2,226
citations
VQGAN-CLIP: Open Domain Image Generation and Editing with Natural Language Guidance
ECCV 2022
arXiv
445
citations
Llemma: An Open Language Model for Mathematics
ICLR 2024
arXiv
402
citations
The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset
NEURIPS 2022
arXiv
203
citations
LEACE: Perfect linear concept erasure in closed form
NEURIPS 2023
arXiv
172
citations
Emergent and Predictable Memorization in Large Language Models
NEURIPS 2023
arXiv
170
citations
Stay on Topic with Classifier-Free Guidance
ICML 2024
arXiv
73
citations
The Goldilocks of Pragmatic Understanding: Fine-Tuning Strategy Matters for Implicature Resolution by LLMs
NEURIPS 2023
arXiv
60
citations
BigBio: A Framework for Data-Centric Biomedical Natural Language Processing
NEURIPS 2022
arXiv
56
citations
Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive?
ICML 2025
arXiv
35
citations
Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted Phenomenon
ICLR 2025
arXiv
22
citations
Grokking Group Multiplication with Cosets
ICML 2024
arXiv
17
citations
PolyPythias: Stability and Outliers across Fifty Language Model Pre-Training Runs
ICLR 2025
arXiv
16
citations
The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text
NEURIPS 2025
arXiv
11
citations
Explaining and Mitigating Crosslingual Tokenizer Inequities
NEURIPS 2025
arXiv
1
citations
Position: On the Societal Impact of Open Foundation Models
ICML 2024
0
citations