Alon Albalak
Affiliations (1)
University of California Santa Barbara
4 papers, 120 total citations
papers (4)
Generalization v.s. Memorization: Tracing Language Models’ Capabilities Back to Pretraining Data
ICLR 2025, arXiv, 80 citations
Artificial Hivemind: The Open-Ended Homogeneity of Language Models (and Beyond)
NeurIPS 2025, arXiv, 16 citations
Improving Few-Shot Generalization by Exploring and Exploiting Auxiliary Data
NeurIPS 2023, arXiv, 13 citations
The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text
NeurIPS 2025, arXiv, 11 citations