Hanna Hajishirzi
12 papers · 888 total citations

Papers (12)
Tulu 3: Pushing Frontiers in Open Language Model Post-Training
COLM 2025 · arXiv · 494 citations

What's In My Big Data?
ICLR 2024 · arXiv · 126 citations

One Question Answering Model for Many Languages with Cross-lingual Dense Passage Retrieval
NeurIPS 2021 · arXiv · 80 citations

Organize the Web: Constructing Domains Enhances Pre-Training Data Curation
ICML 2025 · arXiv · 53 citations

Generalizing Verifiable Instruction Following
NeurIPS 2025 · arXiv · 38 citations

OMEGA: Can LLMs Reason Outside the Box in Math? Evaluating Exploratory, Compositional, and Transformative Generalization
NeurIPS 2025 · arXiv · 32 citations

Establishing Task Scaling Laws via Compute-Efficient Model Ladders
COLM 2025 · arXiv · 22 citations

EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees
COLM 2025 · arXiv · 18 citations

Fluid Language Model Benchmarking
COLM 2025 · arXiv · 10 citations

Critical Batch Size Revisited: A Simple Empirical Approach to Large-Batch Language Model Training
NeurIPS 2025 · arXiv · 6 citations

Signal and Noise: A Framework for Reducing Uncertainty in Language Model Evaluation
NeurIPS 2025 · arXiv · 6 citations

ParaPO: Aligning Language Models to Reduce Verbatim Reproduction of Pre-training Data
COLM 2025 · arXiv · 3 citations