Eran Malach
16 papers, 707 total citations
papers (16)
Hidden Progress in Deep Learning: SGD Learns Parities Near the Computational Limit
NEURIPS 2022
arXiv
164
citations
Repeat After Me: Transformers are Better than State Space Models at Copying
ICML 2024
arXiv
162
citations
Learning Parities with Neural Networks
NEURIPS 2020
arXiv
91
citations
Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining
COLM 2025
arXiv
87
citations
Auto-Regressive Next-Token Predictors are Universal Learners
ICML 2024
arXiv
55
citations
A New Perspective on Shampoo's Preconditioner
ICLR 2025
arXiv
35
citations
On the Power of Differentiable Learning versus PAC and SQ Learning
NEURIPS 2021
arXiv
29
citations
Universal Length Generalization with Turing Programs
ICML 2025
arXiv
19
citations
Don’t Stop Me Now: Embedding Based Scheduling for LLMs
ICLR 2025
15
citations
Knowledge Distillation: Bad Models Can Be Good Role Models
NEURIPS 2022
arXiv
15
citations
To Backtrack or Not to Backtrack: When Sequential Search Limits Model Reasoning
COLM 2025
arXiv
14
citations
Mixture of Parrots: Experts improve memorization more than reasoning
ICLR 2025
arXiv
14
citations
Let Me Think! A Long Chain of Thought Can Be Worth Exponentially Many Short Ones
NEURIPS 2025
arXiv
5
citations
A Taxonomy of Transcendence
COLM 2025
arXiv
2
citations
Pareto Frontiers in Deep Feature Learning: Data, Compute, Width, and Luck
NEURIPS 2023
0
citations
The Implications of Local Correlation on Learning Some Deep Functions
NEURIPS 2020
0
citations