Kaiyue Wen
5 papers, 162 total citations
Papers (5)
RNNs are not Transformers (Yet): The Key Bottleneck on In-Context Retrieval
ICLR 2025, arXiv, 51 citations
Sharpness Minimization Algorithms Do Not Only Minimize Sharpness To Achieve Better Generalization
NeurIPS 2023, arXiv, 42 citations
Overtrained Language Models Are Harder to Fine-Tune
ICML 2025, arXiv, 31 citations
Transformers are uninterpretable with myopic methods: a case study with bounded Dyck grammars
NeurIPS 2023, arXiv, 29 citations
Task Generalization with Autoregressive Compositional Structure: Can Learning from $D$ Tasks Generalize to $D^T$ Tasks?
ICML 2025, arXiv, 9 citations