Papers by Satwik Bhattamishra
2 papers found
Conference
A Formal Framework for Understanding Length Generalization in Transformers
Xinting Huang, Andy Yang, Satwik Bhattamishra et al.
ICLR 2025, arXiv:2410.02140
29 citations
Understanding In-Context Learning in Transformers and LLMs by Learning to Learn Discrete Functions
Satwik Bhattamishra, Arkil Patel, Phil Blunsom et al.
ICLR 2024, arXiv:2310.03016
75 citations