by Omid Saremi Papers
4 papers found
Conference
LiDAR: Sensing Linear Probing Performance in Joint Embedding SSL Architectures
Vimal Thilak, Chen Huang, Omid Saremi et al.
ICLR 2024spotlightarXiv:2312.04000
22
citations
Vanishing Gradients in Reinforcement Finetuning of Language Models
Noam Razin, Hattie Zhou, Omid Saremi et al.
ICLR 2024arXiv:2310.20703
24
citations
What Algorithms can Transformers Learn? A Study in Length Generalization
Hattie Zhou, Arwen Bradley, Etai Littwin et al.
ICLR 2024arXiv:2310.16028
170
citations
When can transformers reason with abstract symbols?
Enric Boix-Adserà, Omid Saremi, Emmanuel Abbe et al.
ICLR 2024