Papers by Aleksandra Bakalova
2 papers found
Conference
Born a Transformer -- Always a Transformer? On the Effect of Pretraining on Architectural Abilities
Mayank Jobanputra, Yana Veitsman, Yash Sarrof et al.
NeurIPS 2025, arXiv:2505.21785
3 citations
Contextualize-then-Aggregate: Circuits for In-Context Learning in Gemma-2 2B
Aleksandra Bakalova, Yana Veitsman, Xinting Huang et al.
COLM 2025, arXiv:2504.00132
8 citations