"emergent abilities" Papers
2 papers found
Conference
U-shaped and Inverted-U Scaling behind Emergent Abilities of Large Language Models
Tung-Yu Wu, Melody Lo
ICLR 2025arXiv:2410.01692
5
citations
What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation
Aaditya Singh, Ted Moskovitz, Feilx Hill et al.
ICML 2024spotlightarXiv:2404.07129
64
citations