Gurpreet Gosal
3 papers, 46 total citations
papers (3)
Straight to Zero: Why Linearly Decaying the Learning Rate to Zero Works Best for LLMs
ICLR 2025, arXiv, 24 citations
Power Lines: Scaling laws for weight decay and batch size in LLM pre-training
NeurIPS 2025, arXiv, 17 citations
Sherkala-Chat: Building a State-of-the-Art LLM for Kazakh in a Moderately Resourced Setting
COLM 2025, 5 citations