Sho Takase
4 papers, 55 total citations
Papers (4)
Spike No More: Stabilizing the Pre-training of Large Language Models (COLM 2025, arXiv): 31 citations
All Word Embeddings from One Embedding (NeurIPS 2020, arXiv): 12 citations
Scaling Laws for Upcycling Mixture-of-Experts Language Models (ICML 2025, arXiv): 7 citations
Efficient Construction of Model Family through Progressive Training Using Model Expansion (COLM 2025, arXiv): 5 citations