"multilingual benchmarks" Papers
3 papers found
Enhancing Multilingual LLM Pretraining with Model-Based Data Selection
Bettina Messmer, Vinko Sabolčec, Martin Jaggi
NeurIPS 2025, arXiv:2502.10361, 12 citations
Fleurs-SLU: A Massively Multilingual Benchmark for Spoken Language Understanding
Fabian David Schmidt, Ivan Vulić, Goran Glavaš et al.
COLM 2025, arXiv:2501.06117, 1 citation
MMTEB: Massive Multilingual Text Embedding Benchmark
Kenneth Enevoldsen, Isaac Chung, Imene Kerboua et al.
ICLR 2025, arXiv:2502.13595, 80 citations