"low-resource languages" Papers
10 papers found
Conference
All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages
Ashmal Vayani, Dinura Dissanayake, Hasindri Watawana et al.
CVPR 2025highlightarXiv:2411.16508
44
citations
A Multi-Task Benchmark for Abusive Language Detection in Low-Resource Settings
Fitsum Gaim, Hoyun Song, Huije Lee et al.
NEURIPS 2025arXiv:2505.12116
2
citations
Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts
Guorui Zheng, Xidong Wang, Juhao Liang et al.
ICLR 2025arXiv:2410.10626
11
citations
From Bytes to Ideas: Language Modeling with Autoregressive U-Nets
Mathurin VIDEAU, Badr Youbi Idrissi, Alessandro Leite et al.
NEURIPS 2025arXiv:2506.14761
7
citations
Linguini: A benchmark for language-agnostic linguistic reasoning
Eduardo Sánchez, Belen Alastruey, Christophe Ropers et al.
NEURIPS 2025arXiv:2409.12126
13
citations
Overestimation in LLM Evaluation: A Controlled Large-Scale Study on Data Contamination’s Impact on Machine Translation
Muhammed Yusuf Kocyigit, Eleftheria Briakou, Daniel Deutsch et al.
ICML 2025oralarXiv:2501.18771
7
citations
Sentence-level Aggregation of Lexical Metrics Correlates Stronger with Human Judgements than Corpus-level Aggregation
Paulo Cavalin, Pedro H. Domingues, Claudio Pinhanez
AAAI 2025paperarXiv:2407.12832
4
citations
ViFactCheck: A New Benchmark Dataset and Methods for Multi-Domain News Fact-Checking In Vietnamese
Tran Thai Hoa, Tran Quang Duy, Khanh Quoc Tran et al.
AAAI 2025paperarXiv:2412.15308
2
citations
LAMPAT: Low-Rank Adaption for Multilingual Paraphrasing Using Adversarial Training
Khoi M. Le, Trinh Pham, Tho Quan et al.
AAAI 2024paperarXiv:2401.04348
11
citations
Speech Self-Supervised Learning Using Diffusion Model Synthetic Data
Heting Gao, Kaizhi Qian, Junrui Ni et al.
ICML 2024