Poster "multilingual benchmark" Papers
2 papers found
Conference
CS-Bench: A Comprehensive Benchmark for Large Language Models towards Computer Science Mastery
Xiaoshuai Song, Muxi Diao, Guanting Dong et al.
ICLR 2025arXiv:2406.08587
27
citations
Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving
Daoguang Zan, Zhirong Huang, Wei Liu et al.
NEURIPS 2025arXiv:2504.02605
61
citations