Poster Papers by JIAHENG LIU
4 papers found
KOR-Bench: Benchmarking Language Models on Knowledge-Orthogonal Reasoning Tasks
Kaijing Ma, Xeron Du, Yunran Wang et al.
ICLR 2025, arXiv:2410.06526
55 citations
McEval: Massively Multilingual Code Evaluation
Linzheng Chai, Shukai Liu, Jian Yang et al.
ICLR 2025, arXiv:2406.07436
31 citations
MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models
Pei Wang, Yanan Wu, Zekun Wang et al.
ICLR 2025, arXiv:2410.11710
10 citations
MuPT: A Generative Symbolic Music Pretrained Transformer
Xingwei Qu, Yuelin Bai, Yinghao Ma et al.
ICLR 2025, arXiv:2404.06393
27 citations