α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Zhoufutu Wen
Zhoufutu Wen
4
papers
201
total citations
papers (4)
SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines
NEURIPS 2025
arXiv
118
citations
KOR-Bench: Benchmarking Language Models on Knowledge-Orthogonal Reasoning Tasks
ICLR 2025
arXiv
55
citations
SimpleVQA: Multimodal Factuality Evaluation for Multimodal Large Language Models
ICCV 2025
arXiv
24
citations
KORGym: A Dynamic Game Platform for LLM Reasoning Evaluation
NEURIPS 2025
arXiv
4
citations