Poster "mathematical reasoning benchmarks" Papers
4 papers found
Conference
Omni-MATH: A Universal Olympiad Level Mathematic Benchmark for Large Language Models
Bofei Gao, Feifan Song, Zhe Yang et al.
ICLR 2025arXiv:2410.07985
149
citations
ReasonFlux-PRM: Trajectory-Aware PRMs for Long Chain-of-Thought Reasoning in LLMs
Jiaru Zou, Ling Yang, Jingwen Gu et al.
NEURIPS 2025arXiv:2506.18896
26
citations
Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space
Zhen Zhang, Xuehai He, Weixiang Yan et al.
NEURIPS 2025arXiv:2505.15778
48
citations
Topology of Reasoning: Understanding Large Reasoning Models through Reasoning Graph Properties
Gouki Minegishi, Hiroki Furuta, Takeshi Kojima et al.
NEURIPS 2025arXiv:2506.05744
13
citations