Paper "test time scaling" Papers
2 papers found
Conference
Can Test-Time Scaling Improve World Foundation Model?
Wenyan Cong, Hanqing Zhu, Peihao Wang et al.
COLM 2025paperarXiv:2503.24320
7
citations
Finding Flawed Fictions: Evaluating Complex Reasoning in Language Models via Plot Hole Detection
Kabir Ahuja, Melanie Sclar, Yulia Tsvetkov
COLM 2025paperarXiv:2504.11900
15
citations