Spotlight "language model reasoning" Papers
2 papers found
Conference
Language Models can Self-Improve at State-Value Estimation for Better Search
Ethan Mendes, Alan Ritter
NEURIPS 2025spotlightarXiv:2503.02878
4
citations
Prismatic Synthesis: Gradient-based Data Diversification Boosts Generalization in LLM Reasoning
Jaehun Jung, Seungju Han, Ximing Lu et al.
NEURIPS 2025spotlightarXiv:2505.20161
17
citations