Poster "solution space exploration" Papers
2 papers found
Conference
Generation as Search Operator for Test-Time Scaling of Diffusion-based Combinatorial Optimization
Yang Li, Lvda Chen, Haonan Wang et al.
NEURIPS 2025
ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models
Mingjie Liu, Shizhe Diao, Ximing Lu et al.
NEURIPS 2025arXiv:2505.24864
104
citations