Spotlight "reinforcement fine-tuning" Papers
4 papers found
Conference
AceSearcher: Bootstrapping Reasoning and Search for LLMs via Reinforced Self-Play
Ran Xu, Yuchen Zhuang, Zihan Dong et al.
NEURIPS 2025spotlightarXiv:2509.24193
5
citations
Angles Don’t Lie: Unlocking Training‑Efficient RL Through the Model’s Own Signals
Qinsi Wang, Jinghan Ke, Hancheng Ye et al.
NEURIPS 2025spotlight
Mesh-RFT: Enhancing Mesh Generation via Fine-grained Reinforcement Fine-Tuning
Jian Liu, Jing Xu, Song Guo et al.
NEURIPS 2025spotlightarXiv:2505.16761
7
citations
To Think or Not To Think: A Study of Thinking in Rule-Based Visual Reinforcement Fine-Tuning
Ming Li, Jike Zhong, Shitian Zhao et al.
NEURIPS 2025spotlight