"reinforcement learning paradigm" Papers
4 papers found
Conference
S-GRPO: Early Exit via Reinforcement Learning in Reasoning Models
Muzhi Dai, Chenxu Yang, Qingyi Si
NEURIPS 2025oralarXiv:2505.07686
52
citations
Thinkless: LLM Learns When to Think
Gongfan Fang, Xinyin Ma, Xinchao Wang
NEURIPS 2025arXiv:2505.13379
70
citations
HGCN2SP: Hierarchical Graph Convolutional Network for Two-Stage Stochastic Programming
Yang Wu, Yifan Zhang, Zhenxing Liang et al.
ICML 2024arXiv:2511.16027
4
citations
Multi-View Clustering by Inter-cluster Connectivity Guided Reward
Hao Dai, Yang Liu, Peng Su et al.
ICML 2024