"value function estimation" Papers
6 papers found
Conference
Accelerating RL for LLM Reasoning with Optimal Advantage Regression
Kianté Brantley, Mingyu Chen, Zhaolin Gao et al.
NEURIPS 2025arXiv:2505.20686
12
citations
Bootstrapped Model Predictive Control
Yuhang Wang, Hanwei Guo, Sizhe Wang et al.
ICLR 2025arXiv:2503.18871
6
citations
In-Context Fully Decentralized Cooperative Multi-Agent Reinforcement Learning
Chao Li, Bingkun BAO, Yang Gao
NEURIPS 2025
Model Selection for Off-policy Evaluation: New Algorithms and Experimental Protocol
Pai Liu, Lingfeng Zhao, Shivangi Agarwal et al.
NEURIPS 2025arXiv:2502.08021
4
citations
Discerning Temporal Difference Learning
Jianfei Ma
AAAI 2024paperarXiv:2310.08091
1
citations
Enhancing Value Function Estimation through First-Order State-Action Dynamics in Offline Reinforcement Learning
Yun-Hsuan Lien, Ping-Chun Hsieh, Tzu-Mao Li et al.
ICML 2024