"value-based methods" Papers
5 papers found
Conference
Q-SFT: Q-Learning for Language Models via Supervised Fine-Tuning
Joey Hong, Anca Dragan, Sergey Levine
ICLR 2025arXiv:2411.05193
8
citations
Value-Guided Decision Transformer: A Unified Reinforcement Learning Framework for Online and Offline Settings
Hongling Zheng, Li Shen, Yong Luo et al.
NEURIPS 2025
Augmenting Decision with Hypothesis in Reinforcement Learning
Nguyen Minh Quang, Hady Lauw
ICML 2024
In value-based deep reinforcement learning, a pruned network is a good network
Johan Obando Ceron, Aaron Courville, Pablo Samuel Castro
ICML 2024arXiv:2402.12479
33
citations
Near-Optimal Regret in Linear MDPs with Aggregate Bandit Feedback
Asaf Cassel, Haipeng Luo, Aviv Rosenberg et al.
ICML 2024arXiv:2405.07637
5
citations