"actor-critic algorithms" Papers
7 papers found
Conference
$q$-exponential family for policy optimization
Lingwei Zhu, Haseeb Shah, Han Wang et al.
ICLR 2025arXiv:2408.07245
2
citations
Non-Adversarial Inverse Reinforcement Learning via Successor Feature Matching
Arnav Kumar Jain, Harley Wiltzer, Jesse Farebrother et al.
ICLR 2025arXiv:2411.07007
6
citations
Studying the Interplay Between the Actor and Critic Representations in Reinforcement Learning
Samuel Garcin, Trevor McInroe, Pablo Samuel Castro et al.
ICLR 2025arXiv:2503.06343
5
citations
Closing the Gap: Achieving Global Convergence (Last Iterate) of Actor-Critic under Markovian Sampling with Neural Network Parametrization
Mudit Gaur, Amrit Singh Bedi, Di Wang et al.
ICML 2024spotlightarXiv:2405.01843
Controlling Behavioral Diversity in Multi-Agent Reinforcement Learning
Matteo Bettini, Ryan Kortvelesy, Amanda Prorok
ICML 2024oralarXiv:2405.15054
9
citations
Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study
Shusheng Xu, Wei Fu, Jiaxuan Gao et al.
ICML 2024arXiv:2404.10719
253
citations
Offline Actor-Critic Reinforcement Learning Scales to Large Models
Jost Tobias Springenberg, Abbas Abdolmaleki, Jingwei Zhang et al.
ICML 2024oralarXiv:2402.05546
35
citations