Poster "soft actor-critic" Papers
2 papers found
Conference
Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning
Michal Nauman, Michał Bortkiewicz, Piotr Milos et al.
ICML 2024arXiv:2403.00514
41
citations
RVI-SAC: Average Reward Off-Policy Deep Reinforcement Learning
Yukinari Hisaki, Isao Ono
ICML 2024arXiv:2408.01972
4
citations