Poster "continuous control tasks" Papers
14 papers found
Adversarial Policy Optimization for Offline Preference-based Reinforcement Learning
Hyungkyu Kang, Min-hwan Oh
ICLR 2025, arXiv:2503.05306, 3 citations
Bootstrapped Model Predictive Control
Yuhang Wang, Hanwei Guo, Sizhe Wang et al.
ICLR 2025, arXiv:2503.18871, 6 citations
Efficient Discovery of Pareto Front for Multi-Objective Reinforcement Learning
Ruohong Liu, Yuxin Pan, Linjie Xu et al.
ICLR 2025, 3 citations
Efficient Model-Based Reinforcement Learning Through Optimistic Thompson Sampling
Jasmine Bayrooti, Carl Ek, Amanda Prorok
ICLR 2025, arXiv:2410.04988, 3 citations
Interactive and Hybrid Imitation Learning: Provably Beating Behavior Cloning
Yichen Li, Chicheng Zhang
NeurIPS 2025, arXiv:2412.07057
Risk-Sensitive Variational Actor-Critic: A Model-Based Approach
Alonso Granados, Mohammadreza Ebrahimi, Jason Pacheco
ICLR 2025, 1 citation
Scaling Off-Policy Reinforcement Learning with Batch and Weight Normalization
Daniel Palenicek, Florian Vogt, Joe Watson et al.
NeurIPS 2025, arXiv:2502.07523, 9 citations
Absolute Policy Optimization: Enhancing Lower Probability Bound of Performance with High Confidence
Weiye Zhao, Feihan Li, Yifan Sun et al.
ICML 2024
ACE: Off-Policy Actor-Critic with Causality-Aware Entropy Regularization
Tianying Ji, Yongyuan Liang, Yan Zeng et al.
ICML 2024
A Minimaximalist Approach to Reinforcement Learning from Human Feedback
Gokul Swamy, Christoph Dann, Rahul Kidambi et al.
ICML 2024, arXiv:2401.04056, 139 citations
EvIL: Evolution Strategies for Generalisable Imitation Learning
Silvia Sapora, Gokul Swamy, Christopher Lu et al.
ICML 2024, arXiv:2406.11905, 8 citations
Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics
Luca Grillotti, Maxence Faldor, Borja G. León et al.
ICML 2024, arXiv:2403.09930, 12 citations
Reward Shaping for Reinforcement Learning with An Assistant Reward Agent
Haozhe Ma, Kuankuan Sima, Thanh Vinh Vo et al.
ICML 2024
Seizing Serendipity: Exploiting the Value of Past Success in Off-Policy Actor-Critic
Tianying Ji, Yu Luo, Fuchun Sun et al.
ICML 2024, arXiv:2306.02865, 21 citations