"continuous control tasks" Papers

18 papers found

Adversarial Policy Optimization for Offline Preference-based Reinforcement Learning

Hyungkyu Kang, Min-hwan Oh

ICLR 2025arXiv:2503.05306
3
citations

Bootstrapped Model Predictive Control

Yuhang Wang, Hanwei Guo, Sizhe Wang et al.

ICLR 2025arXiv:2503.18871
6
citations

Efficient Discovery of Pareto Front for Multi-Objective Reinforcement Learning

Ruohong Liu, Yuxin Pan, Linjie Xu et al.

ICLR 2025
3
citations

Efficient Model-Based Reinforcement Learning Through Optimistic Thompson Sampling

Jasmine Bayrooti, Carl Ek, Amanda Prorok

ICLR 2025arXiv:2410.04988
3
citations

Interactive and Hybrid Imitation Learning: Provably Beating Behavior Cloning

Yichen Li, Chicheng Zhang

NEURIPS 2025arXiv:2412.07057

Risk-Sensitive Variational Actor-Critic: A Model-Based Approach

Alonso Granados, Mohammadreza Ebrahimi, Jason Pacheco

ICLR 2025
1
citations

Scaling Off-Policy Reinforcement Learning with Batch and Weight Normalization

Daniel Palenicek, Florian Vogt, Joe Watson et al.

NEURIPS 2025arXiv:2502.07523
9
citations

SMoSE: Sparse Mixture of Shallow Experts for Interpretable Reinforcement Learning in Continuous Control Tasks

Mátyás Vincze, Laura Ferrarotti, Leonardo Lucio Custode et al.

AAAI 2025paperarXiv:2412.13053
2
citations

Absolute Policy Optimization: Enhancing Lower Probability Bound of Performance with High Confidence

Weiye Zhao, Feihan Li, Yifan Sun et al.

ICML 2024

ACE: Off-Policy Actor-Critic with Causality-Aware Entropy Regularization

Tianying Ji, Yongyuan Liang, Yan Zeng et al.

ICML 2024

A Minimaximalist Approach to Reinforcement Learning from Human Feedback

Gokul Swamy, Christoph Dann, Rahul Kidambi et al.

ICML 2024arXiv:2401.04056
139
citations

Diffusion Model-Augmented Behavioral Cloning

Shang-Fu Chen, Hsiang-Chun Wang, Ming-Hao Hsu et al.

ICML 2024oralarXiv:2302.13335
42
citations

EvIL: Evolution Strategies for Generalisable Imitation Learning

Silvia Sapora, Gokul Swamy, Christopher Lu et al.

ICML 2024arXiv:2406.11905
8
citations

Hybrid Inverse Reinforcement Learning

Juntao Ren, Gokul Swamy, Steven Wu et al.

ICML 2024oralarXiv:2402.08848
29
citations

Offline Actor-Critic Reinforcement Learning Scales to Large Models

Jost Tobias Springenberg, Abbas Abdolmaleki, Jingwei Zhang et al.

ICML 2024oralarXiv:2402.05546
35
citations

Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics

Luca Grillotti, Maxence Faldor, Borja G. León et al.

ICML 2024arXiv:2403.09930
12
citations

Reward Shaping for Reinforcement Learning with An Assistant Reward Agent

Haozhe Ma, Kuankuan Sima, Thanh Vinh Vo et al.

ICML 2024

Seizing Serendipity: Exploiting the Value of Past Success in Off-Policy Actor-Critic

Tianying Ji, Yu Luo, Fuchun Sun et al.

ICML 2024arXiv:2306.02865
21
citations