"safe reinforcement learning" Papers

18 papers found

Adaptable Safe Policy Learning from Multi-task Data with Constraint Prioritized Decision Transformer

Ruiqi Xue, Ziqian Zhang, Lihe Li et al.

NEURIPS 2025

Alignment of Large Language Models with Constrained Learning

Botong Zhang, Shuo Li, Ignacio Hounie et al.

NEURIPS 2025arXiv:2505.19387
2
citations

Explainably Safe Reinforcement Learning

Sabine Rieder, Stefan Pranger, Debraj Chakraborty et al.

NEURIPS 2025

Fuz-RL: A Fuzzy-Guided Robust Framework for Safe Reinforcement Learning under Uncertainty

Xu Wan, Chao Yang, Cheng Yang et al.

NEURIPS 2025

HASARD: A Benchmark for Vision-Based Safe Reinforcement Learning in Embodied Agents

Tristan Tomilin, Meng Fang, Mykola Pechenizkiy

ICLR 2025arXiv:2503.08241
5
citations

MOSDT: Self-Distillation-Based Decision Transformer for Multi-Agent Offline Safe Reinforcement Learning

Yuchen Xia, Yunjian Xu

NEURIPS 2025

Online Optimization for Offline Safe Reinforcement Learning

Yassine Chemingui, Aryan Deshwal, Alan Fern et al.

NEURIPS 2025arXiv:2510.22027

Provably Efficient RL under Episode-Wise Safety in Constrained MDPs with Linear Function Approximation

Toshinori Kitamura, Arnob Ghosh, Tadashi Kozuno et al.

NEURIPS 2025spotlightarXiv:2502.10138

Safe RLHF-V: Safe Reinforcement Learning from Multi-modal Human Feedback

Jiaming Ji, Xinyu Chen, Rui Pan et al.

NEURIPS 2025arXiv:2503.17682
9
citations

SonoGym: High Performance Simulation for Challenging Surgical Tasks with Robotic Ultrasound

Yunke Ao, Masoud Moghani, Mayank Mittal et al.

NEURIPS 2025arXiv:2507.01152
1
citations

Tilted Quantile Gradient Updates for Quantile-Constrained Reinforcement Learning

Chenglin Li, Guangchun Ruan, Hua Geng

AAAI 2025paperarXiv:2412.13184
1
citations

Feasibility Consistent Representation Learning for Safe Reinforcement Learning

Zhepeng Cen, Yihang Yao, Zuxin Liu et al.

ICML 2024arXiv:2405.11718
3
citations

Feasible Reachable Policy Iteration

Shentao Qin, Yujie Yang, Yao Mu et al.

ICML 2024

Imitate the Good and Avoid the Bad: An Incremental Approach to Safe Reinforcement Learning

Huy Hoang, Tien Mai, Pradeep Varakantham

AAAI 2024paperarXiv:2312.10385
8
citations

Langevin Policy for Safe Reinforcement Learning

Fenghao Lei, Long Yang, Shiting Wen et al.

ICML 2024

SafeDreamer: Safe Reinforcement Learning with World Models

Weidong Huang, Jiaming Ji, Chunhe Xia et al.

ICLR 2024arXiv:2307.07176
37
citations

Safe Reinforcement Learning using Finite-Horizon Gradient-based Estimation

Juntao Dai, Yaodong Yang, Qian Zheng et al.

ICML 2024arXiv:2412.11138
3
citations

Temporal Logic Specification-Conditioned Decision Transformer for Offline Safe Reinforcement Learning

Zijian Guo, Weichao Zhou, Wenchao Li

ICML 2024oralarXiv:2402.17217
5
citations