"policy learning" Papers
21 papers found
Bootstrapped Model Predictive Control
Yuhang Wang, Hanwei Guo, Sizhe Wang et al.
Bridging Equivariant GNNs and Spherical CNNs for Structured Physical Domains
Colin Kohler, Purvik Patel, Nathan Vaska et al.
BTBS-LNS: Binarized-Tightening, Branch and Search on Learning LNS Policies for MIP
Hao Yuan, Wenli Ouyang, Changwen Zhang et al.
Compliant Residual DAgger: Improving Real-World Contact-Rich Manipulation with Human Corrections
Xiaomeng Xu, Yifan Hou, Zeyi Liu et al.
Defending Against Sophisticated Poisoning Attacks with RL-based Aggregation in Federated Learning
Yujing Wang, Hainan Zhang, Sijia Wen et al.
DriveDPO: Policy Learning via Safety DPO For End-to-End Autonomous Driving
Shuyao Shang, Yuntao Chen, Yuqi Wang et al.
Efficient Reinforcement Learning Through Adaptively Pretrained Visual Encoder
Yuhan Zhang, Guoqing Ma, Guangfu Hao et al.
EgoAdapt: Adaptive Multisensory Distillation and Policy Learning for Efficient Egocentric Perception
Sanjoy Chowdhury, Subrata Biswas, Sayan Nag et al.
LaRes: Evolutionary Reinforcement Learning with LLM-based Adaptive Reward Search
Pengyi Li, Hongyao Tang, Jinbin Qiao et al.
Learning 3D Persistent Embodied World Models
Siyuan Zhou, Yilun Du, Yuncong Yang et al.
PN-GAIL: Leveraging Non-optimal Information from Imperfect Demonstrations
Qiang Liu, Huiqiao Fu, Kaiqiang Tang et al.
SimulMEGA: MoE Routers are Advanced Policy Makers for Simultaneous Speech Translation
Chenyang Le, Bing Han, Jinshun Li et al.
What Matters in Learning from Large-Scale Datasets for Robot Manipulation
Vaibhav Saxena, Matthew Bronars, Nadun Ranawaka Arachchige et al.
DiffStitch: Boosting Offline Reinforcement Learning with Diffusion-based Trajectory Stitching
Guanghe Li, Yixiang Shan, Zhengbang Zhu et al.
Effect-Invariant Mechanisms for Policy Generalization
Sorawit Saengkyongam, Niklas Pfister, Predrag Klasnja et al.
Fair Off-Policy Learning from Observational Data
Dennis Frauen, Valentyn Melnychuk, Stefan Feuerriegel
Learning Uncertainty-Aware Temporally-Extended Actions
Joongkyu Lee, Seung Joon Park, Yunhao Tang et al.
Off-policy Evaluation Beyond Overlap: Sharp Partial Identification Under Smoothness
Samir Khan, Martin Saveski, Johan Ugander
Pausing Policy Learning in Non-stationary Reinforcement Learning
Hyunin Lee, Ming Jin, Javad Lavaei et al.
Policy Learning for Balancing Short-Term and Long-Term Rewards
Peng Wu, Ziyu Shen, Feng Xie et al.
Reinforcement Learning within Tree Search for Fast Macro Placement
Zijie Geng, Jie Wang, Ziyan Liu et al.