"deep reinforcement learning" Papers

36 papers found

Agent-Aware Training for Agent-Agnostic Action Advising in Deep Reinforcement Learning

Yaoquan Wei, Shunyu Liu, Jie Song et al.

AAAI 2025 (paper), arXiv:2311.16807, 1 citation

APIRL: Deep Reinforcement Learning for REST API Fuzzing

Myles Foley, Sergio Maffeis

AAAI 2025 (paper), arXiv:2412.15991, 5 citations

Contrastive Representation for Interactive Recommendation

Jingyu Li, Zhiyong Feng, Dongxiao He et al.

AAAI 2025 (paper), arXiv:2412.18396

Estimating cognitive biases with attention-aware inverse planning

Sounak Banerjee, Daphne Cornelisse, Deepak Gopinath et al.

NEURIPS 2025 (spotlight), arXiv:2510.25951, 1 citation

Graph-Supported Dynamic Algorithm Configuration for Multi-Objective Combinatorial Optimization

Robbert Reijnen, Yaoxin Wu, Zaharah Bukhsh et al.

ICML 2025, arXiv:2505.16471, 1 citation

Logic-Q: Improving Deep Reinforcement Learning-based Quantitative Trading via Program Sketch-based Tuning

Zhiming Li, Junzhe Jiang, Yushi Cao et al.

AAAI 2025 (paper), arXiv:2310.05551, 3 citations

Measure gradients, not activations! Enhancing neuronal activity in deep reinforcement learning

Jiashun Liu, Zihao Wu, Johan Obando Ceron et al.

NEURIPS 2025, arXiv:2505.24061, 4 citations

Mind the GAP! The Challenges of Scale in Pixel-based Deep Reinforcement Learning

Ghada Sokar, Pablo Samuel Castro

NEURIPS 2025, arXiv:2505.17749, 1 citation

Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning

Guozheng Ma, Lu Li, Zilin Wang et al.

ICML 2025 (oral), arXiv:2506.17204, 7 citations

RAT: Adversarial Attacks on Deep Reinforcement Agents for Targeted Behaviors

Fengshuo Bai, Runze Liu, Yali Du et al.

AAAI 2025 (paper), arXiv:2412.10713, 12 citations

Solving Continuous Mean Field Games: Deep Reinforcement Learning for Non-Stationary Dynamics

Lorenzo Magnino, Kai Shao, Zida Wu et al.

NEURIPS 2025, arXiv:2510.22158, 2 citations

Solving hidden monotone variational inequalities with surrogate losses

Ryan D'Orazio, Danilo Vucetic, Zichu Liu et al.

ICLR 2025, arXiv:2411.05228

SonoGym: High Performance Simulation for Challenging Surgical Tasks with Robotic Ultrasound

Yunke Ao, Masoud Moghani, Mayank Mittal et al.

NEURIPS 2025, arXiv:2507.01152, 1 citation

Stable Gradients for Stable Learning at Scale in Deep Reinforcement Learning

Roger Creus Castanyer, Johan Obando Ceron, Lu Li et al.

NEURIPS 2025 (spotlight), arXiv:2506.15544, 10 citations

Time Reversal Symmetry for Efficient Robotic Manipulations in Deep Reinforcement Learning

Yunpeng Jiang, Jianshu Hu, Paul Weng et al.

NEURIPS 2025 (oral), arXiv:2505.13925

ULTHO: Ultra-Lightweight yet Efficient Hyperparameter Optimization in Deep Reinforcement Learning

Mingqi Yuan, Bo Li, Xin Jin et al.

ICCV 2025, arXiv:2503.06101, 1 citation

Advancing DRL Agents in Commercial Fighting Games: Training, Integration, and Agent-Human Alignment

Chen Zhang, Qiang HE, Yuan Zhou et al.

ICML 2024, arXiv:2406.01103, 6 citations

Analyzing Generalization in Policy Networks: A Case Study with the Double-Integrator System

Ruining Zhang, Haoran Han, Maolong Lv et al.

AAAI 2024 (paper), arXiv:2312.10472, 4 citations

Breaking the Barrier: Enhanced Utility and Robustness in Smoothed DRL Agents

Chung-En Sun, Sicun Gao, Lily Weng

ICML 2024, arXiv:2406.18062, 6 citations

Contextual Pre-planning on Reward Machine Abstractions for Enhanced Transfer in Deep Reinforcement Learning

Guy Azran, Mohamad H Danesh, Stefano Albrecht et al.

AAAI 2024 (paper), arXiv:2307.05209, 2 citations

Discerning Temporal Difference Learning

Jianfei Ma

AAAI 2024 (paper), arXiv:2310.08091, 1 citation

Distributional Bellman Operators over Mean Embeddings

Li Kevin Wenliang, Gregoire Deletang, Matthew Aitchison et al.

ICML 2024 (oral), arXiv:2312.07358, 4 citations

DynSyn: Dynamical Synergistic Representation for Efficient Learning and Control in Overactuated Embodied Systems

Kaibo He, Chenhui Zuo, Chengtian Ma et al.

ICML 2024, arXiv:2407.11472, 10 citations

Fractional Deep Reinforcement Learning for Age-Minimal Mobile Edge Computing

Lyudong Jin, Ming Tang, Meng Zhang et al.

AAAI 2024 (paper), arXiv:2312.10418, 6 citations

Graph-Based Prediction and Planning Policy Network (GP3Net) for Scalable Self-Driving in Dynamic Environments Using Deep Reinforcement Learning

Jayabrata Chowdhury, Venkataramanan Shivaraman, Suresh Sundaram et al.

AAAI 2024 (paper), arXiv:2312.05784, 10 citations

In value-based deep reinforcement learning, a pruned network is a good network

Johan Obando Ceron, Aaron Courville, Pablo Samuel Castro

ICML 2024, arXiv:2402.12479, 33 citations

INViT: A Generalizable Routing Problem Solver with Invariant Nested View Transformer

Han Fang, Zhihao Song, Paul Weng et al.

ICML 2024, arXiv:2402.02317, 32 citations

Learning Coverage Paths in Unknown Environments with Deep Reinforcement Learning

Arvi Jonnarth, Jie Zhao, Michael Felsberg

ICML 2024, arXiv:2306.16978, 18 citations

Learning the Target Network in Function Space

Kavosh Asadi, Yao Liu, Shoham Sabach et al.

ICML 2024, arXiv:2406.01838, 1 citation

SHINE: Shielding Backdoors in Deep Reinforcement Learning

Zhuowen Yuan, Wenbo Guo, Jinyuan Jia et al.

ICML 2024

Stop Regressing: Training Value Functions via Classification for Scalable Deep RL

Jesse Farebrother, Jordi Orbay, Quan Vuong et al.

ICML 2024, arXiv:2403.03950, 107 citations

Symmetric Replay Training: Enhancing Sample Efficiency in Deep Reinforcement Learning for Combinatorial Optimization

Hyeonah Kim, Minsu Kim, Sungsoo Ahn et al.

ICML 2024, arXiv:2306.01276, 9 citations

Task Planning for Object Rearrangement in Multi-Room Environments

Karan Mirakhor, Sourav Ghosh, Dipanjan Das et al.

AAAI 2024 (paper), arXiv:2406.00451, 2 citations

Understanding and Diagnosing Deep Reinforcement Learning

Ezgi Korkmaz

ICML 2024, arXiv:2406.16979, 7 citations

Unlock the Cognitive Generalization of Deep Reinforcement Learning via Granular Ball Representation

Jiashun Liu, Jianye Hao, Yi Ma et al.

ICML 2024

Why Do Animals Need Shaping? A Theory of Task Composition and Curriculum Learning

Jin Hwa Lee, Stefano Mannelli, Andrew Saxe

ICML 2024, arXiv:2402.18361, 12 citations