"deep reinforcement learning" Papers

36 papers found

Agent-Aware Training for Agent-Agnostic Action Advising in Deep Reinforcement Learning

Yaoquan Wei, Shunyu Liu, Jie Song et al.

AAAI 2025, paper, arXiv:2311.16807

APIRL: Deep Reinforcement Learning for REST API Fuzzing

Myles Foley, Sergio Maffeis

AAAI 2025, paper, arXiv:2412.15991

Contrastive Representation for Interactive Recommendation

Jingyu Li, Zhiyong Feng, Dongxiao He et al.

AAAI 2025, paper, arXiv:2412.18396

Estimating cognitive biases with attention-aware inverse planning

Sounak Banerjee, Daphne Cornelisse, Deepak Gopinath et al.

NEURIPS 2025, spotlight, arXiv:2510.25951

Graph-Supported Dynamic Algorithm Configuration for Multi-Objective Combinatorial Optimization

Robbert Reijnen, Yaoxin Wu, Zaharah Bukhsh et al.

ICML 2025, arXiv:2505.16471

Logic-Q: Improving Deep Reinforcement Learning-based Quantitative Trading via Program Sketch-based Tuning

Zhiming Li, Junzhe Jiang, Yushi Cao et al.

AAAI 2025, paper, arXiv:2310.05551

Measure gradients, not activations! Enhancing neuronal activity in deep reinforcement learning

Jiashun Liu, Zihao Wu, Johan Obando Ceron et al.

NEURIPS 2025, arXiv:2505.24061

Mind the GAP! The Challenges of Scale in Pixel-based Deep Reinforcement Learning

Ghada Sokar, Pablo Samuel Castro

NEURIPS 2025, arXiv:2505.17749

Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning

Guozheng Ma, Lu Li, Zilin Wang et al.

ICML 2025, oral, arXiv:2506.17204

RAT: Adversarial Attacks on Deep Reinforcement Agents for Targeted Behaviors

Fengshuo Bai, Runze Liu, Yali Du et al.

AAAI 2025, paper, arXiv:2412.10713

Solving Continuous Mean Field Games: Deep Reinforcement Learning for Non-Stationary Dynamics

Lorenzo Magnino, Kai Shao, Zida Wu et al.

NEURIPS 2025, arXiv:2510.22158

Solving hidden monotone variational inequalities with surrogate losses

Ryan D'Orazio, Danilo Vucetic, Zichu Liu et al.

ICLR 2025, arXiv:2411.05228

SonoGym: High Performance Simulation for Challenging Surgical Tasks with Robotic Ultrasound

Yunke Ao, Masoud Moghani, Mayank Mittal et al.

NEURIPS 2025, arXiv:2507.01152

Stable Gradients for Stable Learning at Scale in Deep Reinforcement Learning

Roger Creus Castanyer, Johan Obando Ceron, Lu Li et al.

NEURIPS 2025, spotlight, arXiv:2506.15544

Time Reversal Symmetry for Efficient Robotic Manipulations in Deep Reinforcement Learning

Yunpeng Jiang, Jianshu Hu, Paul Weng et al.

NEURIPS 2025, oral, arXiv:2505.13925

ULTHO: Ultra-Lightweight yet Efficient Hyperparameter Optimization in Deep Reinforcement Learning

Mingqi Yuan, Bo Li, Xin Jin et al.

ICCV 2025, arXiv:2503.06101

Advancing DRL Agents in Commercial Fighting Games: Training, Integration, and Agent-Human Alignment

Chen Zhang, Qiang HE, Yuan Zhou et al.

ICML 2024, arXiv:2406.01103

Analyzing Generalization in Policy Networks: A Case Study with the Double-Integrator System

Ruining Zhang, Haoran Han, Maolong Lv et al.

AAAI 2024, paper, arXiv:2312.10472

Breaking the Barrier: Enhanced Utility and Robustness in Smoothed DRL Agents

Chung-En Sun, Sicun Gao, Lily Weng

ICML 2024, arXiv:2406.18062

Contextual Pre-planning on Reward Machine Abstractions for Enhanced Transfer in Deep Reinforcement Learning

Guy Azran, Mohamad H Danesh, Stefano Albrecht et al.

AAAI 2024, paper, arXiv:2307.05209

Discerning Temporal Difference Learning

Jianfei Ma

AAAI 2024, paper, arXiv:2310.08091

Distributional Bellman Operators over Mean Embeddings

Li Kevin Wenliang, Gregoire Deletang, Matthew Aitchison et al.

ICML 2024, oral, arXiv:2312.07358

DynSyn: Dynamical Synergistic Representation for Efficient Learning and Control in Overactuated Embodied Systems

Kaibo He, Chenhui Zuo, Chengtian Ma et al.

ICML 2024, arXiv:2407.11472

Fractional Deep Reinforcement Learning for Age-Minimal Mobile Edge Computing

Lyudong Jin, Ming Tang, Meng Zhang et al.

AAAI 2024, paper, arXiv:2312.10418

Graph-Based Prediction and Planning Policy Network (GP3Net) for Scalable Self-Driving in Dynamic Environments Using Deep Reinforcement Learning

Jayabrata Chowdhury, Venkataramanan Shivaraman, Suresh Sundaram et al.

AAAI 2024, paper, arXiv:2312.05784

In value-based deep reinforcement learning, a pruned network is a good network

Johan Obando Ceron, Aaron Courville, Pablo Samuel Castro

ICML 2024, arXiv:2402.12479

INViT: A Generalizable Routing Problem Solver with Invariant Nested View Transformer

Han Fang, Zhihao Song, Paul Weng et al.

ICML 2024, arXiv:2402.02317

Learning Coverage Paths in Unknown Environments with Deep Reinforcement Learning

Arvi Jonnarth, Jie Zhao, Michael Felsberg

ICML 2024, arXiv:2306.16978

Learning the Target Network in Function Space

Kavosh Asadi, Yao Liu, Shoham Sabach et al.

ICML 2024, arXiv:2406.01838

SHINE: Shielding Backdoors in Deep Reinforcement Learning

Zhuowen Yuan, Wenbo Guo, Jinyuan Jia et al.

ICML 2024

Stop Regressing: Training Value Functions via Classification for Scalable Deep RL

Jesse Farebrother, Jordi Orbay, Quan Vuong et al.

ICML 2024, arXiv:2403.03950

107 citations

Symmetric Replay Training: Enhancing Sample Efficiency in Deep Reinforcement Learning for Combinatorial Optimization

Hyeonah Kim, Minsu Kim, Sungsoo Ahn et al.

ICML 2024, arXiv:2306.01276

Task Planning for Object Rearrangement in Multi-Room Environments

Karan Mirakhor, Sourav Ghosh, Dipanjan Das et al.

AAAI 2024, paper, arXiv:2406.00451

Understanding and Diagnosing Deep Reinforcement Learning

Ezgi Korkmaz

ICML 2024, arXiv:2406.16979

Unlock the Cognitive Generalization of Deep Reinforcement Learning via Granular Ball Representation

Jiashun Liu, Jianye Hao, Yi Ma et al.

ICML 2024

Why Do Animals Need Shaping? A Theory of Task Composition and Curriculum Learning

Jin Hwa Lee, Stefano Mannelli, Andrew Saxe

ICML 2024, arXiv:2402.18361
