Chongjie Zhang
20 papers
610 total citations
papers (20)
Celebrating Diversity in Shared Multi-Agent Reinforcement Learning (NeurIPS 2021, arXiv): 181 citations
Episodic Multi-agent Reinforcement Learning with Curiosity-driven Exploration (NeurIPS 2021, arXiv): 115 citations
RORL: Robust Offline Reinforcement Learning via Conservative Smoothing (NeurIPS 2022, arXiv): 105 citations
Offline Reinforcement Learning with Reverse Model-based Imagination (NeurIPS 2021, arXiv): 70 citations
Towards Understanding Cooperative Multi-Agent Q-Learning with Value Factorization (NeurIPS 2021, arXiv): 45 citations
Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning (NeurIPS 2020, arXiv): 21 citations
Low-Rank Modular Reinforcement Learning via Muscle Synergy (NeurIPS 2022, arXiv): 20 citations
Unsupervised Behavior Extraction via Random Intent Priors (NeurIPS 2023, arXiv): 15 citations
Non-Linear Coordination Graphs (NeurIPS 2022, arXiv): 9 citations
Bayesian Design Principles for Offline-to-Online Reinforcement Learning (ICML 2024, arXiv): 8 citations
CUP: Critic-Guided Policy Reuse (NeurIPS 2022, arXiv): 8 citations
Leveraging Hyperbolic Embeddings for Coarse-to-Fine Robot Design (ICLR 2024, arXiv): 5 citations
Learning to Plan Before Answering: Self-Teaching LLMs to Learn Abstract Plans for Problem Solving (ICLR 2025, arXiv): 4 citations
Enhancing Decision-Making of Large Language Models via Actor-Critic (ICML 2025, arXiv): 4 citations
Planning, Fast and Slow: Online Reinforcement Learning with Action-Free Offline Data via Multiscale Planners (ICML 2024): 0 citations
Model-Based Reinforcement Learning via Imagination with Derived Memory (NeurIPS 2021): 0 citations
Conservative Offline Policy Adaptation in Multi-Agent Games (NeurIPS 2023): 0 citations
LAPO: Latent-Variable Advantage-Weighted Policy Optimization for Offline Reinforcement Learning (NeurIPS 2022): 0 citations
Safe Opponent-Exploitation Subgame Refinement (NeurIPS 2022): 0 citations
On the Estimation Bias in Double Q-Learning (NeurIPS 2021, arXiv): 0 citations