α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Xiaoteng Ma
Xiaoteng Ma
8
papers
418
total citations
papers (8)
Mildly Conservative Q-Learning for Offline Reinforcement Learning
NEURIPS 2022
arXiv
141
citations
RORL: Robust Offline Reinforcement Learning via Conservative Smoothing
NEURIPS 2022
arXiv
105
citations
Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning
NEURIPS 2021
arXiv
101
citations
Cross-Domain Policy Adaptation via Value-Guided Data Filtering
NEURIPS 2023
arXiv
28
citations
Efficient Multi-agent Reinforcement Learning by Planning
ICLR 2024
arXiv
18
citations
Single-Trajectory Distributionally Robust Reinforcement Learning
ICML 2024
arXiv
17
citations
Learning Diverse Risk Preferences in Population-Based Self-Play
AAAI 2024
arXiv
8
citations
Exploit Reward Shifting in Value-Based Deep-RL: Optimistic Curiosity-Based Exploration and Conservative Exploitation via Linear Reward Shaping
NEURIPS 2022
0
citations