α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Chenjia Bai
Chenjia Bai
1
Affiliations
Affiliations
Institute of AI, ChinaTelecom
18
papers
496
total citations
papers (18)
Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning
NEURIPS 2023
arXiv
138
citations
RORL: Robust Offline Reinforcement Learning via Conservative Smoothing
NEURIPS 2022
arXiv
105
citations
KungfuBot: Physics-Based Humanoid Whole-Body Control for Learning Highly-Dynamic Skills
NEURIPS 2025
arXiv
41
citations
Dynamic Bottleneck for Robust Self-Supervised Exploration
NEURIPS 2021
arXiv
36
citations
Cross-Domain Policy Adaptation via Value-Guided Data Filtering
NEURIPS 2023
arXiv
28
citations
SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation
ICML 2024
arXiv
27
citations
Cross-Domain Policy Adaptation by Capturing Representation Mismatch
ICML 2024
arXiv
25
citations
Online Preference Alignment for Language Models via Count-based Exploration
ICLR 2025
arXiv
20
citations
Adversarial Locomotion and Motion Imitation for Humanoid Policy Learning
NEURIPS 2025
arXiv
15
citations
OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments
AAAI 2024
arXiv
13
citations
Radiology Report Generation via Multi-objective Preference Optimization
AAAI 2025
arXiv
10
citations
Contrastive Representation for Data Filtering in Cross-Domain Offline Reinforcement Learning
ICML 2024
arXiv
10
citations
Forward KL Regularized Preference Optimization for Aligning Diffusion Policies
AAAI 2025
arXiv
8
citations
Constrained Ensemble Exploration for Unsupervised Skill Discovery
ICML 2024
arXiv
7
citations
HumanoidGen: Data Generation for Bimanual Dexterous Manipulation via LLM Reasoning
NEURIPS 2025
arXiv
6
citations
Task-Agnostic Pre-training and Task-Guided Fine-tuning for Versatile Diffusion Planner
ICML 2025
arXiv
4
citations
Information-Theoretic Reward Decomposition for Generalizable RLHF
NEURIPS 2025
arXiv
3
citations
How Does Goal Relabeling Improve Sample Efficiency?
ICML 2024
0
citations