Tian Xu
6 papers, 333 total citations
Papers (6)
ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models
ICML 2024, arXiv, 147 citations
Error Bounds of Imitating Policies and Environments
NeurIPS 2020, arXiv, 135 citations
Preserving Diversity in Supervised Fine-Tuning of Large Language Models
ICLR 2025, arXiv, 37 citations
Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning
ICLR 2024, arXiv, 14 citations
Imitation Learning from Imperfection: Theoretical Justifications and Algorithms
NeurIPS 2023, 0 citations
Limited Preference Aided Imitation Learning from Imperfect Demonstrations
ICML 2024, 0 citations