α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Tengyang Xie
Tengyang Xie
8
papers
749
total citations
papers (8)
Bellman-consistent Pessimism for Offline Reinforcement Learning
NEURIPS 2021
arXiv
308
citations
Policy Finetuning: Bridging Sample-Efficient Offline and Online Reinforcement Learning
NEURIPS 2021
arXiv
184
citations
Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy Data
ICML 2024
arXiv
179
citations
Adversarial Model for Offline Reinforcement Learning
NEURIPS 2023
arXiv
36
citations
Reinforce LLM Reasoning through Multi-Agent Reflection
ICML 2025
arXiv
19
citations
Interaction-Grounded Learning with Action-Inclusive Feedback
NEURIPS 2022
arXiv
11
citations
Do We Need to Verify Step by Step? Rethinking Process Supervision from a Theoretical Perspective
ICML 2025
arXiv
8
citations
Outcome-Based Online Reinforcement Learning: Algorithms and Fundamental Limits
NEURIPS 2025
arXiv
4
citations