α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Nan Jiang
Nan Jiang
1
Affiliations
Affiliations
The University of Chicago
26
papers
1,262
total citations
papers (26)
Iterative Preference Learning from Human Feedback: Bridging Theory and Practice for RLHF under KL-constraint
ICML 2024
arXiv
312
citations
Bellman-consistent Pessimism for Offline Reinforcement Learning
NEURIPS 2021
arXiv
308
citations
Policy Finetuning: Bridging Sample-Efficient Offline and Online Reinforcement Learning
NEURIPS 2021
arXiv
184
citations
Scaling Up Dynamic Human-Scene Interaction Modeling
CVPR 2024
arXiv
107
citations
Full-Body Articulated Human-Object Interaction
ICCV 2023
arXiv
72
citations
Towards Hyperparameter-free Policy Selection for Offline Reinforcement Learning
NEURIPS 2021
arXiv
43
citations
Adversarial Model for Offline Reinforcement Learning
NEURIPS 2023
arXiv
36
citations
On the Statistical Efficiency of Reward-Free Exploration in Non-Linear RL
NEURIPS 2022
arXiv
29
citations
Future-Dependent Value-Based Off-Policy Evaluation in POMDPs
NEURIPS 2023
arXiv
24
citations
F-HOI: Toward Fine-grained Semantic-Aligned 3D Human-Object Interactions
ECCV 2024
arXiv
23
citations
GameArena: Evaluating LLM Reasoning through Live Computer Games
ICLR 2025
arXiv
20
citations
Commit0: Library Generation from Scratch
ICLR 2025
arXiv
19
citations
Minimax Value Interval for Off-Policy Evaluation and Policy Optimization
NEURIPS 2020
arXiv
17
citations
Is attention required for ICL? Exploring the Relationship Between Model Architecture and In-Context Learning Ability
ICLR 2024
arXiv
16
citations
Interaction-Grounded Learning with Action-Inclusive Feedback
NEURIPS 2022
arXiv
11
citations
Beyond the Return: Off-policy Function Estimation under User-specified Error-measuring Distributions
NEURIPS 2022
arXiv
9
citations
A Few Expert Queries Suffices for Sample-Efficient RL with Resets and Linear Value Approximation
NEURIPS 2022
arXiv
6
citations
Racing Control Variable Genetic Programming for Symbolic Regression
AAAI 2024
arXiv
6
citations
LATTE: Improving Latex Recognition for Tables and Formulae with Iterative Refinement
AAAI 2025
arXiv
4
citations
Dynamic Motion Blending for Versatile Motion Editing
CVPR 2025
arXiv
4
citations
Model Selection for Off-policy Evaluation: New Algorithms and Experimental Protocol
NEURIPS 2025
arXiv
4
citations
Tiered Reinforcement Learning: Pessimism in the Face of Uncertainty and Constant Regret
NEURIPS 2022
arXiv
4
citations
A Snapshot of Influence: A Local Data Attribution Framework for Online Reinforcement Learning
NEURIPS 2025
arXiv
3
citations
Solving Satisfiability Modulo Counting for Symbolic and Statistical AI Integration with Provable Guarantees
AAAI 2024
arXiv
1
citations
Active Symbolic Discovery of Ordinary Differential Equations via Phase Portrait Sketching
AAAI 2025
arXiv
0
citations
When Counterpoint Meets Chinese Folk Melodies
NEURIPS 2020
0
citations