Shangtong Zhang
8 papers, 53 total citations
papers (8)
Transformers Can Learn Temporal Difference Methods for In-Context Reinforcement Learning
ICLR 2025, arXiv, 15 citations

Learning Retrospective Knowledge with Reverse Reinforcement Learning
NeurIPS 2020, arXiv, 13 citations

Efficient Policy Evaluation with Offline Data Informed Behavior Policy Design
ICML 2024, arXiv, 7 citations

Revisiting a Design Choice in Gradient Temporal Difference Learning
ICLR 2025, arXiv, 6 citations

Doubly Optimal Policy Evaluation for Reinforcement Learning
ICLR 2025, arXiv, 5 citations

Linear $Q$-Learning Does Not Diverge in $L^2$: Convergence Rates to a Bounded Set
ICML 2025, arXiv, 4 citations

Efficient Multi-Policy Evaluation for Reinforcement Learning
AAAI 2025, arXiv, 2 citations

Efficient Policy Evaluation with Safety Constraint for Reinforcement Learning
ICLR 2025, arXiv, 1 citation