α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
David Lindner
David Lindner
6
papers
1,049
total citations
papers (6)
Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
ICLR 2025
arXiv
750
citations
Vision-Language Models are Zero-Shot Reward Models for Reinforcement Learning
ICLR 2024
arXiv
137
citations
Tracr: Compiled Transformers as a Laboratory for Interpretability
NEURIPS 2023
arXiv
91
citations
Active Exploration for Inverse Reinforcement Learning
NEURIPS 2022
arXiv
33
citations
Information Directed Reward Learning for Reinforcement Learning
NEURIPS 2021
arXiv
25
citations
Large language models can learn and generalize steganographic chain-of-thought under process supervision
NEURIPS 2025
arXiv
13
citations