α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Yu Bai
Yu Bai
19
papers
963
total citations
papers (19)
Transformers as Statisticians: Provable In-Context Learning with In-Context Algorithm Selection
NEURIPS 2023
arXiv
271
citations
Policy Finetuning: Bridging Sample-Efficient Offline and Online Reinforcement Learning
NEURIPS 2021
arXiv
184
citations
Near-Optimal Reinforcement Learning with Self-Play
NEURIPS 2020
arXiv
143
citations
Sample-Efficient Learning of Stackelberg Equilibria in General-Sum Games
NEURIPS 2021
arXiv
76
citations
Near-Optimal Offline Reinforcement Learning via Double Variance Reduction
NEURIPS 2021
arXiv
71
citations
Towards Understanding Hierarchical Learning: Benefits of Neural Representations
NEURIPS 2020
arXiv
53
citations
What can a Single Attention Layer Learn? A Study Through the Random Features Lens
NEURIPS 2023
arXiv
39
citations
Policy Optimization for Markov Games: Unified Framework and Faster Convergence
NEURIPS 2022
arXiv
31
citations
TopV: Compatible Token Pruning with Inference Time Optimization for Fast and Low-Memory Multimodal Vision Language Model
CVPR 2025
arXiv
24
citations
Efficient Phi-Regret Minimization in Extensive-Form Games via Online Mirror Descent
NEURIPS 2022
arXiv
21
citations
Understanding the Under-Coverage Bias in Uncertainty Estimation
NEURIPS 2021
arXiv
14
citations
Identifying good directions to escape the NTK regime and efficiently learn low-degree plus sparse polynomials
NEURIPS 2022
arXiv
12
citations
DeIL: Direct-and-Inverse CLIP for Open-World Few-Shot Learning
CVPR 2024
11
citations
Sample-Efficient Learning of Correlated Equilibria in Extensive-Form Games
NEURIPS 2022
arXiv
10
citations
Is Inverse Reinforcement Learning Harder than Standard Reinforcement Learning? A Theoretical Perspective
ICML 2024
arXiv
3
citations
Efficient RL with Impaired Observability: Learning to Act with Delayed and Missing State Observations
NEURIPS 2023
0
citations
Text2Data: Low-Resource Data Generation with Textual Control
AAAI 2025
arXiv
0
citations
Collaborative Consortium of Foundation Models for Open-World Few-Shot Learning
AAAI 2024
0
citations
Excluding the Impossible for Open Vocabulary Semantic Segmentation
AAAI 2025
0
citations