α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Lin Yang
Lin Yang
28
papers
906
total citations
papers (28)
Model-Based Multi-Agent RL in Zero-Sum Markov Games with Near-Optimal Sample Complexity
NEURIPS 2020
arXiv
133
citations
On Reward-Free Reinforcement Learning with Linear Function Approximation
NEURIPS 2020
arXiv
114
citations
Toward the Fundamental Limits of Imitation Learning
NEURIPS 2020
arXiv
108
citations
Task-Specific Fine-Tuning via Variational Information Bottleneck for Weakly-Supervised Pathology Whole Slide Image Classification
CVPR 2023
arXiv
84
citations
PathAsst: A Generative Foundation AI Assistant towards Artificial General Intelligence of Pathology
AAAI 2024
arXiv
79
citations
Preference-based Reinforcement Learning with Finite-Time Guarantees
NEURIPS 2020
arXiv
72
citations
Attention-Challenging Multiple Instance Learning for Whole Slide Image Classification
ECCV 2024
arXiv
66
citations
Reinforcement Learning with General Value Function Approximation: Provably Efficient Approach via Bounded Eluder Dimension
NEURIPS 2020
arXiv
55
citations
Near-Optimal Sample Complexity Bounds for Constrained MDPs
NEURIPS 2022
arXiv
44
citations
WSI-VQA: Interpreting Whole Slide Images by Generative Visual Question Answering
ECCV 2024
arXiv
25
citations
CPath-Omni: A Unified Multimodal Foundation Model for Patch and Whole Slide Image Analysis in Computational Pathology
CVPR 2025
arXiv
23
citations
Replicability in Reinforcement Learning
NEURIPS 2023
arXiv
18
citations
Provably Feedback-Efficient Reinforcement Learning via Active Reward Learning
NEURIPS 2022
arXiv
16
citations
Is Plug-in Solver Sample-Efficient for Feature-based Reinforcement Learning?
NEURIPS 2020
arXiv
14
citations
Tackling Heavy-Tailed Rewards in Reinforcement Learning with Function Approximation: Minimax Optimal and Instance-Dependent Regret Bounds
NEURIPS 2023
arXiv
13
citations
Accommodating Picky Customers: Regret Bound and Exploration Complexity for Multi-Objective Reinforcement Learning
NEURIPS 2021
arXiv
11
citations
Breaking the Moments Condition Barrier: No-Regret Algorithm for Bandits with Super Heavy-Tailed Payoffs
NEURIPS 2021
arXiv
11
citations
DPA-P2PNet: Deformable Proposal-Aware P2PNet for Accurate Point-Based Cell Detection
AAAI 2024
arXiv
8
citations
Provably Efficient Exploration for Reinforcement Learning Using Unsupervised Learning
NEURIPS 2020
arXiv
7
citations
Efficient Robust Bayesian Optimization for Arbitrary Uncertain inputs
NEURIPS 2023
arXiv
5
citations
Cooperative Stochastic Bandits with Asynchronous Agents and Constrained Feedback
NEURIPS 2021
0
citations
Cross-Patch Dense Contrastive Learning for Semi-Supervised Segmentation of Cellular Nuclei in Histopathologic Images
CVPR 2022
0
citations
Learning from Distributed Users in Contextual Linear Bandits Without Sharing the Context
NEURIPS 2022
0
citations
Is Long Horizon RL More Difficult Than Short Horizon RL?
NEURIPS 2020
0
citations
Transfer Q-Learning with Composite MDP Structures
ICML 2025
0
citations
Efficient Batched Algorithm for Contextual Linear Bandits with Large Action Space via Soft Elimination
NEURIPS 2023
0
citations
Planning with General Objective Functions: Going Beyond Total Rewards
NEURIPS 2020
0
citations
On the Value of Interaction and Function Approximation in Imitation Learning
NEURIPS 2021
0
citations