Kaiwen Wang
6 papers, 75 total citations
papers (6)
The Benefits of Being Distributional: Small-Loss Bounds for Reinforcement Learning. NEURIPS 2023. arXiv. 25 citations.
More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning. ICML 2024. arXiv. 17 citations.
Deep Multi-Modal Structural Equations For Causal Effect Estimation With Unstructured Proxies. NEURIPS 2022. arXiv. 14 citations.
$Q\sharp$: Provably Optimal Distributional RL for LLM Post-Training. NEURIPS 2025. arXiv. 12 citations.
Value-Guided Search for Efficient Chain-of-Thought Reasoning. NEURIPS 2025. arXiv. 7 citations.
Switching the Loss Reduces the Cost in Batch Reinforcement Learning. ICML 2024. 0 citations.