α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Shenao Zhang
Shenao Zhang
7
papers
51
total citations
papers (7)
Maximize to Explore: One Objective Function Fusing Estimation, Planning, and Exploration
NEURIPS 2023
arXiv
26
citations
BRiTE: Bootstrapping Reinforced Thinking Process to Enhance Language Model Reasoning
ICML 2025
arXiv
9
citations
Conservative Dual Policy Optimization for Efficient Model-Based Reinforcement Learning
NEURIPS 2022
arXiv
6
citations
Reward-Augmented Data Enhances Direct Preference Alignment of LLMs
ICML 2025
arXiv
6
citations
Model-Based Reparameterization Policy Gradient Methods: Theory and Practical Algorithms
NEURIPS 2023
arXiv
4
citations
Adaptive-Gradient Policy Optimization: Enhancing Policy Learning in Non-Smooth Differentiable Simulations
ICML 2024
0
citations
Reason for Future, Act for Now: A Principled Architecture for Autonomous LLM Agents
ICML 2024
0
citations