α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Mohammad Ghavamzadeh
Mohammad Ghavamzadeh
13
papers
592
total citations
papers (13)
DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models
NEURIPS 2023
arXiv
311
citations
Robust Reinforcement Learning using Offline Data
NEURIPS 2022
arXiv
112
citations
Efficient Risk-Averse Reinforcement Learning
NEURIPS 2022
arXiv
54
citations
Adaptive Sampling for Minimax Fair Classification
NEURIPS 2021
arXiv
43
citations
Does Thinking More Always Help? Mirage of Test-Time Scaling in Reasoning Models
NEURIPS 2025
arXiv
24
citations
On Dynamic Programming Decompositions of Static Risk Measures in Markov Decision Processes
NEURIPS 2023
arXiv
13
citations
Operator Splitting Value Iteration
NEURIPS 2022
arXiv
9
citations
Confidence-aware Reward Optimization for Fine-tuning Text-to-Image Models
ICLR 2024
arXiv
7
citations
Offline Reinforcement Learning for Mixture-of-Expert Dialogue Management
NEURIPS 2023
arXiv
7
citations
Online Planning with Lookahead Policies
NEURIPS 2020
arXiv
5
citations
Ordering-based Conditions for Global Convergence of Policy Gradient Methods
NEURIPS 2023
arXiv
4
citations
Private and Communication-Efficient Algorithms for Entropy Estimation
NEURIPS 2022
arXiv
3
citations
Bayesian Regret Minimization in Offline Bandits
ICML 2024
arXiv
0
citations