Yishay Mansour
28 papers, 311 total citations
papers (28)
Prediction with Corrupted Expert Advice (NEURIPS 2020, arXiv): 43 citations
Differentially Private Multi-Armed Bandits in the Shuffle Model (NEURIPS 2021, arXiv): 34 citations
Minimax Regret for Stochastic Shortest Path (NEURIPS 2021, arXiv): 31 citations
Sample Complexity of Uniform Convergence for Multicalibration (NEURIPS 2020, arXiv): 30 citations
Near-Optimal Regret for Adversarial MDP with Delayed Bandit Feedback (NEURIPS 2022, arXiv): 25 citations
Benign Underfitting of Stochastic Gradient Descent (NEURIPS 2022, arXiv): 22 citations
Principal-Agent Reward Shaping in MDPs (AAAI 2024, arXiv): 19 citations
Private Learning of Halfspaces: Simplifying the Construction and Reducing the Sample Complexity (NEURIPS 2020, arXiv): 17 citations
Oracle-Efficient Regret Minimization in Factored MDPs with Unknown Structure (NEURIPS 2021, arXiv): 13 citations
Optimal Rates for Random Order Online Optimization (NEURIPS 2021, arXiv): 10 citations
Rate-Optimal Policy Optimization for Linear Markov Decision Processes (ICML 2024, arXiv): 9 citations
Reinforcement Learning with Feedback Graphs (NEURIPS 2020, arXiv): 9 citations
Eluder-based Regret for Stochastic Contextual MDPs (ICML 2024, arXiv): 8 citations
Multiclass Boosting: Simple and Intuitive Weak Learning Criteria (NEURIPS 2023, arXiv): 8 citations
Fair Wrapping for Black-box Predictions (NEURIPS 2022, arXiv): 7 citations
ROI Maximization in Stochastic Online Decision-Making (NEURIPS 2021, arXiv): 6 citations
Eliciting User Preferences for Personalized Multi-Objective Decision Making through Comparative Feedback (NEURIPS 2023, arXiv): 6 citations
Delay as Payoff in MAB (AAAI 2025, arXiv): 4 citations
Probably Approximately Precision and Recall Learning (NEURIPS 2025, arXiv): 3 citations
Batch Ensemble for Variance Dependent Regret in Stochastic Bandits (AAAI 2025, arXiv): 3 citations
Regret Bounds for Adversarial Contextual Bandits with General Function Approximation and Delayed Feedback (NEURIPS 2025, arXiv): 2 citations
Convergence of Policy Mirror Descent Beyond Compatible Function Approximation (ICML 2025, arXiv): 1 citation
Dueling Bandits with Team Comparisons (NEURIPS 2021, arXiv): 1 citation
Adversarially Robust Streaming Algorithms via Differential Privacy (NEURIPS 2020): 0 citations
Black-Box Differential Privacy for Interactive ML (NEURIPS 2023): 0 citations
Finding Safe Zones of Markov Decision Processes Policies (NEURIPS 2023): 0 citations
A Characterization of Semi-Supervised Adversarially Robust PAC Learnability (NEURIPS 2022): 0 citations
Agnostic Reinforcement Learning with Low-Rank MDPs and Rich Observations (NEURIPS 2021): 0 citations