α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Amrit Singh Bedi
Amrit Singh Bedi
8
papers
287
total citations
papers (8)
Variational Policy Gradient Method for Reinforcement Learning with General Utilities
NEURIPS 2020
arXiv
154
citations
MaxMin-RLHF: Alignment with Diverse Human Preferences
ICML 2024
arXiv
88
citations
Immune: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment
CVPR 2025
arXiv
18
citations
On the Sample Complexity and Metastability of Heavy-tailed Policy Search in Continuous Control
ICML 2024
arXiv
18
citations
PIPER: Primitive-Informed Preference-based Hierarchical Reinforcement Learning via Hindsight Relabeling
ICML 2024
arXiv
5
citations
Towards Global Optimality for Practical Average Reward Reinforcement Learning without Mixing Time Oracles
ICML 2024
arXiv
4
citations
Position: On the Possibilities of AI-Generated Text Detection
ICML 2024
0
citations
Closing the Gap: Achieving Global Convergence (Last Iterate) of Actor-Critic under Markovian Sampling with Neural Network Parametrization
ICML 2024
arXiv
0
citations