α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Souradip Chakraborty
Souradip Chakraborty
7
papers
197
total citations
papers (7)
MaxMin-RLHF: Alignment with Diverse Human Preferences
ICML 2024
arXiv
88
citations
PARL: A Unified Framework for Policy Alignment in Reinforcement Learning from Human Feedback
ICLR 2024
arXiv
38
citations
Does Thinking More Always Help? Mirage of Test-Time Scaling in Reasoning Models
NEURIPS 2025
arXiv
24
citations
Can Watermarking Large Language Models Prevent Copyrighted Text Generation and Hide Training Data?
AAAI 2025
arXiv
20
citations
Immune: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment
CVPR 2025
arXiv
18
citations
Rethinking Adversarial Policies: A Generalized Attack Formulation and Provable Defense in RL
ICLR 2024
arXiv
9
citations
Position: On the Possibilities of AI-Generated Text Detection
ICML 2024
0
citations