α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Juntao Dai
Juntao Dai
4
papers
87
total citations
papers (4)
Constrained Update Projection Approach to Safe Policy Optimization
NEURIPS 2022
arXiv
74
citations
Safe RLHF-V: Safe Reinforcement Learning from Multi-modal Human Feedback
NEURIPS 2025
arXiv
9
citations
Safe Reinforcement Learning using Finite-Horizon Gradient-based Estimation
ICML 2024
arXiv
3
citations
InterMT: Multi-Turn Interleaved Preference Alignment with Human Feedback
NEURIPS 2025
arXiv
1
citations