Donghai Hong
3 papers | 17 total citations
Papers (3)
Safe RLHF-V: Safe Reinforcement Learning from Multi-modal Human Feedback
NeurIPS 2025 | arXiv | 9 citations
Generative RLHF-V: Learning Principles from Multi-modal Human Preference
NeurIPS 2025 | arXiv | 7 citations
InterMT: Multi-Turn Interleaved Preference Alignment with Human Feedback
NeurIPS 2025 | arXiv | 1 citation