α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Ganqu Cui
Ganqu Cui
10
papers
1,185
total citations
papers (10)
RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback
CVPR 2024
arXiv
361
citations
ULTRAFEEDBACK: Boosting Language Models with Scaled AI Feedback
ICML 2024
arXiv
214
citations
Advancing LLM Reasoning Generalists with Preference Trees
ICLR 2025
arXiv
183
citations
Revisiting Out-of-distribution Robustness in NLP: Benchmarks, Analysis, and LLMs Evaluations
NEURIPS 2023
arXiv
134
citations
TTRL: Test-Time Reinforcement Learning
NEURIPS 2025
arXiv
129
citations
A Unified Evaluation of Textual Backdoor Learning: Frameworks and Benchmarks
NEURIPS 2022
arXiv
96
citations
RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness
CVPR 2025
arXiv
60
citations
Scaling Physical Reasoning with the PHYSICS Dataset
NEURIPS 2025
arXiv
6
citations
AIR: A Systematic Analysis of Annotations, Instructions, and Response Pairs in Preference Dataset
COLM 2025
arXiv
2
citations
Moderate-fitting as a Natural Backdoor Defender for Pre-trained Language Models
NEURIPS 2022
0
citations