"large multi-modality models" Papers
3 papers found
Conference
Q-Align: Teaching LMMs for Visual Scoring via Discrete Text-Defined Levels
Haoning Wu, Zicheng Zhang, Weixia Zhang et al.
ICML 2024arXiv:2312.17090
393
citations
Towards Open-ended Visual Quality Comparison
Haoning Wu, Hanwei Zhu, Zicheng Zhang et al.
ECCV 2024arXiv:2402.16641
95
citations
Using Left and Right Brains Together: Towards Vision and Language Planning
Jun CEN, Chenfei Wu, Xiao Liu et al.
ICML 2024arXiv:2402.10534
11
citations