Highlight "large multimodal models" Papers
4 papers found
Conference
AIpparel: A Multimodal Foundation Model for Digital Garments
Kiyohiro Nakayama, Jan Ackermann, Timur Levent Kesdogan et al.
CVPR 2025highlightarXiv:2412.03937
5
citations
DyFo: A Training-Free Dynamic Focus Visual Search for Enhancing LMMs in Fine-Grained Visual Understanding
Geng Li, Jinglin Xu, Yunzhen Zhao et al.
CVPR 2025highlightarXiv:2504.14920
29
citations
LMM4LMM: Benchmarking and Evaluating Large-multimodal Image Generation with LMMs
Jiarui Wang, Huiyu Duan, Yu Zhao et al.
ICCV 2025highlightarXiv:2504.08358
16
citations
Monkey: Image Resolution and Text Label Are Important Things for Large Multi-modal Models
Zhang Li, Biao Yang, Qiang Liu et al.
CVPR 2024highlightarXiv:2311.06607
392
citations