"multimodal comprehension" Papers
4 papers found
GRAPHGPT-O: Synergistic Multimodal Comprehension and Generation on Graphs
Yi Fang, Bowen Jin, Jiacheng Shen et al.
CVPR 2025, arXiv:2502.11925
7 citations
SAR3D: Autoregressive 3D Object Generation and Understanding via Multi-scale 3D VQVAE
Yongwei Chen, Yushi Lan, Shangchen Zhou et al.
CVPR 2025, arXiv:2411.16856
23 citations
Auto-Encoding Morph-Tokens for Multimodal LLM
Kaihang Pan, Siliang Tang, Juncheng Li et al.
ICML 2024 (spotlight), arXiv:2405.01926
32 citations
Paying More Attention to Images: A Training-Free Method for Alleviating Hallucination in LVLMs
Shi Liu, Kecheng Zheng, Wei Chen
ECCV 2024, arXiv:2407.21771
134 citations