"multimodal large models" Papers
5 papers found
Conference
Enhancing Document Understanding with Group Position Embedding: A Novel Approach to Incorporate Layout Information
Yuke Zhu, Yue Zhang, Dongdong Liu et al.
ICLR 2025
2
citations
InterMT: Multi-Turn Interleaved Preference Alignment with Human Feedback
Boyuan Chen, Donghai Hong, Jiaming Ji et al.
NEURIPS 2025spotlightarXiv:2505.23950
1
citations
NAUTILUS: A Large Multimodal Model for Underwater Scene Understanding
Wei Xu, Cheng Wang, Dingkang Liang et al.
NEURIPS 2025arXiv:2510.27481
2
citations
OpenMMEgo: Enhancing Egocentric Understanding for LMMs with Open Weights and Data
Hao Luo, Zihao Yue, Wanpeng Zhang et al.
NEURIPS 2025oral
Towards General Visual-Linguistic Face Forgery Detection
Ke Sun, Shen Chen, Taiping Yao et al.
CVPR 2025arXiv:2307.16545
37
citations