"instruction-tuning dataset" Papers
4 papers found
Conference
AIGI-Holmes: Towards Explainable and Generalizable AI-Generated Image Detection via Multimodal Large Language Models
Ziyin Zhou, Yunpeng Luo, Yuanchen Wu et al.
ICCV 2025arXiv:2507.02664
13
citations
Aligned Better, Listen Better for Audio-Visual Large Language Models
Yuxin Guo, Shuailei Ma, Shijie Ma et al.
ICLR 2025oralarXiv:2504.02061
9
citations
A Unified Framework for Motion Reasoning and Generation in Human Interaction
Jeongeun Park, Sungjoon Choi, Sangdoo Yun
ICCV 2025arXiv:2410.05628
2
citations
DynamicVL: Benchmarking Multimodal Large Language Models for Dynamic City Understanding
Weihao Xuan, Junjue Wang, Heli Qi et al.
NEURIPS 2025oralarXiv:2505.21076
10
citations