Oral "instruction-tuning dataset" Papers
2 papers found
Conference
Aligned Better, Listen Better for Audio-Visual Large Language Models
Yuxin Guo, Shuailei Ma, Shijie Ma et al.
ICLR 2025oralarXiv:2504.02061
9
citations
DynamicVL: Benchmarking Multimodal Large Language Models for Dynamic City Understanding
Weihao Xuan, Junjue Wang, Heli Qi et al.
NEURIPS 2025oralarXiv:2505.21076
10
citations