"modality gap reduction" Papers
2 papers found
Multimodal Hypothetical Summary for Retrieval-based Multi-image Question Answering
Peize Li, Qingyi Si, Peng Fu et al.
AAAI 2025, arXiv:2412.14880, 1 citation
Prompting Segmentation with Sound Is Generalizable Audio-Visual Source Localizer
Yaoting Wang, Liu Weisong, Guangyao Li et al.
AAAI 2024, arXiv:2309.07929, 38 citations