"cross-modal fusion" Papers
6 papers found
Conference
Complementary Advantages: Exploiting Cross-Field Frequency Correlation for NIR-Assisted Image Denoising
Yuchen Wang, Hongyuan Wang, Lizhi Wang et al.
CVPR 2025arXiv:2412.16645
5
citations
Exploring Historical Information for RGBE Visual Tracking with Mamba
Chuanyu Sun, Jiqing Zhang, Yang Wang et al.
CVPR 2025
7
citations
Learnable Retrieval Enhanced Visual-Text Alignment and Fusion for Radiology Report Generation
Qin Zhou, Guoyan Liang, Xindi Li et al.
ICCV 2025arXiv:2507.07568
RAD: Towards Trustworthy Retrieval-Augmented Multi-modal Clinical Diagnosis
Haolin Li, Tianjie Dai, Zhe Chen et al.
NEURIPS 2025arXiv:2509.19980
Multi-modal Crowd Counting via a Broker Modality
Haoliang Meng, Xiaopeng Hong, Chenhao Wang et al.
ECCV 2024arXiv:2407.07518
12
citations
Sketch2Vox: Learning 3D Reconstruction from a Single Monocular Sketch Image
Fei Wang
ECCV 2024
3
citations