Paper "multi-modal fusion" Papers
6 papers found
Conference
JoVALE: Detecting Human Actions in Video Using Audiovisual and Language Contexts
Taein Son, Soo Won Seo, Jisong Kim et al.
AAAI 2025paperarXiv:2412.13708
2
citations
RCTrans: Radar-Camera Transformer via Radar Densifier and Sequential Decoder for 3D Object Detection
Yiheng Li, Yang Yang, Zhen Lei
AAAI 2025paperarXiv:2412.12799
6
citations
A Multi-Modal Contrastive Diffusion Model for Therapeutic Peptide Generation
Yongkang Wang, Xuan Liu, Feng Huang et al.
AAAI 2024paperarXiv:2312.15665
27
citations
Beyond the Label Itself: Latent Labels Enhance Semi-supervised Point Cloud Panoptic Segmentation
Yujun Chen, Xin Tan, Zhizhong Zhang et al.
AAAI 2024paperarXiv:2312.08234
6
citations
Bridging the Gap between 2D and 3D Visual Question Answering: A Fusion Approach for 3D VQA
Wentao Mo, Yang Liu
AAAI 2024paperarXiv:2402.15933
26
citations
DrFuse: Learning Disentangled Representation for Clinical Multi-Modal Fusion with Missing Modality and Modal Inconsistency
Wenfang Yao, Kejing Yin, William Cheung et al.
AAAI 2024paperarXiv:2403.06197
56
citations