Paper "multi-modal learning" Papers
9 papers found
Conference
Query-centric Audio-Visual Cognition Network for Moment Retrieval, Segmentation and Step-Captioning
Yunbin Tu, Liang Li, Li Su et al.
AAAI 2025paperarXiv:2412.13543
1
citations
AVSegFormer: Audio-Visual Segmentation with Transformer
Shengyi Gao, Zhe Chen, Guo Chen et al.
AAAI 2024paperarXiv:2307.01146
82
citations
COMMA: Co-articulated Multi-Modal Learning
Authors: Lianyu Hu, Liqing Gao, Zekang Liu et al.
AAAI 2024paperarXiv:2401.00268
7
citations
FedDAT: An Approach for Foundation Model Finetuning in Multi-Modal Heterogeneous Federated Learning
Haokun Chen, Yao Zhang, Denis Krompass et al.
AAAI 2024paperarXiv:2308.12305
86
citations
LAMM: Label Alignment for Multi-Modal Prompt Learning
Jingsheng Gao, Jiacheng Ruan, Suncheng Xiang et al.
AAAI 2024paperarXiv:2312.08212
30
citations
MESED: A Multi-Modal Entity Set Expansion Dataset with Fine-Grained Semantic Classes and Hard Negative Entities
Li Yangning, Tingwei Lu, Hai-Tao Zheng et al.
AAAI 2024paperarXiv:2307.14878
20
citations
MM-Point: Multi-View Information-Enhanced Multi-Modal Self-Supervised 3D Point Cloud Understanding
HaiTao Yu, Mofei Song
AAAI 2024paperarXiv:2402.10002
18
citations
Mono3DVG: 3D Visual Grounding in Monocular Images
Yangfan Zhan, Yuan Yuan, Zhitong Xiong
AAAI 2024paperarXiv:2312.08022
36
citations
Multi-Label Supervised Contrastive Learning
Pingyue Zhang, Mengyue Wu
AAAI 2024paperarXiv:2410.13439
1
citations