"modality interaction" Papers
2 papers found
Conference
Hierarchical Cross-modal Prompt Learning for Vision-Language Models
Hao Zheng, Shunzhi Yang, Zhuoxin He et al.
ICCV 2025arXiv:2507.14976
6
citations
Query-centric Audio-Visual Cognition Network for Moment Retrieval, Segmentation and Step-Captioning
Yunbin Tu, Liang Li, Li Su et al.
AAAI 2025paperarXiv:2412.13543
1
citations