"cross-modal generation" Papers
2 papers found
Conference
UniMuMo: Unified Text, Music, and Motion Generation
Han Yang, Kun Su, Yutong Zhang et al.
AAAI 2025, arXiv:2410.04534
12 citations
V2A-Mapper: A Lightweight Solution for Vision-to-Audio Generation by Connecting Foundation Models
Heng Wang, Jianbo Ma, Santiago Pascual et al.
AAAI 2024, arXiv:2308.09300
75 citations