"vision-to-audio generation" Papers
2 papers found
Conference
From Vision to Audio and Beyond: A Unified Model for Audio-Visual Representation and Generation
Kun Su, Xiulong Liu, Eli Shlizerman
ICML 2024arXiv:2409.19132
17
citations
V2A-Mapper: A Lightweight Solution for Vision-to-Audio Generation by Connecting Foundation Models
Heng Wang, Jianbo Ma, Santiago Pascual et al.
AAAI 2024paperarXiv:2308.09300
75
citations