Poster "text-to-audio generation" Papers
4 papers found
Video-Guided Foley Sound Generation with Multimodal Controls
Ziyang Chen, Prem Seetharaman, Bryan Russell et al.
CVPR 2025, arXiv:2411.17698
40 citations
VinTAGe: Joint Video and Text Conditioning for Holistic Audio Generation
Saksham Singh Kushwaha, Yapeng Tian
CVPR 2025, arXiv:2412.10768
12 citations
Bespoke Non-Stationary Solvers for Fast Sampling of Diffusion and Flow Models
Neta Shaul, Uriel Singer, Ricky T. Q. Chen et al.
ICML 2024, arXiv:2403.01329
6 citations
Creative Text-to-Audio Generation via Synthesizer Programming
Manuel Cherep, Nikhil Singh, Jessica Shand
ICML 2024, arXiv:2406.00294
10 citations