"audio generation" Papers
7 papers found
Conference
A Curious Case of the Missing Measure: Better Scores and Worse Generation
Joseph Turian, Jordie Shier
ICLR 2025
Addressing Representation Collapse in Vector Quantized Models with One Linear Layer
Yongxin Zhu, Bocheng Li, Yifei Xin et al.
ICCV 2025arXiv:2411.02038
49
citations
Towards A Translative Model of Sperm Whale Vocalization
Orr Paradise, Liangyuan Chen, Pranav Muralikrishnan et al.
NEURIPS 2025arXiv:2512.02206
1
citations
Action2Sound: Ambient-Aware Generation of Action Sounds from Egocentric Videos
Changan Chen, Puyuan Peng, Ami Baid et al.
ECCV 2024arXiv:2406.09272
21
citations
Audio Generation with Multiple Conditional Diffusion Model
Zhifang Guo, Jianguo Mao, Tao Rui et al.
AAAI 2024paperarXiv:2308.11940
32
citations
Fast Timing-Conditioned Latent Audio Diffusion
Zach Evans, CJ Carr, Josiah Taylor et al.
ICML 2024arXiv:2402.04825
199
citations
UniAudio: Towards Universal Audio Generation with Large Language Models
Dongchao Yang, Jinchuan Tian, Xu Tan et al.
ICML 2024