Poster "audio question answering" Papers
3 papers found
Conference
MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix
Ziyang Ma, Yinghao Ma, Yanqiao Zhu et al.
NEURIPS 2025arXiv:2505.13032
57
citations
BAT: Learning to Reason about Spatial Sounds with Large Language Models
Zhisheng Zheng, Puyuan Peng, Ziyang Ma et al.
ICML 2024arXiv:2402.01591
40
citations
Listen, Think, and Understand
Yuan Gong, Hongyin Luo, Alexander Liu et al.
ICLR 2024arXiv:2305.10790
224
citations