"audio description generation" Papers
4 papers found
Conference
Contextual AD Narration with Interleaved Multimodal Sequence
Hanlin Wang, Zhan Tong, Kecheng Zheng et al.
CVPR 2025arXiv:2403.12922
8
citations
DistinctAD: Distinctive Audio Description Generation in Contexts
Bo Fang, Wenhao Wu, Qiangqiang Wu et al.
CVPR 2025highlightarXiv:2411.18180
4
citations
Shot-by-Shot: Film-Grammar-Aware Training-Free Audio Description Generation
Junyu Xie, Tengda Han, Max Bain et al.
ICCV 2025arXiv:2504.01020
3
citations
MM-Narrator: Narrating Long-form Videos with Multimodal In-Context Learning
Chaoyi Zhang, Kevin Lin, Zhengyuan Yang et al.
CVPR 2024highlightarXiv:2311.17435
50
citations