Highlight "audio description generation" Papers
2 papers found
Conference
DistinctAD: Distinctive Audio Description Generation in Contexts
Bo Fang, Wenhao Wu, Qiangqiang Wu et al.
CVPR 2025highlightarXiv:2411.18180
4
citations
MM-Narrator: Narrating Long-form Videos with Multimodal In-Context Learning
Chaoyi Zhang, Kevin Lin, Zhengyuan Yang et al.
CVPR 2024highlightarXiv:2311.17435
50
citations