"audio-visual speech recognition" Papers
2 papers found
Conference
Zero-AVSR: Zero-Shot Audio-Visual Speech Recognition with LLMs by Learning Language-Agnostic Speech Representations
Jeong Hun Yeo, Minsu Kim, Chae Won Kim et al.
ICCV 2025arXiv:2503.06273
5
citations
Multichannel AV-wav2vec2: A Framework for Learning Multichannel Multi-Modal Speech Representation
Qiushi Zhu, Jie Zhang, Yu Gu et al.
AAAI 2024paperarXiv:2401.03468
15
citations