Jeongsoo Choi
8 papers
169 total citations
Papers (8)
Watch or Listen: Robust Audio-Visual Speech Recognition With Visual Corruption Modeling and Reliability Scoring
CVPR 2023, arXiv, 54 citations
DiffV2S: Diffusion-Based Video-to-Speech Synthesis with Vision-Guided Speaker Embedding
ICCV 2023, arXiv, 32 citations
Lip Reading for Low-resource Languages by Learning and Combining General Speech Knowledge and Language-specific Knowledge
ICCV 2023, arXiv, 28 citations
ARLON: Boosting Diffusion Transformers with Autoregressive Models for Long Video Generation
ICLR 2025, arXiv, 28 citations
AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech Representation
CVPR 2024, arXiv, 16 citations
VoiceCraft-Dub: Automated Video Dubbing with Neural Codec Language Models
ICCV 2025, arXiv, 6 citations
From Faces to Voices: Learning Hierarchical Representations for High-quality Video-to-Speech
CVPR 2025, arXiv, 5 citations
MAVFlow: Preserving Paralinguistic Elements with Conditional Flow Matching for Zero-Shot AV2AV Multilingual Translation
ICCV 2025, arXiv, 0 citations