ResearchAlpha Leak

Conferences Topics Top Authors Rankings Browse All

Home/Authors/Jeongsoo Choi

Jeongsoo Choi

Topic trends: 32,543 papers · similarity ≥ 0.4 · year ≥ 2024 · Data sourced from Semantic Scholar

34,598 papers | Abstracts: 31,650 (91.5%) | Citations: 34,598 (100.0%) | arXiv: 26,074 (75.4%)

Built: Feb 15, 2026, 3:13 AM AMS

8

papers

169

total citations

papers (8)

Watch or Listen: Robust Audio-Visual Speech Recognition With Visual Corruption Modeling and Reliability Scoring

DiffV2S: Diffusion-Based Video-to-Speech Synthesis with Vision-Guided Speaker Embedding

Lip Reading for Low-resource Languages by Learning and Combining General Speech Knowledge and Language-specific Knowledge

ARLON: Boosting Diffusion Transformers with Autoregressive Models for Long Video Generation

AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech Representation

VoiceCraft-Dub: Automated Video Dubbing with Neural Codec Language Models

From Faces to Voices: Learning Hierarchical Representations for High-quality Video-to-Speech

MAVFlow: Preserving Paralinguistic Elements with Conditional Flow Matching for Zero-Shot AV2AV Multilingual Translation