ResearchAlpha Leak

Conferences Topics Top Authors Rankings Browse All

Home/Authors/Wei-Ning Hsu

Wei-Ning Hsu

Topic trends: 32,543 papers · similarity ≥ 0.4 · year ≥ 2024 · Data sourced from Semantic Scholar

34,598 papers | Abstracts: 31,650 (91.5%) | Citations: 34,598 (100.0%) | arXiv: 26,074 (75.4%)

Built: Feb 14, 2026, 11:22 PM AMS

9

papers

875

total citations

papers (9)

Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale

NEURIPS 2023arXiv

Unsupervised Speech Recognition

NEURIPS 2021arXiv

u-HuBERT: Unified Mixed-Modal Speech Pretraining And Zero-Shot Transfer to Unlabeled Modality

NEURIPS 2022arXiv

DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning

NEURIPS 2023arXiv

Action2Sound: Ambient-Aware Generation of Action Sounds from Egocentric Videos

FlowDec: A flow-based full-band general audio codec with high perceptual quality

MusicFlow: Cascaded Flow Matching for Text Guided Music Generation

Scaling Speech Technology to 1,000+ Languages

ReVISE: Self-Supervised Speech Resynthesis With Visual Input for Universal and Generalized Speech Regeneration