α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Wei-Ning Hsu
Wei-Ning Hsu
9
papers
875
total citations
papers (9)
Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale
NEURIPS 2023
arXiv
443
citations
Unsupervised Speech Recognition
NEURIPS 2021
arXiv
293
citations
u-HuBERT: Unified Mixed-Modal Speech Pretraining And Zero-Shot Transfer to Unlabeled Modality
NEURIPS 2022
arXiv
52
citations
DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning
NEURIPS 2023
arXiv
37
citations
Action2Sound: Ambient-Aware Generation of Action Sounds from Egocentric Videos
ECCV 2024
arXiv
21
citations
FlowDec: A flow-based full-band general audio codec with high perceptual quality
ICLR 2025
arXiv
16
citations
MusicFlow: Cascaded Flow Matching for Text Guided Music Generation
ICML 2024
arXiv
13
citations
Scaling Speech Technology to 1,000+ Languages
ICML 2024
0
citations
ReVISE: Self-Supervised Speech Resynthesis With Visual Input for Universal and Generalized Speech Regeneration
CVPR 2023
0
citations