α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Rongjie Huang
Rongjie Huang
11
papers
269
total citations
papers (11)
WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling
ICLR 2025
arXiv
132
citations
GenerSpeech: Towards Style Transfer for Generalizable Out-Of-Domain Text-to-Speech
NEURIPS 2022
arXiv
36
citations
MixSpeech: Cross-Modality Self-Learning with Audio-Visual Stream Mixup for Visual Speech Translation and Recognition
ICCV 2023
arXiv
29
citations
FreeBind: Free Lunch in Unified Multimodal Space via Knowledge Fusion
ICML 2024
arXiv
19
citations
TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching
AAAI 2025
arXiv
18
citations
OmniSep: Unified Omni-Modality Sound Separation with Query-Mixup
ICLR 2025
arXiv
13
citations
OmniAudio: Generating Spatial Audio from 360-Degree Video
ICML 2025
arXiv
13
citations
Lumina-T2X: Scalable Flow-based Large Diffusion Transformer for Flexible Resolution Generation
ICLR 2025
9
citations
InstructSpeech: Following Speech Editing Instructions via Large Language Models
ICML 2024
0
citations
UniAudio: Towards Universal Audio Generation with Large Language Models
ICML 2024
0
citations
M4Singer: A Multi-Style, Multi-Singer and Musical Score Provided Mandarin Singing Corpus
NEURIPS 2022
0
citations