Xu Tan
Affiliations (1): Microsoft Research Asia
18 papers
3,792 total citations
Papers (18)
MPNet: Masked and Permuted Pre-training for Language Understanding
NEURIPS 2020
arXiv
1,506
citations
HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face
NEURIPS 2023
arXiv
1,267
citations
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models
ICML 2024
arXiv
306
citations
Semi-Supervised Neural Architecture Search
NEURIPS 2020
arXiv
98
citations
AUDIT: Audio Editing by Following Instructions with Latent Diffusion Models
NEURIPS 2023
arXiv
93
citations
Museformer: Transformer with Fine- and Coarse-Grained Attention for Music Generation
NEURIPS 2022
arXiv
81
citations
BinauralGrad: A Two-Stage Conditional Diffusion Probabilistic Model for Binaural Audio Synthesis
NEURIPS 2022
arXiv
77
citations
FastCorrect: Fast Error Correction with Edit Alignment for Automatic Speech Recognition
NEURIPS 2021
arXiv
76
citations
Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
AAAI 2025
arXiv
75
citations
PromptTTS 2: Describing and Generating Voices with Text Prompt
ICLR 2024
arXiv
73
citations
GAIA: Zero-shot Talking Avatar Generation
ICLR 2024
arXiv
46
citations
VidMuse: A Simple Video-to-Music Generation Framework with Long-Short-Term Modeling
CVPR 2025
arXiv
32
citations
HiFace: High-Fidelity 3D Face Reconstruction by Learning Static and Dynamic Details
ICCV 2023
arXiv
31
citations
InstructAvatar: Text-Guided Emotion and Motion Control for Avatar Generation
AAAI 2025
arXiv
20
citations
Transcormer: Transformer for Sentence Scoring with Sliding Language Modeling
NEURIPS 2022
arXiv
11
citations
The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation
ICCV 2025
arXiv
0
citations
UniAudio: Towards Universal Audio Generation with Large Language Models
ICML 2024
0
citations
Speech-T: Transducer for Text to Speech and Beyond
NEURIPS 2021
0
citations