Xie Chen
9 papers, 294 total citations
Papers (9)
ELLA-V: Stable Neural Codec Language Modeling with Alignment-Guided Sequence Reordering
AAAI 2025, arXiv, 66 citations

UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding
AAAI 2024, arXiv, 59 citations

MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix
NeurIPS 2025, arXiv, 57 citations

Language Model Can Listen While Speaking
AAAI 2025, arXiv, 51 citations

BAT: Learning to Reason about Spatial Sounds with Large Language Models
ICML 2024, arXiv, 40 citations

Speech Recognition Meets Large Language Model: Benchmarking, Models, and Exploration
AAAI 2025, 12 citations

VQTalker: Towards Multilingual Talking Avatars Through Facial Motion Tokenization
AAAI 2025, arXiv, 8 citations

Bitrate-Controlled Diffusion for Disentangling Motion and Content in Video
ICCV 2025, arXiv, 1 citation

Word-Level Emotional Expression Control in Zero-Shot Text-to-Speech Synthesis
NeurIPS 2025, arXiv, 0 citations