"text-to-speech" Papers
4 papers found
Conference
FaceSpeak: Expressive and High-Quality Speech Synthesis from Human Portraits of Different Styles
Tian-Hao Zhang, Jiawei Zhang, Jun Wang et al.
AAAI 2025paperarXiv:2501.03181
2
citations
ProsodyFM: Unsupervised Phrasing and Intonation Control for Intelligible Speech Synthesis
Xiangheng He, Junjie Chen, Zixing Zhang et al.
AAAI 2025paperarXiv:2412.11795
1
citations
VITA-Audio: Fast Interleaved Audio-Text Token Generation for Efficient Large Speech-Language Model
Zuwei Long, Yunhang Shen, Chaoyou Fu et al.
NEURIPS 2025
17
citations
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models
Zeqian Ju, Yuancheng Wang, Kai Shen et al.
ICML 2024arXiv:2403.03100
306
citations