"zero-shot text-to-speech" Papers
3 papers found
Conference
SECodec: Structural Entropy-based Compressive Speech Representation Codec for Speech Language Models
Linqin Wang, Yaping Liu, Zhengtao Yu et al.
AAAI 2025paperarXiv:2501.00018
2
citations
TaDiCodec: Text-aware Diffusion Speech Tokenizer for Speech Language Modeling
Yuancheng Wang, Dekun Chen, Xueyao Zhang et al.
NEURIPS 2025arXiv:2508.16790
6
citations
Word-Level Emotional Expression Control in Zero-Shot Text-to-Speech Synthesis
Tianrui Wang, Haoyu Wang, Meng Ge et al.
NEURIPS 2025arXiv:2509.24629