by Jiachen Zheng Papers
2 papers found
Conference
MaskGCT: Zero-Shot Text-to-Speech with Masked Generative Codec Transformer
Yuancheng Wang, Haoyue Zhan, Liwei Liu et al.
ICLR 2025arXiv:2409.00750
161
citations
Metis: A Foundation Speech Generation Model with Masked Generative Pre-training
Yuancheng Wang, Jiachen Zheng, Junan Zhang et al.
NEURIPS 2025arXiv:2502.03128
16
citations