"parallel inference" Papers
2 papers found
Conference
Accelerating Parallel Diffusion Model Serving with Residual Compression
Jiajun Luo, Yicheng Xiao, Jianru Xu et al.
NEURIPS 2025oralarXiv:2507.17511
Generating, Fast and Slow: Scalable Parallel Video Generation with Video Interface Networks
Bhishma Dedhia, David Bourgin, Krishna Kumar Singh et al.
ICCV 2025arXiv:2503.17539
1
citations