Poster by Fan Lai Papers
2 papers found
Conference
HyGen: Efficient LLM Serving via Elastic Online-Offline Request Co-location
Ting Sun, Penghan Wang, Fan Lai
NEURIPS 2025arXiv:2501.14808
7
citations
Inv-Entropy: A Fully Probabilistic Framework for Uncertainty Quantification in Language Models
Haoyi Song, Ruihan Ji, Naichen Shi et al.
NEURIPS 2025arXiv:2506.09684
2
citations