"auto-regressive decoding" Papers
3 papers found
Improving Model Representation and Reducing KV Cache via Skip Connections with First Value Heads
Zhoutong Wu, Yuan Zhang, Yiming Dong et al.
NeurIPS 2025, arXiv:2510.16807
Self-Introspective Decoding: Alleviating Hallucinations for Large Vision-Language Models
Fushuo Huo, Wenchao Xu, Zhong Zhang et al.
ICLR 2025, arXiv:2408.02032
68 citations
Self-Infilling Code Generation
Lin Zheng, Jianbo Yuan, Zhi Zhang et al.
ICML 2024, arXiv:2311.17972
5 citations