Spotlight "token compression" Papers
2 papers found
Conference
Towards Interpretable and Efficient Attention: Compressing All by Contracting a Few
Qishuai Wen, Zhiyuan Huang, Chun-Guang Li
NEURIPS 2025spotlightarXiv:2509.16875
1
citations
Vision-centric Token Compression in Large Language Model
Ling Xing, Alex Jinpeng Wang, Rui Yan et al.
NEURIPS 2025spotlightarXiv:2502.00791
11
citations