"vision token compression" Papers
3 papers found
Compress & Cache: Vision token compression for efficient generation and retrieval
Adrian Bulat, Yassine Ouali, Georgios Tzimiropoulos
NeurIPS 2025
HiRes-LLaVA: Restoring Fragmentation Input in High-Resolution Large Vision-Language Models
Runhui Huang, Xinpeng Ding, Chunwei Wang et al.
CVPR 2025, arXiv:2407.08706
15 citations
LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token
Shaolei Zhang, Qingkai Fang, Yang et al.
ICLR 2025, arXiv:2501.03895
117 citations