by Ziqiang Liu Papers
2 papers found
Conference
DEEM: Diffusion models serve as the eyes of large language models for image perception
Run Luo, Yunshui Li, Longze Chen et al.
ICLR 2025arXiv:2405.15232
34
citations
VCM: Vision Concept Modeling with Adaptive Vision Token Compression via Instruction Fine-Tuning
Run Luo, Renke Shan, Longze Chen et al.
NEURIPS 2025