"vision-text alignment" Papers
3 papers found
Conference
Collaborative Vision-Text Representation Optimizing for Open-Vocabulary Segmentation
Siyu Jiao, hongguang Zhu, Yunchao Wei et al.
ECCV 2024arXiv:2408.00744
36
citations
Composed Video Retrieval via Enriched Context and Discriminative Embeddings
Omkar Thawakar, Muzammal Naseer, Rao Anwer et al.
CVPR 2024arXiv:2403.16997
21
citations
GalLop: Learning global and local prompts for vision-language models
Marc Lafon, Elias Ramzi, Clément Rambour et al.
ECCV 2024arXiv:2407.01400
39
citations