"open-vocabulary tasks" Papers
3 papers found
Conference
DINOv2 Meets Text: A Unified Framework for Image- and Pixel-Level Vision-Language Alignment
Dahyun Kang, Piotr Bojanowski, Huy V. Vo et al.
CVPR 2025arXiv:2412.16334
46
citations
ResCLIP: Residual Attention for Training-free Dense Vision-language Inference
Jinhong Deng, Yuhang Yang, Wen Li et al.
CVPR 2025arXiv:2411.15851
11
citations
Open-Vocabulary Calibration for Fine-tuned CLIP
Shuoyuan Wang, Jindong Wang, Guoqing Wang et al.
ICML 2024arXiv:2402.04655
14
citations