"zero-shot retrieval" Papers
10 papers found
Conference
Assessing and Learning Alignment of Unimodal Vision and Language Models
Le Zhang, Qian Yang, Aishwarya Agrawal
CVPR 2025highlightarXiv:2412.04616
15
citations
CAV-MAE Sync: Improving Contrastive Audio-Visual Mask Autoencoders via Fine-Grained Alignment
Edson Araujo, Andrew Rouditchenko, Yuan Gong et al.
CVPR 2025arXiv:2505.01237
2
citations
Find your Needle: Small Object Image Retrieval via Multi-Object Attention Optimization
Michael Green, Matan Levy, Issar Tzachor et al.
NEURIPS 2025arXiv:2503.07038
From Play to Replay: Composed Video Retrieval for Temporally Fine-Grained Videos
Animesh Gupta, Jay Parmar, Ishan Rajendrakumar Dave et al.
NEURIPS 2025oralarXiv:2506.05274
1
citations
MobileViCLIP: An Efficient Video-Text Model for Mobile Devices
Min Yang, Zihan Jia, Zhilin Dai et al.
ICCV 2025arXiv:2508.07312
WildSAT: Learning Satellite Image Representations from Wildlife Observations
Rangel Daroya, Elijah Cole, Oisin Mac Aodha et al.
ICCV 2025arXiv:2412.14428
10
citations
Composed Video Retrieval via Enriched Context and Discriminative Embeddings
Omkar Thawakar, Muzammal Naseer, Rao Anwer et al.
CVPR 2024arXiv:2403.16997
21
citations
Data Roaming and Quality Assessment for Composed Image Retrieval
Matan Levy, Rami Ben-Ari, Nir Darshan et al.
AAAI 2024paperarXiv:2303.09429
55
citations
Modeling Caption Diversity in Contrastive Vision-Language Pretraining
Samuel Lavoie, Polina Kirichenko, Mark Ibrahim et al.
ICML 2024arXiv:2405.00740
39
citations
STELLA: Continual Audio-Video Pre-training with SpatioTemporal Localized Alignment
Jaewoo Lee, Jaehong Yoon, Wonjae Kim et al.
ICML 2024oral