"cross-modal matching" Papers
7 papers found
Conference
Asymmetric Visual Semantic Embedding Framework for Efficient Vision-Language Alignment
Yang Liu, Mengyuan Liu, Shudong Huang et al.
AAAI 2025paperarXiv:2503.06974
6
citations
AVHBench: A Cross-Modal Hallucination Benchmark for Audio-Visual Large Language Models
Kim Sung-Bin, Oh Hyun-Bin, Lee Jung-Mok et al.
ICLR 2025arXiv:2410.18325
19
citations
CA-I2P: Channel-Adaptive Registration Network with Global Optimal Selection
Zhixin Cheng, Jiacheng Deng, Xinjun Li et al.
ICCV 2025arXiv:2506.21364
1
citations
Enhancing Partially Relevant Video Retrieval with Hyperbolic Learning
Jun Li, Jinpeng Wang, Chaolei Tan et al.
ICCV 2025arXiv:2507.17402
4
citations
Self-Supervised Spatial Correspondence Across Modalities
Ayush Shrivastava, Andrew Owens
CVPR 2025arXiv:2506.03148
2
citations
TrafficLoc: Localizing Traffic Surveillance Cameras in 3D Scenes
Yan Xia, Yunxiang Lu, Rui Song et al.
ICCV 2025arXiv:2412.10308
1
citations
Image-to-Image Matching via Foundation Models: A New Perspective for Open-Vocabulary Semantic Segmentation
Yuan Wang, Rui Sun, Naisong Luo et al.
CVPR 2024arXiv:2404.00262
25
citations