Cross-View Completion Models are Zero-shot Correspondence Estimators
arXiv:2412.09072
19 citations
#436 of 2873 papers in CVPR 2025
Abstract
In this work, we explore new perspectives on cross-view completion learning by drawing an analogy to self-supervised correspondence learning. Through our analysis, we demonstrate that the cross-attention map within cross-view completion models captures correspondence more effectively than other correlations derived from encoder or decoder features. We verify the effectiveness of the cross-attention map by evaluating it on zero-shot matching as well as on learning-based geometric matching and multi-frame depth estimation. The project page is available at https://cvlab-kaist.github.io/ZeroCo/.
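To make the central idea concrete, the following is a minimal sketch of how a cross-attention map can be read as a correspondence estimate. It is an illustration under stated assumptions, not the authors' implementation: the helper names (`cross_attention`, `matches_from_attention`), the toy one-hot tokens, and the argmax read-out are all hypothetical choices made for this example.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention(queries, keys):
    """Scaled dot-product attention weights (hypothetical helper).

    Returns one row per target token; each row is a distribution
    over source tokens.
    """
    scale = 1.0 / math.sqrt(len(keys[0]))
    return [
        softmax([scale * sum(a * b for a, b in zip(q, k)) for k in keys])
        for q in queries
    ]


def matches_from_attention(attn, source_width):
    """Assumed read-out: for each target token, pick the source token
    with the highest attention weight and return its (row, col)."""
    result = []
    for row in attn:
        best = max(range(len(row)), key=row.__getitem__)
        result.append(divmod(best, source_width))
    return result


# Toy data: a 2x2 source grid of one-hot tokens; the target view holds
# the same tokens in reversed order, so the true matching is known.
source_tokens = [
    [1.0, 0.0, 0.0, 0.0],
    [0.0, 1.0, 0.0, 0.0],
    [0.0, 0.0, 1.0, 0.0],
    [0.0, 0.0, 0.0, 1.0],
]
target_tokens = list(reversed(source_tokens))

attn = cross_attention(target_tokens, source_tokens)
matches = matches_from_attention(attn, source_width=2)
print(matches)  # [(1, 1), (1, 0), (0, 1), (0, 0)]
```

In this toy setting each target token attends most strongly to its identical source token, so the argmax recovers the reversed layout. In a real cross-view completion model the queries and keys would come from learned decoder projections rather than one-hot vectors.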
Citation History
Jan 25, 2026: 0
Jan 26, 2026: 0
Jan 26, 2026: 0
Jan 28, 2026: 0
Feb 13, 2026: 19 (+19)
Feb 13, 2026: 19
Feb 13, 2026: 19