DoubleTake: Geometry Guided Depth Estimation

10 citations

arXiv:2406.18387

#1066 of 2387 papers in ECCV 2024

Top Authors

Mohamed Sayed, Filippo Aleotti, Jamie Watson, Zawar Qureshi, Guillermo Garcia-Hernando, Gabriel Brostow, Sara Vicente, Michael Firman

Abstract

Estimating depth from a sequence of posed RGB images is a fundamental computer vision task, with applications such as augmented reality and path planning. Prior work typically makes use of previous frames in a multi-view stereo framework, relying on matching textures in a local neighborhood. In contrast, our model leverages historical predictions by giving the latest 3D geometry data as an extra input to our network. This self-generated geometric hint can encode information from areas of the scene not covered by the keyframes, and it is more regularized than the individual predicted depth maps for previous frames. We introduce a Hint MLP which combines cost volume features with a hint of the prior geometry, rendered as a depth map from the current camera location, together with a measure of the confidence in the prior geometry. We demonstrate that our method, which can run at interactive speeds, achieves state-of-the-art estimates of depth and 3D scene reconstruction in both offline and incremental evaluation scenarios.
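
To make the Hint MLP idea in the abstract concrete, here is a minimal sketch of how cost volume features, a rendered depth hint, and a confidence value might be fused for one pixel and one depth hypothesis. It is not the authors' implementation. The names `hint_mlp_score` and `init_params`, the choice of a signed depth gap as the hint feature, the single hidden layer, and the random weights are all assumptions made for illustration; the abstract specifies only which inputs are combined.

```python
import random


def linear(x, weights, bias):
    """Dense layer; weights holds one row per output unit."""
    return [sum(w * v for w, v in zip(row, x)) + b
            for row, b in zip(weights, bias)]


def hint_mlp_score(match_feats, plane_depth, hint_depth, hint_conf, params):
    """Score one depth hypothesis at one pixel (illustrative sketch).

    match_feats: cost volume features for this pixel and hypothesis
    plane_depth: depth of the hypothesis being scored
    hint_depth:  prior geometry rendered as depth from the current camera,
                 or None where no prior geometry is visible
    hint_conf:   confidence in the prior geometry, in [0, 1]
    """
    if hint_depth is None or hint_conf <= 0.0:
        # No usable hint: the score depends on the matching features alone.
        gap, conf = 0.0, 0.0
    else:
        # Assumed hint feature: signed gap between hypothesis and hint.
        gap, conf = plane_depth - hint_depth, hint_conf
    x = list(match_feats) + [gap, conf]
    hidden = [max(0.0, h) for h in linear(x, params["w1"], params["b1"])]  # ReLU
    return linear(hidden, params["w2"], params["b2"])[0]


def init_params(n_feats, n_hidden=8, seed=0):
    """Random placeholder weights; a real model would learn these."""
    rng = random.Random(seed)

    def rand(rows, cols):
        return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]

    return {
        "w1": rand(n_hidden, n_feats + 2),  # +2 inputs: depth gap and confidence
        "b1": [0.0] * n_hidden,
        "w2": rand(1, n_hidden),
        "b2": [0.0],
    }


params = init_params(n_feats=4)
feats = [0.1, -0.2, 0.3, 0.05]
score = hint_mlp_score(feats, plane_depth=2.0, hint_depth=2.5, hint_conf=0.9, params=params)
```

Running the same function over every depth hypothesis at a pixel would give a fused score per hypothesis. Where the confidence is zero, the sketch ignores the hint, which mirrors the abstract's point that the hint comes with a measure of how far the prior geometry can be trusted.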
