Region-Based Representations Revisited

14 citations
arXiv:2402.02352
#1558 of 2716 papers in CVPR 2024
10 Top Authors
4 Data Points

Abstract

We investigate whether region-based representations are effective for recognition. Regions were once a mainstay of recognition approaches, but pixel- and patch-based features are now used almost exclusively. We show that recent class-agnostic segmenters like SAM can be effectively combined with strong unsupervised representations like DINOv2 and used for a wide variety of tasks, including semantic segmentation, object-based image retrieval, and multi-image analysis. Once the masks and features are extracted, these representations, even with linear decoders, enable competitive performance, making them well suited to applications that require custom queries. The compactness of the representation also makes it well suited to video analysis and other problems requiring inference across many images.
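
The pipeline the abstract describes (masks from a segmenter, dense features from a pretrained encoder, one feature vector per region, then a linear decoder) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. It assumes that region features are obtained by averaging a dense feature map inside each mask, which the abstract does not specify, and it uses synthetic arrays in place of real SAM masks and DINOv2 features. The function names `pool_region_features`, `linear_decode`, and `paint_labels` are hypothetical.

```python
import numpy as np


def pool_region_features(features, masks):
    """Average a dense feature map inside each binary mask.

    features: (H, W, D) float array, standing in for upsampled encoder
              features (assumption: real features would come from DINOv2).
    masks:    (R, H, W) boolean array, standing in for segmenter output
              (assumption: real masks would come from SAM).
    Returns an (R, D) array with one feature vector per region.
    """
    h, w, d = features.shape
    flat_feats = features.reshape(h * w, d)
    flat_masks = masks.reshape(masks.shape[0], h * w).astype(features.dtype)
    areas = flat_masks.sum(axis=1, keepdims=True)
    # Clamp the area to 1 so an empty mask yields a zero vector, not NaN.
    return (flat_masks @ flat_feats) / np.maximum(areas, 1.0)


def linear_decode(region_feats, weights, bias):
    """Score each region with a linear classifier and return class indices."""
    return (region_feats @ weights + bias).argmax(axis=1)


def paint_labels(masks, region_labels, background=-1):
    """Write each region's label back onto the pixel grid.

    Pixels covered by no mask keep the background value; where masks
    overlap, the later mask in the array wins.
    """
    out = np.full(masks.shape[1:], background, dtype=int)
    for mask, label in zip(masks, region_labels):
        out[mask] = label
    return out


# Synthetic 4x4 image: the left half carries feature [1, 0], the right [0, 1].
features = np.zeros((4, 4, 2))
features[:, :2, 0] = 1.0
features[:, 2:, 1] = 1.0

# Two hand-made masks covering the left and right halves.
masks = np.zeros((2, 4, 4), dtype=bool)
masks[0, :, :2] = True
masks[1, :, 2:] = True

region_feats = pool_region_features(features, masks)
# Identity weights are a placeholder for a trained linear decoder.
labels = linear_decode(region_feats, np.eye(2), np.zeros(2))
segmentation = paint_labels(masks, labels)
```

In this sketch the image is reduced to an (R, D) matrix with one row per region, so any later query (a new linear classifier, or a similarity search against a query vector) operates on R rows rather than on every pixel or patch. That is one way to read the abstract's point about compactness and custom queries.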

Citation History

Jan 27, 2026: 14
Feb 7, 2026: 14
Feb 13, 2026: 14
Feb 13, 2026: 14