Poster "dense prediction tasks" Papers

17 papers found

DAViD: Data-efficient and Accurate Vision Models from Synthetic Data

Fatemeh Saleh, Sadegh Aliakbarian, Charlie Hewitt et al.

ICCV 2025arXiv:2507.15365
3
citations

DiTASK: Multi-Task Fine-Tuning with Diffeomorphic Transformations

Krishna Sri Ipsit Mantri, Carola-Bibiane Schönlieb, Bruno Ribeiro et al.

CVPR 2025arXiv:2502.06029
5
citations

Exploring Structural Degradation in Dense Representations for Self-supervised Learning

Siran Dai, Qianqian Xu, Peisong Wen et al.

NEURIPS 2025arXiv:2510.17299
1
citations

Learning Yourself: Class-Incremental Semantic Segmentation with Language-Inspired Bootstrapped Disentanglement

Ruitao Wu, Yifan Zhao, Jia Li

ICCV 2025arXiv:2509.00527
1
citations

Multi-Task Dense Predictions via Unleashing the Power of Diffusion

Yuqi Yang, Peng-Tao Jiang, Qibin Hou et al.

ICLR 2025
6
citations

Refining CLIP's Spatial Awareness: A Visual-Centric Perspective

Congpei Qiu, Yanhao Wu, Wei Ke et al.

ICLR 2025arXiv:2504.02328
7
citations

Test-Time Adaptation of Vision-Language Models for Open-Vocabulary Semantic Segmentation

Mehrdad Noori, David OSOWIECHI, Gustavo Vargas Hakim et al.

NEURIPS 2025arXiv:2505.21844
4
citations

Unbiased Region-Language Alignment for Open-Vocabulary Dense Prediction

Yunheng Li, Yuxuan Li, Quan-Sheng Zeng et al.

ICCV 2025arXiv:2412.06244
6
citations

Denoising Vision Transformers

Jiawei Yang, Katie Luo, Jiefeng Li et al.

ECCV 2024arXiv:2401.02957
31
citations

Efficient Learning of Event-based Dense Representation using Hierarchical Memories with Adaptive Update

Uday Kamal, Saibal Mukhopadhyay

ECCV 2024
4
citations

Event Camera Data Dense Pre-training

Yan Yang, Liyuan Pan, Liu liu

ECCV 2024arXiv:2311.11533
15
citations

Exploiting Diffusion Prior for Generalizable Dense Prediction

Hsin-Ying Lee, Hung-Yu Tseng, Hsin-Ying Lee et al.

CVPR 2024arXiv:2311.18832
45
citations

Improving Visual Recognition with Hyperbolical Visual Hierarchy Mapping

Hyeongjun Kwon, Jinhyun Jang, Jin Kim et al.

CVPR 2024arXiv:2404.00974
10
citations

Removing Rows and Columns of Tokens in Vision Transformer enables Faster Dense Prediction without Retraining

Diwei Su, cheng fei, Jianxu Luo

ECCV 2024
2
citations

SILC: Improving Vision Language Pretraining with Self-Distillation

Muhammad Ferjad Naeem, Yongqin Xian, Xiaohua Zhai et al.

ECCV 2024arXiv:2310.13355
59
citations

Stitched ViTs are Flexible Vision Backbones

Zizheng Pan, Jing Liu, Haoyu He et al.

ECCV 2024arXiv:2307.00154
4
citations

Unsqueeze [CLS] Bottleneck to Learn Rich Representations

Qing Su, Shihao Ji

ECCV 2024arXiv:2407.17671