"object detection" Papers

132 papers found • Page 3 of 3

One Step Learning, One Step Review

Huang Xiaolong, Qiankun Li, Xueran Li et al.

AAAI 2024paperarXiv:2401.10962
2
citations

Open-Set Recognition in the Age of Vision-Language Models

Dimity Miller, Niko Suenderhauf, Alex Kenna et al.

ECCV 2024arXiv:2403.16528
10
citations

PAD: Patch-Agnostic Defense against Adversarial Patch Attacks

Lihua Jing, Rui Wang, Wenqi Ren et al.

CVPR 2024arXiv:2404.16452
41
citations

PapMOT: Exploring Adversarial Patch Attack against Multiple Object Tracking

Jiahuan Long, Tingsong Jiang, Wen Yao et al.

ECCV 2024arXiv:2504.09361
4
citations

PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects

Junyi Li, Junfeng Wu, Weizhi Zhao et al.

ECCV 2024arXiv:2407.16696
13
citations

Receptive Fields As Experts in Convolutional Neural Architectures

Dongze Lian, Weihao Yu, Xinchao Wang

ICML 2024

Referring Expression Counting

Siyang Dai, Jun Liu, Ngai-Man Cheung

CVPR 2024highlightarXiv:2505.22850
3
citations

Relation DETR: Exploring Explicit Position Relation Prior for Object Detection

Xiuquan Hou, Meiqin Liu, Senlin Zhang et al.

ECCV 2024arXiv:2407.11699
64
citations

Salience DETR: Enhancing Detection Transformer with Hierarchical Salience Filtering Refinement

Xiuquan Hou, Meiqin Liu, Senlin Zhang et al.

CVPR 2024arXiv:2403.16131
82
citations

Scene-Graph ViT: End-to-End Open-Vocabulary Visual Relationship Detection

Tim Salzmann, Markus Ryll, Alex Bewley et al.

ECCV 2024arXiv:2403.14270
8
citations

SCoRe: Submodular Combinatorial Representation Learning

Anay Majee, Suraj Kothawade, Krishnateja Killamsetty et al.

ICML 2024arXiv:2310.00165
5
citations

SDDGR: Stable Diffusion-based Deep Generative Replay for Class Incremental Object Detection

JUNSU KIM, Hoseong Cho, Jihyeon Kim et al.

CVPR 2024highlightarXiv:2402.17323
50
citations

Seeing Faces in Things: A Model and Dataset for Pareidolia

Mark T Hamilton, Simon Stent, Vasha G DuTell et al.

ECCV 2024arXiv:2409.16143
5
citations

Semantic-Aware Autoregressive Image Modeling for Visual Representation Learning

Kaiyou Song, Shan Zhang, Tong Wang

AAAI 2024paperarXiv:2312.10457
2
citations

Semantic-Aware Transformation-Invariant RoI Align

Guo-Ye Yang, Kiyohiro Nakayama, Zi-Kai Xiao et al.

AAAI 2024paperarXiv:2312.09609

SHViT: Single-Head Vision Transformer with Memory Efficient Macro Design

Seokju Yun, Youngmin Ro

CVPR 2024arXiv:2401.16456
102
citations

Simplifying Source-Free Domain Adaptation for Object Detection: Effective Self-Training Strategies and Performance Insights

Yan Hao, Florent Forest, Olga Fink

ECCV 2024arXiv:2407.07586
20
citations

SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-parameterized Batch Normalization

Jialong Guo, Xinghao Chen, Yehui Tang et al.

ICML 2024arXiv:2405.11582
34
citations

SlowTrack: Increasing the Latency of Camera-Based Perception in Autonomous Driving Using Adversarial Examples

Chen Ma, Ningfei Wang, Qi Alfred Chen et al.

AAAI 2024paperarXiv:2312.09520
38
citations

Sparse Cocktail: Every Sparse Pattern Every Sparse Ratio All At Once

Zhangheng Li, Shiwei Liu, Tianlong Chen et al.

ICML 2024

Task-Aware Encoder Control for Deep Video Compression

Xingtong Ge, Jixiang Luo, XINJIE ZHANG et al.

CVPR 2024arXiv:2404.04848
8
citations

Tensorial template matching for fast cross-correlation with rotations and its application for tomography

Antonio Martinez-Sanchez, Ulrike Homberg, J. M. Almira et al.

ECCV 2024arXiv:2408.02398
3
citations

Toward INT4 Fixed-Point Training via Exploring Quantization Error for Gradients

Dohyung Kim, Junghyup Lee, Jeimin Jeon et al.

ECCV 2024arXiv:2407.12637
2
citations

Towards Reliable Evaluation and Fast Training of Robust Semantic Segmentation Models

Francesco Croce, Naman D. Singh, Matthias Hein

ECCV 2024arXiv:2306.12941
12
citations

UniFS: Universal Few-shot Instance Perception with Point Representations

Sheng Jin, Ruijie Yao, Lumin Xu et al.

ECCV 2024arXiv:2404.19401
4
citations

Unsqueeze [CLS] Bottleneck to Learn Rich Representations

Qing Su, Shihao Ji

ECCV 2024arXiv:2407.17671

Visual Fact Checker: Enabling High-Fidelity Detailed Caption Generation

Yunhao Ge, Xiaohui Zeng, Jacob Huffman et al.

CVPR 2024arXiv:2404.19752
35
citations

Visual Transformer with Differentiable Channel Selection: An Information Bottleneck Inspired Approach

Yancheng Wang, Ping Li, Yingzhen Yang

ICML 2024

ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense Predictions

Chunlong Xia, Xinliang Wang, Feng Lv et al.

CVPR 2024highlightarXiv:2403.07392
133
citations

Weighting Pseudo-Labels via High-Activation Feature Index Similarity and Object Detection for Semi-Supervised Segmentation

Prantik Howlader, Hieu Le, Dimitris Samaras

ECCV 2024arXiv:2407.12630
3
citations

What How and When Should Object Detectors Update in Continually Changing Test Domains?

Jayeon Yoo, Dongkwan Lee, Inseop Chung et al.

CVPR 2024arXiv:2312.08875
16
citations

YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

Chien-Yao Wang, I-Hau Yeh, Hong-Yuan Mark Liao

ECCV 2024arXiv:2402.13616
3033
citations