"video object segmentation" Papers

24 papers found

A Distractor-Aware Memory for Visual Object Tracking with SAM2

Alan Lukezic, Jovana Videnović, Matej Kristan

CVPR 2025, arXiv:2411.17576, 42 citations

Efficient Track Anything

Yunyang Xiong, Chong Zhou, Xiaoyu Xiang et al.

ICCV 2025, arXiv:2411.18933, 15 citations

Learning Spatial-Semantic Features for Robust Video Object Segmentation

Xin Li, Deshui Miao, Zhenyu He et al.

ICLR 2025, arXiv:2407.07760, 5 citations

M^3-VOS: Multi-Phase, Multi-Transition, and Multi-Scenery Video Object Segmentation

Zixuan Chen, Jiaxin Li, Junxuan Liang et al.

CVPR 2025, arXiv:2412.13803

Multi-Granularity Video Object Segmentation

Sangbeom Lim, Seongchan Kim, Seungjun An et al.

AAAI 2025, arXiv:2412.01471

ProGait: A Multi-Purpose Video Dataset and Benchmark for Transfemoral Prosthesis Users

Xiangyu Yin, Boyuan Yang, Weichen Liu et al.

ICCV 2025 (highlight), arXiv:2507.10223, 1 citation

Robust Ego-Exo Correspondence with Long-Term Memory

Yijun Hu, Bing Fan, Xin Gu et al.

NeurIPS 2025, arXiv:2510.11417, 1 citation

SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree

Shuangrui Ding, Rui Qian, Xiaoyi Dong et al.

ICCV 2025, arXiv:2410.16268, 56 citations

SMITE: Segment Me In TimE

Amirhossein Alimohammadi, Sauradip Nag, Saeid Asgari et al.

ICLR 2025, arXiv:2410.18538, 7 citations

Tracking and Understanding Object Transformations

Yihong Sun, Xinyu Yang, Jennifer Sun et al.

NeurIPS 2025 (oral), arXiv:2511.04678

ActionVOS: Actions as Prompts for Video Object Segmentation

Liangyang Ouyang, Ruicong Liu, Yifei Huang et al.

ECCV 2024, arXiv:2407.07402, 9 citations

Audio-Visual Segmentation via Unlabeled Frame Exploitation

Jinxiang Liu, Yikun Liu, Ferenas et al.

CVPR 2024, arXiv:2403.11074, 27 citations

Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation

Shuangrui Ding, Rui Qian, Haohang Xu et al.

ECCV 2024, arXiv:2311.17893, 11 citations

Fast Sprite Decomposition from Animated Graphics

Tomoyuki Suzuki, Kotaro Kikuchi, Kota Yamaguchi

ECCV 2024, arXiv:2408.03923, 3 citations

Improving Video Segmentation via Dynamic Anchor Queries

Yikang Zhou, Tao Zhang, Xiangtai Li et al.

ECCV 2024, arXiv:2404.00086, 20 citations

No More Shortcuts: Realizing the Potential of Temporal Self-Supervision

Ishan Rajendrakumar Dave, Simon Jenni, Mubarak Shah

AAAI 2024, arXiv:2312.13008, 12 citations

OneVOS: Unifying Video Object Segmentation with All-in-One Transformer Framework

Wanyun Li, Pinxue Guo, Xinyu Zhou et al.

ECCV 2024, arXiv:2403.08682, 11 citations

PSALM: Pixelwise Segmentation with Large Multi-modal Model

Zheng Zhang, YeYao Ma, Enming Zhang et al.

ECCV 2024, arXiv:2403.14598, 83 citations

Putting the Object Back into Video Object Segmentation

Ho Kei Cheng, Seoung Wug Oh, Brian Price et al.

CVPR 2024 (highlight), arXiv:2310.12982, 185 citations

Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation

Shilin Yan, Renrui Zhang, Ziyu Guo et al.

AAAI 2024, arXiv:2305.16318, 58 citations

RMem: Restricted Memory Banks Improve Video Object Segmentation

Junbao Zhou, Ziqi Pang, Yu-Xiong Wang

CVPR 2024, arXiv:2406.08476, 20 citations

Semi-supervised Active Learning for Video Action Detection

Ayush Singh, Aayush J Rana, Akash Kumar et al.

AAAI 2024, arXiv:2312.07169, 17 citations

Understanding Video Transformers via Universal Concept Discovery

Matthew Kowal, Achal Dave, Rares Andrei Ambrus et al.

CVPR 2024 (highlight), arXiv:2401.10831, 18 citations

VideoMAC: Video Masked Autoencoders Meet ConvNets

Gensheng Pei, Tao Chen, Xiruo Jiang et al.

CVPR 2024, arXiv:2402.19082, 21 citations