Poster Papers

24,624 papers found • Page 489 of 493

ViTamin: Designing Scalable Vision Models in the Vision-Language Era

Jieneng Chen, Qihang Yu, Xiaohui Shen et al.

CVPR 2024arXiv:2404.02132
51
citations

VITATECS: A Diagnostic Dataset for Temporal Concept Understanding of Video-Language Models

Shicheng Li, Lei Li, Yi Liu et al.

ECCV 2024arXiv:2311.17404
49
citations

ViT-Lens: Towards Omni-modal Representations

Stan Weixian Lei, Yixiao Ge, Kun Yi et al.

CVPR 2024arXiv:2311.16081
32
citations

VividDreamer: Invariant Score Distillation for Hyper-Realistic Text-to-3D Generation

Wenjie Zhuo, Fan Ma, Hehe Fan et al.

ECCV 2024arXiv:2407.09822
18
citations

VkD: Improving Knowledge Distillation using Orthogonal Projections

Roy Miles, Ismail Elezi, Jiankang Deng

CVPR 2024
24
citations

VLAD-BuFF: Burst-aware Fast Feature Aggregation for Visual Place Recognition

Ahmad Khaliq, Ming Xu, Stephen Hausler et al.

ECCV 2024arXiv:2409.19293
6
citations

Vlogger: Make Your Dream A Vlog

Shaobin Zhuang, Kunchang Li, Xinyuan Chen et al.

CVPR 2024arXiv:2401.09414
66
citations

VLP: Vision Language Planning for Autonomous Driving

Chenbin Pan, Burhan Yaman, Tommaso Nesti et al.

CVPR 2024arXiv:2401.05577
132
citations

VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models

Hyeonho Jeong, Geon Yeong Park, Jong Chul Ye

CVPR 2024arXiv:2312.00845
89
citations

VNN: Verification-Friendly Neural Networks with Hard Robustness Guarantees

Anahita Baninajjar, Ahmed Rezine, Amir Aminifar

ICML 2024arXiv:2312.09748
1
citations

VoCo: A Simple-yet-Effective Volume Contrastive Learning Framework for 3D Medical Image Analysis

Linshan Wu, Jia-Xin Zhuang, Hao Chen

CVPR 2024arXiv:2402.17300
76
citations

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

Hubert Siuzdak

ICLR 2024arXiv:2306.00814
192
citations

Volumetric Rendering with Baked Quadrature Fields

Gopal Sharma, Daniel Rebain, Kwang Moo Yi et al.

ECCV 2024arXiv:2312.02202
10
citations

VOODOO 3D: Volumetric Portrait Disentanglement For One-Shot 3D Head Reenactment

Phong Tran, Egor Zakharov, Long Nhat Ho et al.

CVPR 2024arXiv:2312.04651
32
citations

VoroNav: Voronoi-based Zero-shot Object Navigation with Large Language Model

Pengying Wu, Yao Mu, Bingxian Wu et al.

ICML 2024arXiv:2401.02695
69
citations

VP3D: Unleashing 2D Visual Prompt for Text-to-3D Generation

Yang Chen, Yingwei Pan, haibo yang et al.

CVPR 2024arXiv:2403.17001
30
citations

VP-SAM: Taming Segment Anything Model for Video Polyp Segmentation via Disentanglement and Spatio-temporal Side Network

Zhixue Fang, Yuzhi Liu, Huisi Wu et al.

ECCV 2024
2
citations

VQA-Diff: Exploiting VQA and Diffusion for Zero-Shot Image-to-3D Vehicle Asset Generation in Autonomous Driving

Yibo Liu, Zheyuan Yang, Guile Wu et al.

ECCV 2024arXiv:2407.06516
10
citations

VQDNA: Unleashing the Power of Vector Quantization for Multi-Species Genomic Sequence Modeling

Siyuan Li, Zedong Wang, Zicheng Liu et al.

ICML 2024arXiv:2405.10812
16
citations

VQGraph: Rethinking Graph Representation Space for Bridging GNNs and MLPs

Ling Yang, Ye Tian, Minkai Xu et al.

ICLR 2024arXiv:2308.02117
36
citations

VQ-HPS: Human Pose and Shape Estimation in a Vector-Quantized Latent Space

Guénolé Fiche, Simon Leglaive, Xavier Alameda-Pineda et al.

ECCV 2024arXiv:2312.08291
10
citations

VQ-TR: Vector Quantized Attention for Time Series Forecasting

Kashif Rasul, Andrew Bennett, Pablo Vicente et al.

ICLR 2024

VRetouchEr: Learning Cross-frame Feature Interdependence with Imperfection Flow for Face Retouching in Videos

Wen Xue, Le Jiang, Lianxin Xie et al.

CVPR 2024
1
citations

VRP-SAM: SAM with Visual Reference Prompt

Yanpeng Sun, Jiahui Chen, Shan Zhang et al.

CVPR 2024arXiv:2402.17726
87
citations

VSCode: General Visual Salient and Camouflaged Object Detection with 2D Prompt Learning

Ziyang Luo, Nian Liu, Wangbo Zhao et al.

CVPR 2024arXiv:2311.15011
107
citations

VSRD: Instance-Aware Volumetric Silhouette Rendering for Weakly Supervised 3D Object Detection

Zihua Liu, Hiroki Sakuma, Masatoshi Okutomi

CVPR 2024arXiv:2404.00149
5
citations

VS: Reconstructing Clothed 3D Human from Single Image via Vertex Shift

Leyuan Liu, Yuhan Li, Yunqi Gao et al.

CVPR 2024

VSViG: Real-time Video-based Seizure Detection via Skeleton-based Spatiotemporal ViG

Yankun Xu, Junzhe Wang, Yun-Hsuan Chen et al.

ECCV 2024arXiv:2311.14775
5
citations

VTQA: Visual Text Question Answering via Entity Alignment and Cross-Media Reasoning

Kang Chen, Xiangqian Wu

CVPR 2024arXiv:2303.02635
20
citations

V-Trans4Style: Visual Transition Recommendation for Video Production Style Adaptation

Pooja Guhan, Tsung-Wei Huang, Guan-Ming Su et al.

ECCV 2024arXiv:2501.07983

Walker: Self-supervised Multiple Object Tracking by Walking on Temporal Object Appearance Graphs

Mattia Segu, Luigi Piccinelli, Siyuan Li et al.

ECCV 2024
3
citations

WALT3D: Generating Realistic Training Data from Time-Lapse Imagery for Reconstructing Dynamic Objects Under Occlusion

Khiem Vuong, N. Dinesh Reddy, Robert Tamburo et al.

CVPR 2024arXiv:2403.19022
3
citations

WANDR: Intention-guided Human Motion Generation

Markos Diomataris, Nikos Athanasiou, Omid Taheri et al.

CVPR 2024arXiv:2404.15383
29
citations

WARM: On the Benefits of Weight Averaged Reward Models

Alexandre Rame, Nino Vieillard, Léonard Hussenot et al.

ICML 2024

WAS: Dataset and Methods for Artistic Text Segmentation

Xudong Xie, Yuzhe Li, Yang Liu et al.

ECCV 2024arXiv:2408.00106
3
citations

Wasserstein Wormhole: Scalable Optimal Transport Distance with Transformer

Doron Haviv, Russell Kunes, Thomas Dougherty et al.

ICML 2024arXiv:2404.09411
11
citations

WaSt-3D: Wasserstein-2 Distance for Scene-to-Scene Stylization on 3D Gaussians

Dmytro Kotovenko, Olga Grebenkova, Nikolaos Sarafianos et al.

ECCV 2024arXiv:2409.17917
6
citations

Watching it in Dark: A Target-aware Representation Learning Framework for High-Level Vision Tasks in Low Illumination

Yunan LI, Yihao Zhang, Shoude Li et al.

ECCV 2024
6
citations

Watch Your Steps: Local Image and Scene Editing by Text Instructions

Ashkan Mirzaei, Tristan T Aumentado-Armstrong, Marcus A Brubaker et al.

ECCV 2024arXiv:2308.08947
49
citations

WateRF: Robust Watermarks in Radiance Fields for Protection of Copyrights

Youngdong Jang, Dong In Lee, MinHyuk Jang et al.

CVPR 2024arXiv:2405.02066
25
citations

Watermark-embedded Adversarial Examples for Copyright Protection against Diffusion Models

Peifei Zhu, Tsubasa Takahashi, Hirokatsu Kataoka

CVPR 2024arXiv:2404.09401
34
citations

Watermarks in the Sand: Impossibility of Strong Watermarking for Language Models

Hanlin Zhang, Benjamin Edelman, Danilo Francati et al.

ICML 2024

Watermark Stealing in Large Language Models

Nikola Jovanović, Robin Staab, Martin Vechev

ICML 2024arXiv:2402.19361
75
citations

WaveFace: Authentic Face Restoration with Efficient Frequency Recovery

Yunqi Miao, Jiankang Deng, Jungong Han

CVPR 2024arXiv:2403.12760
18
citations

Wavelength-Embedding-guided Filter-Array Transformer for Spectral Demosaicing

haijin zeng, Hiep Luong, Wilfried Philips

ECCV 2024
1
citations

Wavelet-based Fourier Information Interaction with Frequency Diffusion Adjustment for Underwater Image Restoration

Chen Zhao, Weiling Cai, Chenyu Dong et al.

CVPR 2024arXiv:2311.16845
147
citations

Wavelet Convolutions for Large Receptive Fields

Shahaf Finder, Roy Amoyal, Eran Treister et al.

ECCV 2024arXiv:2407.05848
348
citations

WaveMo: Learning Wavefront Modulations to See Through Scattering

Mingyang Xie, Haiyun Guo, Brandon Y. Feng et al.

CVPR 2024arXiv:2404.07985
8
citations

WAVES: Benchmarking the Robustness of Image Watermarks

Bang An, Mucong Ding, Tahseen Rabbani et al.

ICML 2024arXiv:2401.08573
72
citations

WAVE: Warping DDIM Inversion Features for Zero-shot Text-to-Video Editing

Yutang Feng, Sicheng Gao, Yuxiang Bao et al.

ECCV 2024