α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Rui Qian
Rui Qian
22
papers
3,678
total citations
papers (22)
Simple Copy-Paste Is a Strong Data Augmentation Method for Instance Segmentation
CVPR 2021
arXiv
1,178
citations
VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text
NEURIPS 2021
arXiv
689
citations
Spatiotemporal Contrastive Video Representation Learning
CVPR 2021
arXiv
560
citations
End-to-End Pseudo-LiDAR for Image-Based 3D Object Detection
CVPR 2020
arXiv
189
citations
Multiple Sound Sources Localization from Coarse to Fine
ECCV 2020
arXiv
179
citations
Taming Diffusion Models for Audio-Driven Co-Speech Gesture Generation
CVPR 2023
arXiv
157
citations
Discriminative Sounding Objects Localization via Self-supervised Audiovisual Matching
NEURIPS 2020
arXiv
149
citations
Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation
CVPR 2022
arXiv
135
citations
VideoPrism: A Foundational Visual Encoder for Video Understanding
ICML 2024
arXiv
73
citations
Motion-Aware Contrastive Video Representation Learning via Foreground-Background Merging
CVPR 2022
arXiv
63
citations
SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree
ICCV 2025
arXiv
56
citations
Enhancing Self-Supervised Video Representation Learning via Multi-Level Feature Optimization
ICCV 2021
arXiv
43
citations
OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding?
CVPR 2025
arXiv
40
citations
Dispider: Enabling Video LLMs with Active Real-Time Interaction via Disentangled Perception, Decision, and Reaction
CVPR 2025
arXiv
36
citations
Prune Spatio-temporal Tokens by Semantic-aware Temporal Accumulation
ICCV 2023
arXiv
27
citations
Static and Dynamic Concepts for Self-Supervised Video Representation Learning
ECCV 2022
arXiv
27
citations
Semantics Meets Temporal Correspondence: Self-supervised Object-centric Learning in Videos
ICCV 2023
arXiv
21
citations
Contextualized Spatio-Temporal Contrastive Learning With Self-Supervision
CVPR 2022
arXiv
18
citations
Exploring Fine-Grained Audiovisual Categorization with the SSW60 Dataset
ECCV 2022
arXiv
15
citations
Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation
ECCV 2024
arXiv
11
citations
Rethinking Image-to-Video Adaptation: An Object-centric Perspective
ECCV 2024
arXiv
8
citations
Reasoning to Attend: Try to Understand How <SEG> Token Works
CVPR 2025
arXiv
4
citations