Yinan He
OpenReview
16 papers | 3,900 total citations
papers (16)
VBench: Comprehensive Benchmark Suite for Video Generative Models | CVPR 2024 | arXiv | 1,072 citations
MVBench: A Comprehensive Multi-modal Video Understanding Benchmark | CVPR 2024 | arXiv | 902 citations
VideoMAE V2: Scaling Video Masked Autoencoders With Dual Masking | CVPR 2023 | arXiv | 557 citations
InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation | ICLR 2024 | arXiv | 419 citations
VideoMamba: State Space Model for Efficient Video Understanding | ECCV 2024 | arXiv | 407 citations
Unmasked Teacher: Towards Training-Efficient Video Foundation Models | ICCV 2023 | arXiv | 246 citations
ForgeryNet: A Versatile Benchmark for Comprehensive Forgery Analysis | CVPR 2021 | arXiv | 183 citations
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text | ICLR 2025 | arXiv | 49 citations
Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment | CVPR 2025 | arXiv | 20 citations
VideoChat-R1.5: Visual Test-Time Scaling to Reinforce Multimodal Reasoning by Iterative Perception | NeurIPS 2025 | arXiv | 16 citations
VRBench: A Benchmark for Multi-Step Reasoning in Long Narrative Videos | ICCV 2025 | arXiv | 9 citations
X-Learner: Learning Cross Sources and Tasks for Universal Visual Representation | ECCV 2022 | arXiv | 7 citations
ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models | NeurIPS 2025 | arXiv | 7 citations
DiffVSR: Revealing an Effective Recipe for Taming Robust Video Super-Resolution Against Complex Degradations | ICCV 2025 | arXiv | 5 citations
WISNet: Pseudo Label Generation on Unbalanced and Patch Annotated Waste Images | CVPR 2025 | 1 citation
UniFormerV2: Unlocking the Potential of Image ViTs for Video Understanding | ICCV 2023 | 0 citations