α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Yizhuo Li
Yizhuo Li
OpenReview
10
papers
2,080
total citations
papers (10)
MVBench: A Comprehensive Multi-modal Video Understanding Benchmark
CVPR 2024
arXiv
902
citations
InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation
ICLR 2024
arXiv
419
citations
TubeTK: Adopting Tubes to Track Multi-Object in a One-Step Training Model
CVPR 2020
arXiv
266
citations
Unmasked Teacher: Towards Training-Efficient Video Foundation Models
ICCV 2023
arXiv
246
citations
HOI Analysis: Integrating and Decomposing Human-Object Interaction
NEURIPS 2020
arXiv
148
citations
Test-Time Personalization with a Transformer for Human Pose Estimation
NEURIPS 2021
arXiv
55
citations
Moto: Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos
ICCV 2025
arXiv
22
citations
PGT: A Progressive Method for Training Models on Long Videos
CVPR 2021
arXiv
13
citations
Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation
CVPR 2025
arXiv
9
citations
UniFormerV2: Unlocking the Potential of Image ViTs for Video Understanding
ICCV 2023
0
citations