ResearchAlpha Leak

Conferences Topics Top Authors Rankings Browse All

Home/Authors/Yizhuo Li

Yizhuo Li

Topic trends: 32,543 papers · similarity ≥ 0.4 · year ≥ 2024 · Data sourced from Semantic Scholar

34,598 papers | Abstracts: 31,650 (91.5%) | Citations: 34,598 (100.0%) | arXiv: 26,074 (75.4%)

Built: Feb 14, 2026, 7:30 PM AMS

10

papers

2,080

total citations

papers (10)

MVBench: A Comprehensive Multi-modal Video Understanding Benchmark

InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation

TubeTK: Adopting Tubes to Track Multi-Object in a One-Step Training Model

Unmasked Teacher: Towards Training-Efficient Video Foundation Models

HOI Analysis: Integrating and Decomposing Human-Object Interaction

NEURIPS 2020arXiv

Test-Time Personalization with a Transformer for Human Pose Estimation

NEURIPS 2021arXiv

Moto: Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos

PGT: A Progressive Method for Training Models on Long Videos

Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation

UniFormerV2: Unlocking the Potential of Image ViTs for Video Understanding