α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Kunchang Li
Kunchang Li
OpenReview
15
papers
3,232
total citations
papers (15)
MVBench: A Comprehensive Multi-modal Video Understanding Benchmark
CVPR 2024
arXiv
902
citations
PointCLIP: Point Cloud Understanding by CLIP
CVPR 2022
arXiv
587
citations
Tip-Adapter: Training-Free Adaption of CLIP for Few-Shot Classification
ECCV 2022
arXiv
477
citations
InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation
ICLR 2024
arXiv
419
citations
VideoMamba: State Space Model for Efficient Video Understanding
ECCV 2024
arXiv
407
citations
Unmasked Teacher: Towards Training-Efficient Video Foundation Models
ICCV 2023
arXiv
246
citations
Vlogger: Make Your Dream A Vlog
CVPR 2024
arXiv
66
citations
Self-Slimmed Vision Transformer
ECCV 2022
arXiv
46
citations
MorphMLP: An Efficient MLP-Like Backbone for Spatial-Temporal Representation Learning
ECCV 2022
arXiv
35
citations
Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment
CVPR 2025
arXiv
20
citations
Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel
ICLR 2025
arXiv
10
citations
Make Your Training Flexible: Towards Deployment-Efficient Video Models
ICCV 2025
arXiv
6
citations
Muses: 3D-Controllable Image Generation via Multi-Modal Agent Collaboration
AAAI 2025
arXiv
6
citations
V-Stylist: Video Stylization via Collaboration and Reflection of MLLM Agents
CVPR 2025
arXiv
5
citations
UniFormerV2: Unlocking the Potential of Image ViTs for Video Understanding
ICCV 2023
0
citations