α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Yiwu Zhong
Yiwu Zhong
13
papers
2,753
total citations
papers (13)
Grounded Language-Image Pre-Training
CVPR 2022
arXiv
1,431
citations
RegionCLIP: Region-Based Language-Image Pretraining
CVPR 2022
arXiv
781
citations
Comprehensive Image Captioning via Scene Graph Decomposition
ECCV 2020
arXiv
140
citations
Towards Learning a Generalist Model for Embodied Navigation
CVPR 2024
arXiv
118
citations
Learning Concise and Descriptive Attributes for Visual Recognition
ICCV 2023
arXiv
88
citations
Learning To Generate Scene Graph From Natural Language Supervision
ICCV 2021
arXiv
87
citations
Learning Procedure-Aware Video Representation From Instructional Videos and Their Narrations
CVPR 2023
arXiv
48
citations
AIM: Adaptive Inference of Multi-Modal LLMs via Token Merging and Pruning
ICCV 2025
arXiv
24
citations
Towards Modern Image Manipulation Localization: A Large-Scale Dataset and Novel Methods
CVPR 2024
18
citations
Revisiting Tampered Scene Text Detection in the Era of Generative AI
AAAI 2025
arXiv
12
citations
Fine-grained Spatiotemporal Grounding on Egocentric Videos
ICCV 2025
arXiv
5
citations
PAVE: Patching and Adapting Video Large Language Models
CVPR 2025
arXiv
1
citations
A Simple Baseline for Weakly-Supervised Scene Graph Generation
ICCV 2021
0
citations