α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Ji Zhang
Ji Zhang
OpenReview
1
Affiliations
Affiliations
Alibaba
14
papers
1,099
total citations
papers (14)
mPLUG-Owl2: Revolutionizing Multi-modal Large Language Model with Modality Collaboration
CVPR 2024
arXiv
614
citations
Hallucination Augmented Contrastive Learning for Multimodal Large Language Model
CVPR 2024
arXiv
121
citations
Shifting More Attention to Visual Backbone: Query-Modulated Refinement Networks for End-to-End Visual Grounding
CVPR 2022
arXiv
93
citations
HiTeA: Hierarchical Temporal-Aware Video-Language Pre-training
ICCV 2023
arXiv
92
citations
DePT: Decoupled Prompt Tuning
CVPR 2024
arXiv
62
citations
SubT-MRS Dataset: Pushing SLAM Towards All-weather Environments
CVPR 2024
arXiv
52
citations
DETA: Denoised Task Adaptation for Few-Shot Learning
ICCV 2023
arXiv
28
citations
Skip Tuning: Pre-trained Vision-Language Models are Effective and Efficient Adapters Themselves
CVPR 2025
arXiv
9
citations
A Simple yet Effective Layout Token in Large Language Models for Document Understanding
CVPR 2025
arXiv
7
citations
SymDPO: Boosting In-Context Learning of Large Multimodal Models with Symbol Demonstration Direct Preference Optimization
CVPR 2025
arXiv
7
citations
AdaMMS: Model Merging for Heterogeneous Multimodal Large Language Models with Unsupervised Coefficient Optimization
CVPR 2025
arXiv
6
citations
TiMix: Text-Aware Image Mixing for Effective Vision-Language Pre-training
AAAI 2024
arXiv
6
citations
MOSAIC: Generating Consistent, Privacy-Preserving Scenes from Multiple Depth Views in Multi-Room Environments
ICCV 2025
arXiv
2
citations
Accurate Few-Shot Object Detection With Support-Query Mutual Guidance and Hybrid Loss
CVPR 2021
0
citations