α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Ming Yan
Ming Yan
19
papers
1,314
total citations
papers (19)
mPLUG-Owl2: Revolutionizing Multi-modal Large Language Model with Modality Collaboration
CVPR 2024
arXiv
614
citations
FedRolex: Model-Heterogeneous Federated Learning with Rolling Sub-Model Extraction
NEURIPS 2022
arXiv
209
citations
Hallucination Augmented Contrastive Learning for Multimodal Large Language Model
CVPR 2024
arXiv
121
citations
Shifting More Attention to Visual Backbone: Query-Modulated Refinement Networks for End-to-End Visual Grounding
CVPR 2022
arXiv
93
citations
HiTeA: Hierarchical Temporal-Aware Video-Language Pre-training
ICCV 2023
arXiv
92
citations
WritingBench: A Comprehensive Benchmark for Generative Writing
NEURIPS 2025
arXiv
46
citations
Communication-Efficient Topologies for Decentralized Learning with $O(1)$ Consensus Rate
NEURIPS 2022
arXiv
43
citations
CIMI4D: A Large Multimodal Climbing Motion Dataset Under Human-Scene Interactions
CVPR 2023
arXiv
26
citations
ErrorCompensatedX: error compensation for variance reduced algorithms
NEURIPS 2021
arXiv
11
citations
Improved Visual Fine-tuning with Natural Language Supervision
ICCV 2023
arXiv
10
citations
BUS: Efficient and Effective Vision-Language Pre-Training with Bottom-Up Patch Summarization.
ICCV 2023
arXiv
9
citations
RELI11D: A Comprehensive Multimodal Human Motion Dataset and Method
CVPR 2024
arXiv
9
citations
Learning Trajectory-Word Alignments for Video-Language Tasks
ICCV 2023
arXiv
7
citations
SymDPO: Boosting In-Context Learning of Large Multimodal Models with Symbol Demonstration Direct Preference Optimization
CVPR 2025
arXiv
7
citations
AdaMMS: Model Merging for Heterogeneous Multimodal Large Language Models with Unsupervised Coefficient Optimization
CVPR 2025
arXiv
6
citations
TiMix: Text-Aware Image Mixing for Effective Vision-Language Pre-training
AAAI 2024
arXiv
6
citations
RoDA: Robust Domain Alignment for Cross-Domain Retrieval Against Label Noise
AAAI 2025
5
citations
DiDA: Disambiguated Domain Alignment for Cross-Domain Retrieval with Partial Labels
AAAI 2024
0
citations
ClimbingCap: Multi-Modal Dataset and Method for Rock Climbing in World Coordinate
CVPR 2025
arXiv
0
citations