Minghe Gao
7 papers, 163 total citations
Papers (7)
Fine-tuning Multimodal LLMs to Follow Zero-shot Demonstrative Instructions
ICLR 2024, arXiv, 90 citations

Gradient-Regulated Meta-Prompt Learning for Generalizable Vision-Language Models
ICCV 2023, arXiv, 32 citations

STEP: Enhancing Video-LLMs’ Compositional Reasoning by Spatio-Temporal Graph-guided Self-Training
CVPR 2025, arXiv, 15 citations

Benchmarking Multimodal CoT Reward Model Stepwise by Visual Program
ICCV 2025, arXiv, 10 citations

Iris: Breaking GUI Complexity with Adaptive Focus and Self-Refining
ICCV 2025, arXiv, 6 citations

Boosting Virtual Agent Learning and Reasoning: A Step-Wise, Multi-Dimensional, and Generalist Reward Model with Benchmark
ICML 2025, arXiv, 6 citations

What Limits Virtual Agent Application? OmniBench: A Scalable Multi-Dimensional Benchmark for Essential Virtual Agent Capabilities
ICML 2025, arXiv, 4 citations