α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Kaihang Pan
Kaihang Pan
8
papers
309
total citations
papers (8)
AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea
CVPR 2025
arXiv
135
citations
Fine-tuning Multimodal LLMs to Follow Zero-shot Demonstrative Instructions
ICLR 2024
arXiv
90
citations
Auto-Encoding Morph-Tokens for Multimodal LLM
ICML 2024
arXiv
32
citations
Generative Multimodal Pretraining with Discrete Diffusion Timestep Tokens
CVPR 2025
arXiv
20
citations
STEP: Enhancing Video-LLMs’ Compositional Reasoning by Spatio-Temporal Graph-guided Self-Training
CVPR 2025
arXiv
15
citations
Janus-Pro-R1: Advancing Collaborative Visual Comprehension and Generation via Reinforcement Learning
NEURIPS 2025
arXiv
7
citations
Iris: Breaking GUI Complexity with Adaptive Focus and Self-Refining
ICCV 2025
arXiv
6
citations
What Limits Virtual Agent Application? OmniBench: A Scalable Multi-Dimensional Benchmark for Essential Virtual Agent Capabilities
ICML 2025
arXiv
4
citations