Ming Ding
h-index: 23
Papers: 12
Total citations: 4,523

Papers (12)
CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer
ICLR 2025; arXiv; 1,409 citations

CogView: Mastering Text-to-Image Generation via Transformers
NeurIPS 2021; arXiv; 934 citations

ImageReward: Learning and Evaluating Human Preferences for Text-to-Image Generation
NeurIPS 2023; arXiv; 803 citations

CogAgent: A Visual Language Model for GUI Agents
CVPR 2024; arXiv; 629 citations

CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers
NeurIPS 2022; arXiv; 402 citations

LVBench: An Extreme Long Video Understanding Benchmark
ICCV 2025; arXiv; 229 citations

VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents
ICLR 2025; arXiv; 70 citations

CogCoM: A Visual Language Model with Chain-of-Manipulations Reasoning
ICLR 2025; arXiv; 36 citations

Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer
ECCV 2024; arXiv; 11 citations

UFC-BERT: Unifying Multi-Modal Controls for Conditional Image Synthesis
NeurIPS 2021; 0 citations

Adaptive Diffusion in Graph Neural Networks
NeurIPS 2021; 0 citations

CogLTX: Applying BERT to Long Texts
NeurIPS 2020; 0 citations