α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Wenyi Hong
Wenyi Hong
Google Scholar
OpenReview
15
h-index
9
papers
3,747
total citations
papers (9)
CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer
ICLR 2025
arXiv
1,409
citations
CogView: Mastering Text-to-Image Generation via Transformers
NEURIPS 2021
arXiv
934
citations
CogAgent: A Visual Language Model for GUI Agents
CVPR 2024
arXiv
629
citations
CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers
NEURIPS 2022
arXiv
402
citations
LVBench: An Extreme Long Video Understanding Benchmark
ICCV 2025
arXiv
229
citations
VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents
ICLR 2025
arXiv
70
citations
CogCoM: A Visual Language Model with Chain-of-Manipulations Reasoning
ICLR 2025
arXiv
36
citations
MotionBench: Benchmarking and Improving Fine-grained Video Motion Understanding for Vision Language Models
CVPR 2025
arXiv
27
citations
Inf-DiT: Upsampling any-resolution image with memory-efficient diffusion transformer.
ECCV 2024
arXiv
11
citations