α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Zhenheng Yang
Zhenheng Yang
10
papers
965
total citations
papers (10)
Show-o: One Single Transformer to Unify Multimodal Understanding and Generation
ICLR 2025
arXiv
483
citations
OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation
ICLR 2025
arXiv
207
citations
Show-o2: Improved Native Unified Multimodal Models
NEURIPS 2025
arXiv
106
citations
Long Context Tuning for Video Generation
ICCV 2025
arXiv
60
citations
Parallelized Autoregressive Visual Generation
CVPR 2025
arXiv
42
citations
STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution
ICCV 2025
arXiv
27
citations
Weakly Supervised Instance Segmentation for Videos With Temporal Mask Consistency
CVPR 2021
arXiv
25
citations
InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption
CVPR 2025
arXiv
14
citations
DiCo: Revitalizing ConvNets for Scalable and Efficient Diffusion Modeling
NEURIPS 2025
arXiv
1
citations
SPAN: Spatial Pyramid Attention Network for Image Manipulation Localization
ECCV 2020
0
citations