α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Xiaodong Cun
Xiaodong Cun
OpenReview
1
Affiliations
Affiliations
Great Bay University
29
papers
6,130
total citations
papers (29)
Uformer: A General U-Shaped Transformer for Image Restoration
CVPR 2022
arXiv
1,928
citations
Generating Human Motion From Textual Descriptions With Discrete Representations
CVPR 2023
arXiv
547
citations
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
CVPR 2024
arXiv
512
citations
FateZero: Fusing Attentions for Zero-shot Text-based Video Editing
ICCV 2023
arXiv
475
citations
SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
CVPR 2023
arXiv
414
citations
Follow Your Pose: Pose-Guided Text-to-Video Generation Using Pose-Free Videos
AAAI 2024
arXiv
284
citations
EvalCrafter: Benchmarking and Evaluating Large Video Generation Models
CVPR 2024
arXiv
248
citations
StyleHEAT: One-Shot High-Resolution Editable Talking Face Generation via Pre-trained StyleGAN
ECCV 2022
arXiv
215
citations
CodeTalker: Speech-Driven 3D Facial Animation With Discrete Motion Prior
CVPR 2023
arXiv
208
citations
Explicit Visual Prompting for Low-Level Structure Segmentations
CVPR 2023
arXiv
200
citations
DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos
CVPR 2025
arXiv
158
citations
SmartEdit: Exploring Complex Instruction-based Image Editing with Multimodal Large Language Models
CVPR 2024
arXiv
143
citations
ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models
ICLR 2024
arXiv
111
citations
DEIM: DETR with Improved Matching for Fast Convergence
CVPR 2025
arXiv
107
citations
Inserting Anybody in Diffusion Models via Celeb Basis
NEURIPS 2023
arXiv
72
citations
High-Resolution Document Shadow Removal via A Large-Scale Real-World Dataset and A Frequency-Aware Shadow Erasing Net
ICCV 2023
arXiv
65
citations
DPE: Disentanglement of Pose and Expression for General Video Portrait Editing
CVPR 2023
arXiv
58
citations
3D GAN Inversion With Facial Symmetry Prior
CVPR 2023
arXiv
57
citations
Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation
ECCV 2024
arXiv
51
citations
Spatial-Separated Curve Rendering Network for Efficient and High-Resolution Image Harmonization
ECCV 2022
arXiv
48
citations
DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation
CVPR 2025
arXiv
46
citations
LivelySpeaker: Towards Semantic-Aware Co-Speech Gesture Generation
ICCV 2023
arXiv
39
citations
Depth-aware Test-Time Training for Zero-shot Video Object Segmentation
CVPR 2024
arXiv
39
citations
Defocus Blur Detection via Depth Distillation
ECCV 2020
arXiv
31
citations
Make-Your-Anchor: A Diffusion-based 2D Avatar Generation Framework
CVPR 2024
arXiv
30
citations
X-Adapter: Adding Universal Compatibility of Plugins for Upgraded Diffusion Model
CVPR 2024
arXiv
18
citations
ToonTalker: Cross-Domain Face Reenactment
ICCV 2023
arXiv
12
citations
CustomTTT: Motion and Appearance Customized Video Generation via Test-Time Training
AAAI 2025
arXiv
11
citations
Noise Calibration: Plug-and-play Content-Preserving Video Enhancement using Pre-trained Video Diffusion Models
ECCV 2024
arXiv
3
citations