α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Yingya Zhang
Yingya Zhang
22
papers
1,274
total citations
papers (22)
VideoComposer: Compositional Video Synthesis with Motion Controllability
NEURIPS 2023
arXiv
466
citations
DreamVideo: Composing Your Dream Videos with Customized Subject and Motion
CVPR 2024
arXiv
158
citations
Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model
CVPR 2025
arXiv
110
citations
InstructVideo: Instructing Video Diffusion Models with Human Feedback
CVPR 2024
arXiv
83
citations
DecentLaM: Decentralized Momentum SGD for Large-Batch Deep Training
ICCV 2021
arXiv
69
citations
RLIPv2: Fast Scaling of Relational Language-Image Pre-Training
ICCV 2023
arXiv
63
citations
MoLo: Motion-Augmented Long-Short Contrastive Learning for Few-Shot Action Recognition
CVPR 2023
arXiv
61
citations
Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation
CVPR 2024
arXiv
57
citations
A Recipe for Scaling up Text-to-Video Generation with Text-free Videos
CVPR 2024
arXiv
55
citations
Revisiting Optimal Convergence Rate for Smooth and Non-convex Stochastic Decentralized Optimization
NEURIPS 2022
arXiv
31
citations
Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning
ICCV 2023
arXiv
31
citations
AE-NeRF: Audio Enhanced Neural Radiance Field for Few Shot Talking Head Synthesis
AAAI 2024
arXiv
23
citations
PersonalVideo: High ID-Fidelity Video Customization without Dynamic and Semantic Degradation
ICCV 2025
arXiv
19
citations
FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion
ICCV 2025
arXiv
19
citations
DreamRelation: Relation-Centric Video Customization
ICCV 2025
arXiv
17
citations
Enlarging Instance-Specific and Class-Specific Information for Open-Set Action Recognition
CVPR 2023
arXiv
5
citations
S^3D-NeRF: Single-Shot Speech-Driven Neural Radiance Field for High Fidelity Talking Head Synthesis
ECCV 2024
arXiv
5
citations
FreeMask: Rethinking the Importance of Attention Masks for Zero-Shot Video Editing
AAAI 2025
arXiv
2
citations
FaceComposer: A Unified Model for Versatile Facial Content Creation
NEURIPS 2023
0
citations
LipFormer: High-Fidelity and Generalizable Talking Face Generation With a Pre-Learned Facial Codebook
CVPR 2023
0
citations
Space-time Prompting for Video Class-incremental Learning
ICCV 2023
0
citations
Communication Efficient SGD via Gradient Sampling With Bayes Prior
CVPR 2021
0
citations