Yapeng Tian
24
papers
2,179
total citations
Papers (24)
TDAN: Temporally-Deformable Alignment Network for Video Super-Resolution
CVPR 2020
arXiv
614
citations
DiffIR: Efficient Diffusion Model for Image Restoration
ICCV 2023
arXiv
357
citations
Learning To Answer Questions in Dynamic Audio-Visual Scenarios
CVPR 2022
arXiv
221
citations
Unified Multisensory Perception: Weakly-Supervised Audio-Visual Video Parsing
ECCV 2020
arXiv
213
citations
Zooming Slow-Mo: Fast and Accurate One-Stage Space-Time Video Super-Resolution
CVPR 2020
arXiv
198
citations
Transformer-Empowered Multi-Scale Contextual Matching and Aggregation for Multi-Contrast MRI Super-Resolution
CVPR 2022
arXiv
94
citations
Cyclic Co-Learning of Sounding Object Visual Grounding and Sound Separation
CVPR 2021
arXiv
92
citations
Audio-Visual Grouping Network for Sound Localization From Mixtures
CVPR 2023
arXiv
64
citations
AV-NeRF: Learning Neural Fields for Real-World Audio-Visual Scene Synthesis
NeurIPS 2023
arXiv
60
citations
Egocentric Audio-Visual Object Localization
CVPR 2023
arXiv
48
citations
Can Audio-Visual Integration Strengthen Robustness Under Multimodal Attacks?
CVPR 2021
arXiv
41
citations
Audio-Visual Class-Incremental Learning
ICCV 2023
arXiv
35
citations
Class-Incremental Grouping Network for Continual Audio-Visual Learning
ICCV 2023
arXiv
31
citations
Structured Sparsity Learning for Efficient Video Super-Resolution
CVPR 2023
arXiv
25
citations
Disentangled Counterfactual Learning for Physical Audiovisual Commonsense Reasoning
NeurIPS 2023
arXiv
23
citations
T-VSL: Text-Guided Visual Sound Source Localization in Mixtures
CVPR 2024
arXiv
22
citations
VinTAGe: Joint Video and Text Conditioning for Holistic Audio Generation
CVPR 2025
arXiv
12
citations
Motion-Grounded Video Reasoning: Understanding and Perceiving Motion at Pixel Level
CVPR 2025
arXiv
11
citations
Efficient Self-Improvement in Multimodal Large Language Models: A Model-Level Judge-Free Approach
COLM 2025
arXiv
8
citations
Learning Spatio-Temporal Downsampling for Effective Video Upscaling
ECCV 2022
arXiv
7
citations
PRVQL: Progressive Knowledge-guided Refinement for Robust Egocentric Visual Query Localization
ICCV 2025
arXiv
3
citations
ZFusion: Efficient Deep Compositional Zero-shot Learning for Blind Image Super-Resolution with Generative Diffusion Prior
ICCV 2025
0
citations
Multi-modal Grouping Network for Weakly-Supervised Audio-Visual Video Parsing
NeurIPS 2022
0
citations
Video Matting via Consistency-Regularized Graph Neural Networks
ICCV 2021
0
citations