α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Fan Zhang
Fan Zhang
OpenReview
29
papers
2,165
total citations
papers (29)
VBench: Comprehensive Benchmark Suite for Video Generative Models
CVPR 2024
arXiv
1,072
citations
Generative Multimodal Models are In-Context Learners
CVPR 2024
arXiv
438
citations
LDMVFI: Video Frame Interpolation with Latent Diffusion Models
AAAI 2024
arXiv
97
citations
CapsFusion: Rethinking Image-Text Data at Scale
CVPR 2024
arXiv
91
citations
HiNeRV: Video Compression with Hierarchical Encoding-based Neural Representation
NEURIPS 2023
arXiv
82
citations
Unsupervised Instance Segmentation in Microscopy Images via Panoptic Domain Adaptation and Task Re-Weighting
CVPR 2020
arXiv
81
citations
HOT3D: Hand and Object Tracking in 3D from Egocentric Multi-View Videos
CVPR 2025
arXiv
56
citations
ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation
CVPR 2022
arXiv
56
citations
LTGC: Long-tail Recognition via Leveraging LLMs-driven Generated Content
CVPR 2024
arXiv
35
citations
Atlantis: Enabling Underwater Depth Estimation with Stable Diffusion
CVPR 2024
arXiv
32
citations
MDCS: More Diverse Experts with Consistency Self-distillation for Long-tailed Recognition
ICCV 2023
arXiv
31
citations
Distributionally Robust Local Non-parametric Conditional Estimation
NEURIPS 2020
arXiv
28
citations
PNVC: Towards Practical INR-based Video Compression
AAAI 2025
arXiv
14
citations
HIIF: Hierarchical Encoding based Implicit Image Function for Continuous Super-resolution
CVPR 2025
arXiv
13
citations
UltraFusion: Ultra High Dynamic Imaging using Exposure Fusion
CVPR 2025
arXiv
10
citations
ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models
NEURIPS 2025
arXiv
7
citations
SGTC: Semantic-Guided Triplet Co-training for Sparsely Annotated Semi-Supervised Medical Image Segmentation
AAAI 2025
arXiv
6
citations
GIViC: Generative Implicit Video Compression
ICCV 2025
arXiv
6
citations
Fine-grained Prototypical Voting with Heterogeneous Mixup for Semi-supervised 2D-3D Cross-modal Retrieval
CVPR 2024
4
citations
HumanSAM: Classifying Human-centric Forgery Videos in Human Spatial, Appearance, and Motion Anomaly
ICCV 2025
arXiv
3
citations
Blind Video Super-Resolution based on Implicit Kernels
ICCV 2025
arXiv
1
citations
CULTURE3D: A Large-Scale and Diverse Dataset of Cultural Landmarks and Terrains for Gaussian-Based Scene Rendering
ICCV 2025
arXiv
1
citations
AdaptiveAE: An Adaptive Exposure Strategy for HDR Capturing in Dynamic Scenes
ICCV 2025
arXiv
1
citations
Learning Temporal Consistency for Low Light Video Enhancement From Single Images
CVPR 2021
0
citations
Learning Rain Location Prior for Nighttime Deraining
ICCV 2023
0
citations
DREAM: Decoupled Discriminative Learning with Bigraph-aware Alignment for Semi-supervised 2D-3D Cross-modal Retrieval
AAAI 2025
0
citations
GauUpdate: New Object Insertion in 3D Gaussian Fields with Consistent Global Illumination
ICCV 2025
0
citations
OneGT: One-Shot Geometry-Texture Neural Rendering for Head Avatars
ICCV 2025
0
citations
Subspace Constraint and Contribution Estimation for Heterogeneous Federated Learning
CVPR 2025
0
citations