α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Li Fei-Fei
Li Fei-Fei
21
papers
1,810
total citations
papers (21)
Action Genome: Actions As Compositions of Spatio-Temporal Scene Graphs
CVPR 2020
arXiv
393
citations
Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces
CVPR 2025
arXiv
371
citations
Rethinking Architecture Design for Tackling Data Heterogeneity in Federated Learning
CVPR 2022
arXiv
217
citations
Chain of Code: Reasoning with a Language Model-Augmented Code Emulator
ICML 2024
arXiv
135
citations
Procedure Planning in Instructional Videos
ECCV 2020
arXiv
115
citations
Greedy Hierarchical Variational Autoencoders for Large-Scale Video Prediction
CVPR 2021
arXiv
114
citations
ObjectFolder 2.0: A Multisensory Object Dataset for Sim2Real Transfer
CVPR 2022
arXiv
109
citations
ZeroNVS: Zero-Shot 360-Degree View Synthesis from a Single Image
CVPR 2024
arXiv
87
citations
The ObjectFolder Benchmark: Multisensory Learning With Neural and Real Objects
CVPR 2023
arXiv
52
citations
WorldScore: Unified Evaluation Benchmark for World Generation
ICCV 2025
46
citations
Re-thinking Temporal Search for Long-Form Video Understanding
CVPR 2025
arXiv
41
citations
Flow to the Mode: Mode-Seeking Diffusion Autoencoders for State-of-the-Art Image Tokenization
ICCV 2025
arXiv
27
citations
PrivHAR: Recognizing Human Actions from Privacy-Preserving Lens
ECCV 2022
arXiv
26
citations
Metadata Normalization
CVPR 2021
arXiv
21
citations
Rendering Humans from Object-Occluded Monocular Videos
ICCV 2023
arXiv
19
citations
The Language of Motion: Unifying Verbal and Non-verbal Language of 3D Human Motion
CVPR 2025
arXiv
18
citations
BEHAVIOR Vision Suite: Customizable Dataset Generation via Simulation
CVPR 2024
arXiv
15
citations
Repurposing 2D Diffusion Models with Gaussian Atlas for 3D Generation
ICCV 2025
arXiv
4
citations
Scalable Differential Privacy With Sparse Network Finetuning
CVPR 2021
0
citations
Revisiting the "Video" in Video-Language Understanding
CVPR 2022
0
citations
RubiksNet: Learnable 3D-Shift for Efficient Video Action Recognition
ECCV 2020
0
citations