α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Christoph Feichtenhofer
Christoph Feichtenhofer
29
papers
20,032
total citations
papers (29)
A ConvNet for the 2020s
CVPR 2022
arXiv
7,279
citations
SAM 2: Segment Anything in Images and Videos
ICLR 2025
arXiv
2,393
citations
Multiscale Vision Transformers
ICCV 2021
arXiv
1,529
citations
Ego4D: Around the World in 3,000 Hours of Egocentric Video
CVPR 2022
arXiv
1,511
citations
X3D: Expanding Architectures for Efficient Video Recognition
CVPR 2020
arXiv
1,240
citations
TrackFormer: Multi-Object Tracking With Transformers
CVPR 2022
arXiv
968
citations
MViTv2: Improved Multiscale Vision Transformers for Classification and Detection
CVPR 2022
arXiv
856
citations
Masked Feature Prediction for Self-Supervised Visual Pre-Training
CVPR 2022
arXiv
801
citations
Masked Autoencoders As Spatiotemporal Learners
NEURIPS 2022
arXiv
598
citations
Scaling Language-Image Pre-Training via Masking
CVPR 2023
arXiv
398
citations
Masked Autoencoders that Listen
NEURIPS 2022
arXiv
395
citations
Keeping Your Eye on the Ball: Trajectory Attention in Video Transformers
NEURIPS 2021
arXiv
342
citations
A Large-Scale Study on Unsupervised Spatiotemporal Representation Learning
CVPR 2021
arXiv
288
citations
MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition
CVPR 2022
arXiv
247
citations
Demystifying CLIP Data
ICLR 2024
arXiv
216
citations
Ego-Topo: Environment Affordances From Egocentric Video
CVPR 2020
arXiv
140
citations
Perception Encoder: The best visual embeddings are not at the output of the network
NEURIPS 2025
arXiv
129
citations
A Multigrid Method for Efficiently Training Video Models
CVPR 2020
arXiv
99
citations
Multiview Compressive Coding for 3D Reconstruction
CVPR 2023
arXiv
91
citations
The Effectiveness of MAE Pre-Pretraining for Billion-Scale Pretraining
ICCV 2023
arXiv
86
citations
MAViL: Masked Audio-Video Learners
NEURIPS 2023
arXiv
75
citations
Diffusion Models as Masked Autoencoders
ICCV 2023
arXiv
75
citations
Reversible Vision Transformers
CVPR 2022
arXiv
61
citations
Multiview Pseudo-Labeling for Semi-Supervised Learning From Video
ICCV 2021
arXiv
54
citations
On the Benefits of 3D Pose and Tracking for Human Action Recognition
CVPR 2023
arXiv
48
citations
PerceptionLM: Open-Access Data and Models for Detailed Visual Understanding
NEURIPS 2025
arXiv
47
citations
CiT: Curation in Training for Effective Vision-Language Data
ICCV 2023
arXiv
31
citations
Window Attention is Bugged: How not to Interpolate Position Embeddings
ICLR 2024
arXiv
19
citations
An Empirical Study of Autoregressive Pre-training from Videos
ICCV 2025
arXiv
16
citations