α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Dima Damen
Dima Damen
21
papers
2,982
total citations
papers (21)
Ego4D: Around the World in 3,000 Hours of Egocentric Video
CVPR 2022
arXiv
1,511
citations
Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
CVPR 2024
arXiv
343
citations
Egocentric Video-Language Pretraining
NEURIPS 2022
arXiv
254
citations
Multi-Modal Domain Adaptation for Fine-Grained Action Recognition
CVPR 2020
arXiv
229
citations
Temporal-Relational CrossTransformers for Few-Shot Action Recognition
CVPR 2021
arXiv
182
citations
EPIC-KITCHENS VISOR Benchmark: VIdeo Segmentations and Object Relations
NEURIPS 2022
arXiv
134
citations
On Semantic Similarity in Video Retrieval
CVPR 2021
arXiv
76
citations
EPIC Fields: Marrying 3D Geometry and Video Understanding
NEURIPS 2023
arXiv
45
citations
HD-EPIC: A Highly-Detailed Egocentric Video Dataset
CVPR 2025
arXiv
40
citations
Action Modifiers: Learning From Adverbs in Instructional Videos
CVPR 2020
arXiv
38
citations
TIM: A Time Interval Machine for Audio-Visual Action Recognition
CVPR 2024
arXiv
27
citations
What Can a Cook in Italy Teach a Mechanic in India? Action Recognition Generalisation Over Scenarios and Locations
ICCV 2023
arXiv
24
citations
Use Your Head: Improving Long-Tail Video Recognition
CVPR 2023
arXiv
23
citations
UnweaveNet: Unweaving Activity Stories
CVPR 2022
arXiv
16
citations
The Wisdom of Crowds: Temporal Progressive Attention for Early Action Prediction
CVPR 2023
arXiv
14
citations
Learning from One Continuous Video Stream
CVPR 2024
arXiv
10
citations
ShowHowTo: Generating Scene-Conditioned Step-by-Step Visual Instructions
CVPR 2025
arXiv
6
citations
Learning from Streaming Video with Orthogonal Gradients
CVPR 2025
arXiv
6
citations
Context-Aware Multimodal Pretraining
CVPR 2025
arXiv
4
citations
GenHowTo: Learning to Generate Actions and State Transformations from Instructional Videos
CVPR 2024
0
citations
Perception Test: A Diagnostic Benchmark for Multimodal Video Models
NEURIPS 2023
0
citations