Triantafyllos Afouras
15 papers, 1,462 total citations

Papers (15)
Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives (CVPR 2024, arXiv), 343 citations
Self-Supervised Learning of Audio-Visual Objects from Video (ECCV 2020, arXiv), 278 citations
Localizing Visual Sounds the Hard Way (CVPR 2021, arXiv), 227 citations
BSL-1K: Scaling up co-articulated sign language recognition using mouthing cues (ECCV 2020, arXiv), 203 citations
Sub-Word Level Lip Reading With Visual Attention (CVPR 2022, arXiv), 112 citations
Self-Supervised Object Detection From Audio-Visual Correspondence (CVPR 2022, arXiv), 53 citations
Read and Attend: Temporal Localisation in Sign Language Videos (CVPR 2021, arXiv), 48 citations
PerceptionLM: Open-Access Data and Models for Detailed Visual Understanding (NeurIPS 2025, arXiv), 47 citations
Video-Mined Task Graphs for Keystep Recognition in Instructional Videos (NeurIPS 2023, arXiv), 39 citations
Aligning Subtitles in Sign Language Videos (ICCV 2021, arXiv), 38 citations
Reading To Listen at the Cocktail Party: Multi-Modal Speech Separation (CVPR 2022, arXiv), 34 citations
Learning to Ground Instructional Articles in Videos through Narrations (ICCV 2023, arXiv), 27 citations
MusicFlow: Cascaded Flow Matching for Text Guided Music Generation (ICML 2024, arXiv), 13 citations
HT-Step: Aligning Instructional Articles with How-To Videos (NeurIPS 2023), 0 citations
Enrich and Detect: Video Temporal Grounding with Multimodal LLMs (ICCV 2025, arXiv), 0 citations