ResearchAlpha Leak

Conferences Topics Top Authors Rankings Browse All

Home/Authors/Triantafyllos Afouras

Triantafyllos Afouras

Topic trends: 32,543 papers · similarity ≥ 0.4 · year ≥ 2024 · Data sourced from Semantic Scholar

34,598 papers | Abstracts: 31,650 (91.5%) | Citations: 34,598 (100.0%) | arXiv: 26,074 (75.4%)

Built: Feb 14, 2026, 11:22 PM AMS

15

papers

1,462

total citations

papers (15)

Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives

Self-Supervised Learning of Audio-Visual Objects from Video

Localizing Visual Sounds the Hard Way

BSL-1K: Scaling up co-articulated sign language recognition using mouthing cues

Sub-Word Level Lip Reading With Visual Attention

Self-Supervised Object Detection From Audio-Visual Correspondence

Read and Attend: Temporal Localisation in Sign Language Videos

PerceptionLM: Open-Access Data and Models for Detailed Visual Understanding

NEURIPS 2025arXiv

Video-Mined Task Graphs for Keystep Recognition in Instructional Videos

NEURIPS 2023arXiv

Aligning Subtitles in Sign Language Videos

Reading To Listen at the Cocktail Party: Multi-Modal Speech Separation

Learning to Ground Instructional Articles in Videos through Narrations

MusicFlow: Cascaded Flow Matching for Text Guided Music Generation

HT-Step: Aligning Instructional Articles with How-To Videos

Enrich and Detect: Video Temporal Grounding with Multimodal LLMs