α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Ruohan Gao
Ruohan Gao
1
Affiliations
Affiliations
The University of Texas at Austin
18
papers
939
total citations
papers (18)
Listen to Look: Action Recognition by Previewing Audio
CVPR 2020
arXiv
285
citations
VisualVoice: Audio-Visual Speech Separation With Cross-Modal Consistency
CVPR 2021
arXiv
242
citations
ObjectFolder 2.0: A Multisensory Object Dataset for Sim2Real Transfer
CVPR 2022
arXiv
109
citations
VisualEchoes: Spatial Image Representation Learning through Echolocation
ECCV 2020
arXiv
91
citations
Visual Acoustic Matching
CVPR 2022
arXiv
65
citations
The ObjectFolder Benchmark: Multisensory Learning With Neural and Real Objects
CVPR 2023
arXiv
52
citations
The Audio-Visual Conversational Graph: From an Egocentric-Exocentric Perspective
CVPR 2024
arXiv
16
citations
Hearing Anything Anywhere
CVPR 2024
arXiv
13
citations
RealImpact: A Dataset of Impact Sound Fields for Real Objects
CVPR 2023
arXiv
13
citations
SoundCam: A Dataset for Finding Humans Using Room Acoustics
NEURIPS 2023
arXiv
11
citations
AVTrustBench: Assessing and Enhancing Reliability and Robustness in Audio-Visual LLMs
ICCV 2025
arXiv
10
citations
Spherical World-Locking for Audio-Visual Localization in Egocentric Videos
ECCV 2024
arXiv
7
citations
Hearing Anywhere in Any Environment
CVPR 2025
arXiv
6
citations
AURELIA: Test-time Reasoning Distillation in Audio-Visual LLMs
ICCV 2025
arXiv
6
citations
Learning to Highlight Audio by Watching Movies
CVPR 2025
arXiv
5
citations
GenFlowRL: Shaping Rewards with Generative Object-Centric Flow in Visual Reinforcement Learning
ICCV 2025
arXiv
4
citations
EgoAdapt: Adaptive Multisensory Distillation and Policy Learning for Efficient Egocentric Perception
ICCV 2025
arXiv
2
citations
Differentiable Room Acoustic Rendering with Multi-View Vision Priors
ICCV 2025
arXiv
2
citations