"audio-visual learning" Papers
12 papers found
Conference
CAV-MAE Sync: Improving Contrastive Audio-Visual Mask Autoencoders via Fine-Grained Alignment
Edson Araujo, Andrew Rouditchenko, Yuan Gong et al.
CVPR 2025arXiv:2505.01237
2
citations
Circumventing Shortcuts in Audio-visual Deepfake Detection Datasets with Unsupervised Learning
Stefan Smeu, Dragos-Alexandru Boldisor, Dan Oneata et al.
CVPR 2025highlightarXiv:2412.00175
9
citations
Clink! Chop! Thud! - Learning Object Sounds from Real-World Interactions
Mengyu Yang, Yiming Chen, Haozheng Pei et al.
ICCV 2025arXiv:2510.02313
Differentiable Room Acoustic Rendering with Multi-View Vision Priors
Derong Jin, Ruohan Gao
ICCV 2025arXiv:2504.21847
2
citations
Hearing Hands: Generating Sounds from Physical Interactions in 3D Scenes
Yiming Dou, Wonseok Oh, Yuqing Luo et al.
CVPR 2025arXiv:2506.09989
Language-Guided Audio-Visual Learning for Long-Term Sports Assessment
Huangbiao Xu, Xiao Ke, Huanqi Wu et al.
CVPR 2025
6
citations
Progressive Homeostatic and Plastic Prompt Tuning for Audio-Visual Multi-Task Incremental Learning
Jiong Yin, Liang Li, Jiehua Zhang et al.
ICCV 2025arXiv:2507.21588
1
citations
Audio-visual Generalized Zero-shot Learning the Easy Way
Shentong Mo, Pedro Morgado
ECCV 2024arXiv:2407.13095
8
citations
EquiAV: Leveraging Equivariance for Audio-Visual Contrastive Learning
Jongsuk Kim, Hyeongkeun Lee, Kyeongha Rho et al.
ICML 2024arXiv:2403.09502
12
citations
Mutual Learning for Acoustic Matching and Dereverberation via Visual Scene-driven Diffusion
Jian Ma, Wenguan Wang, Yi Yang et al.
ECCV 2024arXiv:2407.10373
1
citations
Overcome Modal Bias in Multi-modal Federated Learning via Balanced Modality Selection
Yunfeng Fan, Wenchao Xu, Haozhao Wang et al.
ECCV 2024arXiv:2401.00403
7
citations
Self-Supervised Audio-Visual Soundscape Stylization
Tingle Li, Renhao Wang, Po-Yao Huang et al.
ECCV 2024arXiv:2409.14340
7
citations