Poster "masked token prediction" Papers
3 papers found
Conference
OSKAR: Omnimodal Self-supervised Knowledge Abstraction and Representation
Mohamed Abdelfattah, Kaouther Messaoud, Alexandre Alahi
NEURIPS 2025
Towards A Translative Model of Sperm Whale Vocalization
Orr Paradise, Liangyuan Chen, Pranav Muralikrishnan et al.
NEURIPS 2025arXiv:2512.02206
1
citations
From Vision to Audio and Beyond: A Unified Model for Audio-Visual Representation and Generation
Kun Su, Xiulong Liu, Eli Shlizerman
ICML 2024arXiv:2409.19132
17
citations