Poster "sequence-to-sequence modeling" Papers
2 papers found
Conference
Masked Generative Video-to-Audio Transformers with Enhanced Synchronicity
Santiago Pascual, Chunghsin YEH, Ioannis Tsiamas et al.
ECCV 2024arXiv:2407.10387
31
citations
Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities
Yiyuan Zhang, Xiaohan Ding, Kaixiong Gong et al.
CVPR 2024arXiv:2401.14405
12
citations