α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Youngjae Yu
Youngjae Yu
18
papers
1,022
total citations
papers (18)
MERLOT: Multimodal Neural Script Knowledge Models
NEURIPS 2021
arXiv
433
citations
MERLOT Reserve: Neural Script Knowledge Through Vision and Language and Sound
CVPR 2022
arXiv
241
citations
Multimodal C4: An Open, Billion-scale Corpus of Images Interleaved with Text
NEURIPS 2023
arXiv
222
citations
ACAV100M: Automatic Curation of Large-Scale Datasets for Audio-Visual Video Representation Learning
ICCV 2021
arXiv
68
citations
CHAMPAGNE: Learning Real-world Conversation from Large-Scale Web Videos
ICCV 2023
arXiv
21
citations
Localized Symbolic Knowledge Distillation for Visual Commonsense Models
NEURIPS 2023
arXiv
13
citations
DEEPTalk: Dynamic Emotion Embedding for Probabilistic Speech-Driven 3D Face Animation
AAAI 2025
arXiv
9
citations
DisCoRD: Discrete Tokens to Continuous Motion via Rectified Flow Decoding
ICCV 2025
arXiv
7
citations
VAGUE: Visual Contexts Clarify Ambiguous Expressions
ICCV 2025
arXiv
3
citations
ISR-DPO: Aligning Large Multimodal Models for Videos by Iterative Self-Retrospective DPO
AAAI 2025
arXiv
3
citations
ActionSwitch: Class-agnostic Detection of Simultaneous Actions in Streaming Videos
ECCV 2024
arXiv
1
citations
V.I.P. : Iterative Online Preference Distillation for Efficient Video Diffusion Models
ICCV 2025
arXiv
1
citations
Pano-AVQA: Grounded Audio-Visual Question Answering on 360deg Videos
ICCV 2021
0
citations
Fusing Pre-Trained Language Models With Multimodal Prompts Through Reinforcement Learning
CVPR 2023
0
citations
Character Grounding and Re-Identification in Story of Videos and Text Descriptions
ECCV 2020
0
citations
Diffusion-Driven Two-Stage Active Learning for Low-Budget Semantic Segmentation
NEURIPS 2025
arXiv
0
citations
MASS: Overcoming Language Bias in Image-Text Matching
AAAI 2025
arXiv
0
citations
Transitional Adaptation of Pretrained Models for Visual Storytelling
CVPR 2021
0
citations