James Glass
8 papers, 639 total citations
Papers (8)
Listen, Think, and Understand (ICLR 2024; arXiv; 224 citations)
Everything at Once - Multi-Modal Fusion Transformer for Video Retrieval (CVPR 2022; arXiv; 157 citations)
Multimodal Clustering Networks for Self-Supervised Learning From Unlabeled Videos (ICCV 2021; arXiv; 97 citations)
Curiosity-driven Red-teaming for Large Language Models (ICLR 2024; arXiv; 79 citations)
Spoken Moments: Learning Joint Audio-Visual Representations From Video Descriptions (CVPR 2021; arXiv; 68 citations)
What When and Where? Self-Supervised Spatio-Temporal Grounding in Untrimmed Multi-Action Videos from Narrated Instructions (CVPR 2024; arXiv; 9 citations)
Teaching VLMs to Localize Specific Objects from In-context Examples (ICCV 2025; arXiv; 3 citations)
CAV-MAE Sync: Improving Contrastive Audio-Visual Mask Autoencoders via Fine-Grained Alignment (CVPR 2025; arXiv; 2 citations)