"video-text alignment" Papers
4 papers found
Conference
HiERO: Understanding the Hierarchy of Human Behavior Enhances Reasoning on Egocentric Videos
Simone Alberto Peirone, Francesca Pistilli, Giuseppe Averta
ICCV 2025arXiv:2505.12911
1
citations
Learning Audio-guided Video Representation with Gated Attention for Video-Text Retrieval
Boseung Jeong, Jicheol Park, Sungyeon Kim et al.
CVPR 2025arXiv:2504.02397
4
citations
Uni-Sign: Toward Unified Sign Language Understanding at Scale
Zecheng Li, Wengang Zhou, Weichao Zhao et al.
ICLR 2025arXiv:2501.15187
39
citations
Reinforcement Learning Friendly Vision-Language Model for Minecraft
Haobin Jiang, Junpeng Yue, Hao Luo et al.
ECCV 2024arXiv:2303.10571
15
citations