"video-language alignment" Papers
7 papers found
Conference
Can Text-to-Video Generation help Video-Language Alignment?
Luca Zanella, Massimiliano Mancini, Willi Menapace et al.
CVPR 2025arXiv:2503.18507
1
citations
DistinctAD: Distinctive Audio Description Generation in Contexts
Bo Fang, Wenhao Wu, Qiangqiang Wu et al.
CVPR 2025highlightarXiv:2411.18180
4
citations
A Simple Recipe for Contrastively Pre-training Video-First Encoders Beyond 16 Frames
Pinelopi Papalampidi, Skanda Koppula, Shreya Pathak et al.
CVPR 2024arXiv:2312.07395
41
citations
Multi-granularity Correspondence Learning from Long-term Noisy Videos
Yijie Lin, Jie Zhang, Zhenyu Huang et al.
ICLR 2024oralarXiv:2401.16702
38
citations
PiTe: Pixel-Temporal Alignment for Large Video-Language Model
Yang Liu, Pengxiang Ding, Siteng Huang et al.
ECCV 2024arXiv:2409.07239
9
citations
VideoCon: Robust Video-Language Alignment via Contrast Captions
Hritik Bansal, Yonatan Bitton, Idan Szpektor et al.
CVPR 2024arXiv:2311.10111
30
citations
VidLA: Video-Language Alignment at Scale
Mamshad Nayeem Rizve, Fan Fei, Jayakrishnan Unnikrishnan et al.
CVPR 2024arXiv:2403.14870
8
citations