"language-video alignment" Papers
2 papers found
Conference
Language Model Guided Interpretable Video Action Reasoning
Ning Wang, Guangming Zhu, Hongsheng Li et al.
CVPR 2024, arXiv:2404.01591
7 citations
Towards Balanced Alignment: Modal-Enhanced Semantic Modeling for Video Moment Retrieval
Zhihang Liu, Jun Li, Hongtao Xie et al.
AAAI 2024, arXiv:2312.12155
41 citations