Papers on "multimodal video understanding"
2 papers found
Conference
H-MBA: Hierarchical MamBa Adaptation for Multi-Modal Video Understanding in Autonomous Driving
Siran Chen, Yuxiao Luo, Yue Ma et al.
AAAI 2025 paper, arXiv:2501.04302
6 citations
Exploiting Auxiliary Caption for Video Grounding
Hongxiang Li, Meng Cao, Xuxin Cheng et al.
AAAI 2024 paper, arXiv:2301.05997
14 citations