Highlight "cross-modal learning" Papers
2 papers found
Conference
Seeing the Trees for the Forest: Rethinking Weakly-Supervised Medical Visual Grounding
Huy Ta, Duy Anh Huynh, Yutong Xie et al.
ICCV 2025highlightarXiv:2505.15123
2
citations
Can I Trust Your Answer? Visually Grounded Video Question Answering
Junbin Xiao, Angela Yao, Yicong Li et al.
CVPR 2024highlightarXiv:2309.01327
113
citations