"visual-text alignment" Papers
3 papers found
Conference
Cross-Lingual Text-Rich Visual Comprehension: An Information Theory Perspective
Xinmiao Yu, Xiaocheng Feng, Yun Li et al.
AAAI 2025paperarXiv:2412.17787
Learnable Retrieval Enhanced Visual-Text Alignment and Fusion for Radiology Report Generation
Qin Zhou, Guoyan Liang, Xindi Li et al.
ICCV 2025arXiv:2507.07568
Learning to Generalize without Bias for Open-Vocabulary Action Recognition
Yating Yu, Congqi Cao, Yifan Zhang et al.
ICCV 2025highlightarXiv:2502.20158
4
citations