Poster "contrastive training" Papers
4 papers found
RadZero: Similarity-Based Cross-Attention for Explainable Vision-Language Alignment in Chest X-ray with Zero-Shot Multi-Task Capability
Jonggwon Park, Byungmu Yoon, Soobum Kim et al.
NeurIPS 2025, arXiv:2504.07416
1 citation
C3Net: Compound Conditioned ControlNet for Multimodal Content Generation
Juntao Zhang, Yuehuai Liu, Yu-Wing Tai et al.
CVPR 2024, arXiv:2311.17951
9 citations
Distilling Vision-Language Models on Millions of Videos
Yue Zhao, Long Zhao, Xingyi Zhou et al.
CVPR 2024, arXiv:2401.06129
21 citations
Grounding Language Models for Visual Entity Recognition
Zilin Xiao, Ming Gong, Paola Cascante-Bonilla et al.
ECCV 2024, arXiv:2402.18695
13 citations