Poster "visual-language alignment" Papers
3 papers found
Conference
Logits DeConfusion with CLIP for Few-Shot Learning
Shuo Li, Fang Liu, Zehua Hao et al.
CVPR 2025arXiv:2504.12104
6
citations
MUSE-VL: Modeling Unified VLM through Semantic Discrete Encoding
Rongchang Xie, Chen Du, Ping Song et al.
ICCV 2025arXiv:2411.17762
27
citations
Refining CLIP's Spatial Awareness: A Visual-Centric Perspective
Congpei Qiu, Yanhao Wu, Wei Ke et al.
ICLR 2025arXiv:2504.02328
7
citations