"cross-modal embedding" Papers
2 papers found
Conference
Human-centered Interactive Learning via MLLMs for Text-to-Image Person Re-identification
Yang Qin, Chao Chen, Zhihang Fu et al.
CVPR 2025arXiv:2506.11036
9
citations
ConDense: Consistent 2D-3D Pre-training for Dense and Sparse Features from Multi-View Images
Xiaoshuai Zhang, Zhicheng Wang, Howard Zhou et al.
ECCV 2024arXiv:2408.17027
8
citations