"large-scale dataset curation" Papers
4 papers found
Conference
CPSea: Large-scale cyclic peptide-protein complex dataset for machine learning in cyclic peptide design
Ziyi Yang, Hanyuan Xie, Yinjun Jia et al.
NEURIPS 2025
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text
Qingyun Li, Zhe Chen, Weiyun Wang et al.
ICLR 2025arXiv:2406.08418
49
citations
Understanding Museum Exhibits using Vision-Language Reasoning
Ada-Astrid Balauca, Sanjana Garai, Stefan Balauca et al.
ICCV 2025arXiv:2412.01370
1
citations
Data Roaming and Quality Assessment for Composed Image Retrieval
Matan Levy, Rami Ben-Ari, Nir Darshan et al.
AAAI 2024paperarXiv:2303.09429
55
citations