Poster "vision-language datasets" Papers
3 papers found
Conference
Semantic and Expressive Variations in Image Captions Across Languages
Andre Ye, Sebastin Santy, Jena D. Hwang et al.
CVPR 2025arXiv:2310.14356
5
citations
DOCCI: Descriptions of Connected and Contrasting Images
Yasumasa Onoe, Sunayana Rane, Zachary E Berger et al.
ECCV 2024arXiv:2404.19753
100
citations
Getting it Right: Improving Spatial Consistency in Text-to-Image Models
Agneet Chatterjee, Gabriela Ben Melech Stan, Estelle Guez Aflalo et al.
ECCV 2024arXiv:2404.01197
26
citations