Poster "image-text pairs" Papers
6 papers found
Hallucination at a Glance: Controlled Visual Edits and Fine-Grained Multimodal Learning
Tianyi Bai, Yuxuan Fan, Qiu Jiantao et al.
NeurIPS 2025, arXiv:2506.07227
3 citations
NAUTILUS: A Large Multimodal Model for Underwater Scene Understanding
Wei Xu, Cheng Wang, Dingkang Liang et al.
NeurIPS 2025, arXiv:2510.27481
2 citations
PoGDiff: Product-of-Gaussians Diffusion Models for Imbalanced Text-to-Image Generation
Ziyan Wang, Sizhe Wei, Xiaoming Huo et al.
NeurIPS 2025, arXiv:2502.08106
1 citation
Image-Text Co-Decomposition for Text-Supervised Semantic Segmentation
Ji-Jia Wu, Andy Chia-Hao Chang, Chieh-Yu Chuang et al.
CVPR 2024, arXiv:2404.04231
20 citations
Low-Rank Similarity Mining for Multimodal Dataset Distillation
Yue Xu, Zhilin Lin, Yusong Qiu et al.
ICML 2024, arXiv:2406.03793
11 citations
UniHuman: A Unified Model For Editing Human Images in the Wild
Nannan Li, Qing Liu, Krishna Kumar Singh et al.
CVPR 2024, arXiv:2312.14985
14 citations