Poster "image-to-text generation" Papers
4 papers found
DoraCycle: Domain-Oriented Adaptation of Unified Generative Model in Multimodal Cycles
Rui Zhao, Weijia Mao, Mike Zheng Shou
CVPR 2025, arXiv:2503.03651, 4 citations
FlowTok: Flowing Seamlessly Across Text and Image Tokens
Ju He, Qihang Yu, Qihao Liu et al.
ICCV 2025, arXiv:2503.10772, 18 citations
DOCCI: Descriptions of Connected and Contrasting Images
Yasumasa Onoe, Sunayana Rane, Zachary E Berger et al.
ECCV 2024, arXiv:2404.19753, 100 citations
TrojVLM: Backdoor Attack Against Vision Language Models
Weimin Lyu, Lu Pang, Tengfei Ma et al.
ECCV 2024, arXiv:2409.19232, 25 citations