"multi-modal generation" Papers
3 papers found
CoMT: A Novel Benchmark for Chain of Multi-modal Thought on Large Vision-Language Models
Zihui Cheng, Qiguang Chen, Jin Zhang et al.
AAAI 2025, arXiv:2412.12932
30 citations
Multi-Modal and Multi-Attribute Generation of Single Cells with CFGen
Alessandro Palma, Till Richter, Hanyi Zhang et al.
ICLR 2025, arXiv:2407.11734
9 citations
Diffusion-driven GAN Inversion for Multi-Modal Face Image Generation
Jihyun Kim, Changjae Oh, Hoseok Do et al.
CVPR 2024, arXiv:2405.04356
24 citations