"dataset construction pipeline" Papers
3 papers found
Conference
Instruction-based Image Manipulation by Watching How Things Move
Mingdeng Cao, Xuaner Zhang, Yinqiang Zheng et al.
CVPR 2025highlightarXiv:2412.12087
8
citations
M^3EL: A Multi-task Multi-topic Dataset for Multi-modal Entity Linking
Fang Wang, Shenglin Yin, Xiaoying Bai et al.
AAAI 2025paperarXiv:2410.18096
1
citations
MMAT-1M: A Large Reasoning Dataset for Multimodal Agent Tuning
Tianhong Gao, Yannian Fu, Weiqun Wu et al.
ICCV 2025arXiv:2507.21924
1
citations