"cross-modal reasoning" Papers
7 papers found
Conference
ChartMimic: Evaluating LMM's Cross-Modal Reasoning Capability via Chart-to-Code Generation
Cheng Yang, Chufan Shi, Yaxin Liu et al.
ICLR 2025arXiv:2406.09961
69
citations
Mitigating Modal Imbalance in Multimodal Reasoning
Chen Henry Wu, Neil Kale, Aditi Raghunathan
COLM 2025paperarXiv:2510.02608
1
citations
OmniBench: Towards The Future of Universal Omni-Language Models
Yizhi Li, Ge Zhang, Yinghao Ma et al.
NEURIPS 2025arXiv:2409.15272
53
citations
TRAP: Targeted Redirecting of Agentic Preferences
Hangoo Kang, Jehyeok Yeon, Gagandeep Singh
NEURIPS 2025arXiv:2505.23518
3
citations
Unleashing the Potential of Consistency Learning for Detecting and Grounding Multi-Modal Media Manipulation
Yiheng Li, Yang Yang, Zichang Tan et al.
CVPR 2025arXiv:2506.05890
3
citations
Vinci: Deep Thinking in Text-to-Image Generation using Unified Model with Reinforcement Learning
wang lin, Wentao Hu, Liyu Jia et al.
NEURIPS 2025
X-InstructBLIP: A Framework for Aligning Image, 3D, Audio, Video to LLMs and its Emergent Cross-modal Reasoning
Artemis Panagopoulou, Le Xue, Ning Yu et al.
ECCV 2024
6
citations