Poster "vision-language reasoning" Papers
2 papers found
Conference
DetectiumFire: A Comprehensive Multi-modal Dataset Bridging Vision and Language for Fire Understanding
Zixuan Liu, Siavash H. Khajavi, Guangkai Jiang
NEURIPS 2025arXiv:2511.02495
Seeing is Believing? Mitigating OCR Hallucinations in Multimodal Large Language Models
zhentao he, Can Zhang, Ziheng Wu et al.
NEURIPS 2025arXiv:2506.20168
2
citations