"key information extraction" Papers
2 papers found
Conference
CC-OCR: A Comprehensive and Challenging OCR Benchmark for Evaluating Large Multimodal Models in Literacy
Zhibo Yang, Jun Tang, Zhaohai Li et al.
ICCV 2025arXiv:2412.02210
43
citations
Seeing is Believing? Mitigating OCR Hallucinations in Multimodal Large Language Models
zhentao he, Can Zhang, Ziheng Wu et al.
NEURIPS 2025arXiv:2506.20168
2
citations