Poster "visual document understanding" Papers
2 papers found
Conference
Point-RFT: Improving Multimodal Reasoning with Visually Grounded Reinforcement Finetuning
Minheng Ni, Zhengyuan Yang, Linjie Li et al.
NEURIPS 2025arXiv:2505.19702
13
citations
Towards Natural Language-Based Document Image Retrieval: New Dataset and Benchmark
Hao Guo, Xugong Qin, Jun Jie Ou Yang et al.
CVPR 2025arXiv:2512.20174
1
citations