Qi Zheng
14 papers, 337 total citations
papers (14)
LayoutLLM: Layout Instruction Tuning with Large Language Models for Document Understanding (CVPR 2024, arXiv), 109 citations
VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents (ICLR 2025, arXiv), 70 citations
GeoLayoutLM: Geometric Pre-Training for Visual Information Extraction (CVPR 2023, arXiv), 63 citations
Vision Grid Transformer for Document Layout Analysis (ICCV 2023, arXiv), 53 citations
LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition (ICCV 2023, arXiv), 22 citations
A Simple yet Effective Layout Token in Large Language Models for Document Understanding (CVPR 2025, arXiv), 7 citations
ProcTag: Process Tagging for Assessing the Efficacy of Document Instruction Data (AAAI 2025, arXiv), 6 citations
WebRPG: Automatic Web Rendering Parameters Generation for Visual Presentation (ECCV 2024, arXiv), 5 citations
End-to-End HOI Reconstruction Transformer with Graph-based Encoding (CVPR 2025, arXiv), 1 citation
ST-ReP: Learning Predictive Representations Efficiently for Spatial-Temporal Forecasting (AAAI 2025, arXiv), 1 citation
Syntax-Aware Action Targeting for Video Captioning (CVPR 2020), 0 citations
Modeling Video As Stochastic Processes for Fine-Grained Video Representation Learning (CVPR 2023), 0 citations
An End-to-End OCR Text Re-organization Sequence Learning for Rich-text Detail Image Comprehension (ECCV 2020), 0 citations
Frequency-Biased Synergistic Design for Image Compression and Compensation (CVPR 2025), 0 citations