α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Yuliang Liu
Yuliang Liu
21
papers
1,457
total citations
papers (21)
Monkey: Image Resolution and Text Label Are Important Things for Large Multi-modal Models
CVPR 2024
arXiv
392
citations
ABCNet: Real-Time Scene Text Spotting With Adaptive Bezier-Curve Network
CVPR 2020
arXiv
385
citations
SwinTextSpotter: Scene Text Spotting via Better Synergy Between Text Detection and Text Recognition
CVPR 2022
arXiv
139
citations
On the General Value of Evidence, and Bilingual Scene-Text Visual Question Answering
CVPR 2020
arXiv
117
citations
Turning a CLIP Model Into a Scene Text Detector
CVPR 2023
arXiv
84
citations
Video-LaVIT: Unified Video-Language Pre-training with Decoupled Visual-Motional Tokenization
ICML 2024
arXiv
82
citations
OmniParser: A Unified Framework for Text Spotting Key Information Extraction and Table Recognition
CVPR 2024
arXiv
79
citations
SAPA: Similarity-Aware Point Affiliation for Feature Upsampling
NEURIPS 2022
arXiv
71
citations
ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer
ICCV 2023
arXiv
42
citations
Mini-Monkey: Alleviating the Semantic Sawtooth Effect for Lightweight MLLMs via Complementary Image Pyramid
ICLR 2025
arXiv
22
citations
ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining
AAAI 2024
arXiv
18
citations
Bridging the Gap Between End-to-End and Two-Step Text Spotting
CVPR 2024
arXiv
11
citations
MSDS: A Large-Scale Chinese Signature and Token Digit String Dataset for Handwriting Verification
NEURIPS 2022
arXiv
9
citations
DocThinker: Explainable Multimodal Large Language Models with Rule-based Reinforcement Learning for Document Understanding
ICCV 2025
arXiv
4
citations
LIRA: Inferring Segmentation in Large Multi-modal Models with Local Interleaved Region Assistance
ICCV 2025
arXiv
1
citations
SemiETS: Integrating Spatial and Content Consistencies for Semi-Supervised End-to-end Text Spotting
CVPR 2025
arXiv
1
citations
Don’t Forget Me: Accurate Background Recovery for Text Removal via Modeling Local-Global Context
ECCV 2022
0
citations
Training-free Geometric Image Editing on Diffusion Models
ICCV 2025
arXiv
0
citations
Towards Robust Tampered Text Detection in Document Image: New Dataset and New Solution
CVPR 2023
0
citations
Multi-scenario Overlapping Text Segmentation with Depth Awareness
ICCV 2025
0
citations
Towards Comprehensive Lecture Slides Understanding: Large-scale Dataset and Effective Method
ICCV 2025
0
citations