Zhendong Mao
26
papers
902
total citations
papers (26)
Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition
CVPR 2021
arXiv
377
citations
Graph Structured Network for Image-Text Matching
CVPR 2020
arXiv
285
citations
Towards Accurate Image Coding: Improved Autoregressive Image Generation With Dynamic Vector Quantization
CVPR 2023
arXiv
60
citations
Linguistic-Aware Patch Slimming Framework for Fine-grained Cross-Modal Alignment
CVPR 2024
41
citations
RealCustom: Narrowing Real Text Word for Real-Time Open-Domain Text-to-Image Customization
CVPR 2024
arXiv
39
citations
Not All Image Regions Matter: Masked Vector Quantization for Autoregressive Image Generation
CVPR 2023
arXiv
31
citations
RealGeneral: Unifying Visual Generation via Temporal In-Context Learning with Video Models
ICCV 2025
arXiv
13
citations
Graph Mixture of Experts and Memory-augmented Routers for Multivariate Time Series Anomaly Detection
AAAI 2025
arXiv
12
citations
Identification of Necessary Semantic Undertakers in the Causal View for Image-Text Matching
AAAI 2024
10
citations
CustomContrast: A Multilevel Contrastive Perspective for Subject-Driven Text-to-Image Customization
AAAI 2025
arXiv
9
citations
D^2iT: Dynamic Diffusion Transformer for Accurate Image Generation
CVPR 2025
6
citations
DualReal: Adaptive Joint Training for Lossless Identity-Motion Fusion in Video Customization
ICCV 2025
arXiv
5
citations
LongAnimation: Long Animation Generation with Dynamic Global-Local Memory
ICCV 2025
arXiv
5
citations
Gradual Residuals Alignment: A Dual-Stream Framework for GAN Inversion and Image Attribute Editing
AAAI 2024
arXiv
3
citations
FeedEdit: Text-Based Image Editing with Dynamic Feedback Regulation
CVPR 2025
3
citations
ELDER: Enhancing Lifelong Model Editing with Mixture-of-LoRA
AAAI 2025
arXiv
2
citations
SVLTA: Benchmarking Vision-Language Temporal Alignment via Synthetic Video Situation
CVPR 2025
arXiv
1
citations
Benchmarking Large Language Models on Controllable Generation under Diversified Instructions
AAAI 2024
0
citations
Hierarchy-Aware Pseudo Word Learning with Text Adaptation for Zero-Shot Composed Image Retrieval
ICCV 2025
0
citations
A4A: Adapter for Adapter Transfer via All-for-All Mapping for Cross-Architecture Models
CVPR 2025
0
citations
Crossing the Gap: Domain Generalization for Image Captioning
CVPR 2023
0
citations
Lesion-Aware Transformers for Diabetic Retinopathy Grading
CVPR 2021
0
citations
DreamIdentity: Enhanced Editability for Efficient Face-Identity Preserved Image Generation
AAAI 2024
0
citations
Negative-Aware Attention Framework for Image-Text Matching
CVPR 2022
0
citations
Learning Semantic Relationship Among Instances for Image-Text Matching
CVPR 2023
0
citations
Dragin3D: Image Editing by Dragging in 3D Space
CVPR 2025
0
citations