α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Enze Xie
Enze Xie
28
papers
16,599
total citations
papers (28)
SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers
NEURIPS 2021
arXiv
7,284
citations
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction Without Convolutions
ICCV 2021
arXiv
4,656
citations
BEVFormer: Learning Bird’s-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers
ECCV 2022
arXiv
1,720
citations
PolarMask: Single Shot Instance Segmentation With Polar Representation
CVPR 2020
arXiv
606
citations
DetCo: Unsupervised Contrastive Learning for Object Detection
ICCV 2021
arXiv
355
citations
PixArt-Sigma: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
ECCV 2024
223
citations
MagicDrive: Street View Generation with Diverse 3D Geometry Control
ICLR 2024
arXiv
218
citations
Segmenting Transparent Objects in the Wild
ECCV 2020
arXiv
206
citations
DDP: Diffusion Model for Dense Visual Prediction
ICCV 2023
arXiv
205
citations
Panoptic SegFormer: Delving Deeper Into Panoptic Segmentation With Transformers
CVPR 2022
arXiv
176
citations
Scene Text Image Super-resolution in the wild
ECCV 2020
arXiv
163
citations
Differentiable Hierarchical Graph Grouping for Multi-Person Pose Estimation
ECCV 2020
arXiv
139
citations
DiT-3D: Exploring Plain Diffusion Transformers for 3D Shape Generation
NEURIPS 2023
arXiv
116
citations
LEGO-Prover: Neural Theorem Proving with Growing Libraries
ICLR 2024
arXiv
112
citations
DiffFit: Unlocking Transferability of Large Diffusion Models via Simple Parameter-efficient Fine-Tuning
ICCV 2023
arXiv
92
citations
Beyond One-to-One: Rethinking the Referring Image Segmentation
ICCV 2023
arXiv
72
citations
Accelerating Diffusion Sampling with Optimized Time Steps
CVPR 2024
arXiv
55
citations
Flow-Based Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object Detection
NEURIPS 2023
arXiv
51
citations
DiffComplete: Diffusion-based Generative 3D Shape Completion
NEURIPS 2023
arXiv
41
citations
SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation
ICCV 2025
arXiv
41
citations
AE TextSpotter: Learning Visual and Linguistic Representation for Ambiguous Text Spotting
ECCV 2020
arXiv
25
citations
DC-AE 1.5: Accelerating Diffusion Model Convergence with Structured Latent Space
ICCV 2025
arXiv
23
citations
Parametric Depth Based Feature Representation Learning for Object Detection and Segmentation in Bird's-Eye View
ICCV 2023
arXiv
8
citations
Fast Training of Diffusion Transformer with Extreme Masking for 3D Point Clouds Generation
ECCV 2024
arXiv
7
citations
DC-AR: Efficient Masked Autoregressive Image Generation with Deep Compression Hybrid Tokenizer
ICCV 2025
arXiv
5
citations
MetaBEV: Solving Sensor Failures for 3D Detection and Map Segmentation
ICCV 2023
0
citations
Watch Only Once: An End-to-End Video Action Detection Framework
ICCV 2021
0
citations
T2I-CompBench: A Comprehensive Benchmark for Open-world Compositional Text-to-image Generation
NEURIPS 2023
0
citations