α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Xingang Wang
Xingang Wang
18
papers
1,167
total citations
papers (18)
OpenOccupancy: A Large Scale Benchmark for Surrounding Semantic Occupancy Perception
ICCV 2023
arXiv
239
citations
Learning Dynamic Routing for Semantic Segmentation
CVPR 2020
arXiv
184
citations
DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation
AAAI 2025
arXiv
146
citations
MVSTER: Epipolar Transformer for Efficient Multi-View Stereo
ECCV 2022
arXiv
124
citations
FreeSeg: Unified, Universal and Open-Vocabulary Image Segmentation
CVPR 2023
arXiv
124
citations
DriveDreamer4D: World Models Are Effective Data Machines for 4D Driving Scene Representation
CVPR 2025
arXiv
87
citations
ReconDreamer: Crafting World Models for Driving Scene Reconstruction via Online Restoration
CVPR 2025
arXiv
58
citations
DiffBEV: Conditional Diffusion Model for Bird’s Eye View Perception
AAAI 2024
arXiv
36
citations
Relevant Intrinsic Feature Enhancement Network for Few-Shot Semantic Segmentation
AAAI 2024
arXiv
33
citations
Are We Ready for Vision-Centric Driving Streaming Perception? The ASAP Benchmark
CVPR 2023
arXiv
28
citations
Bayesian Prompt Flow Learning for Zero-Shot Anomaly Detection
CVPR 2025
arXiv
26
citations
ReconDreamer++: Harmonizing Generative and Reconstructive Models for Driving Scene Representation
ICCV 2025
arXiv
26
citations
EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Videos Generation
NEURIPS 2025
arXiv
26
citations
HumanDreamer: Generating Controllable Human-Motion Videos via Decoupled Generation
CVPR 2025
arXiv
18
citations
Multi-Granularity Distillation Scheme towards Lightweight Semi-Supervised Semantic Segmentation
ECCV 2022
arXiv
7
citations
DictAS: A Framework for Class-Generalizable Few-Shot Anomaly Segmentation via Dictionary Lookup
ICCV 2025
arXiv
2
citations
Rethinking Lanes and Points in Complex Scenarios for Monocular 3D Lane Detection
CVPR 2025
arXiv
2
citations
DynImg: Key Frames with Visual Prompts are Good Representation for Multi-Modal Video Understanding
ICCV 2025
arXiv
1
citations