Gang Yu
Affiliations (1)
Tencent
26 papers
3,342 total citations
Papers (26)
Executing Your Commands via Motion Diffusion in Latent Space
CVPR 2023
arXiv
545
citations
MotionGPT: Human Motion as a Foreign Language
NEURIPS 2023
arXiv
466
citations
High-Order Information Matters: Learning Relation and Topology for Occluded Person Re-Identification
CVPR 2020
arXiv
444
citations
Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image
ICCV 2023
arXiv
327
citations
Context Prior for Scene Segmentation
CVPR 2020
arXiv
286
citations
TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation
CVPR 2022
arXiv
265
citations
Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation
NEURIPS 2023
arXiv
174
citations
State-Aware Tracker for Real-Time Video Object Segmentation
CVPR 2020
arXiv
120
citations
Paint3D: Paint Anything 3D with Lighting-Less Texture Diffusion Models
CVPR 2024
arXiv
113
citations
MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers
ICLR 2025
arXiv
102
citations
End-to-End 3D Dense Captioning With Vote2Cap-DETR
CVPR 2023
arXiv
88
citations
IT3D: Improved Text-to-3D Generation with Explicit View Synthesis
AAAI 2024
arXiv
75
citations
STAR Loss: Reducing Semantic Ambiguity in Facial Landmark Detection
CVPR 2023
arXiv
64
citations
Hierarchical Normalization for Robust Monocular Depth Estimation
NEURIPS 2022
arXiv
62
citations
D&D: Learning Human Dynamics from Dynamic Camera
ECCV 2022
arXiv
49
citations
A Large-Scale Outdoor Multi-Modal Dataset and Benchmark for Novel View Synthesis and Implicit Scene Reconstruction
ICCV 2023
arXiv
47
citations
KRIS-Bench: Benchmarking Next-Level Intelligent Image Editing Models
NEURIPS 2025
arXiv
30
citations
MotionChain: Conversational Motion Controllers via Multimodal Prompts
ECCV 2024
arXiv
22
citations
MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D
CVPR 2025
arXiv
21
citations
Coordinates Are NOT Lonely - Codebook Prior Helps Implicit Neural 3D Representations
NEURIPS 2022
arXiv
20
citations
Robust Geometry-Preserving Depth Estimation Using Differentiable Rendering
ICCV 2023
arXiv
7
citations
DeRS: Towards Extremely Efficient Upcycled Mixture-of-Experts Models
CVPR 2025
arXiv
6
citations
PDF: Point Diffusion Implicit Function for Large-scale Scene Neural Representation
NEURIPS 2023
arXiv
5
citations
M3DBench: Towards Omni 3D Assistant with Interleaved Multi-modal Instructions
ECCV 2024
4
citations
LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning
CVPR 2024
0
citations
PM-INR: Prior-Rich Multi-Modal Implicit Large-Scale Scene Neural Representation
AAAI 2024
0
citations