Hao Tan
29
papers
2,050
total citations
papers (29)
LRM: Large Reconstruction Model for Single Image to 3D
ICLR 2024
arXiv
711
citations
GS-LRM: Large Reconstruction Model for 3D Gaussian Splatting
ECCV 2024
arXiv
250
citations
DMV3D: Denoising Multi-view Diffusion Using 3D Large Reconstruction Model
ICLR 2024
arXiv
227
citations
PF-LRM: Pose-Free Large Reconstruction Model for Joint Pose and Shape Prediction
ICLR 2024
arXiv
155
citations
Scaling Data Generation in Vision-and-Language Navigation
ICCV 2023
arXiv
113
citations
EnvEdit: Environment Editing for Vision-and-Language Navigation
CVPR 2022
arXiv
108
citations
LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias
ICLR 2025
arXiv
96
citations
RandAR: Decoder-only Autoregressive Visual Generation in Random Orders
CVPR 2025
arXiv
63
citations
Long-LRM: Long-sequence Large Reconstruction Model for Wide-coverage Gaussian Splats
ICCV 2025
arXiv
58
citations
Learning Navigational Visual Representations with Semantic Map Supervision
ICCV 2023
arXiv
44
citations
LazyDiT: Lazy Learning for the Acceleration of Diffusion Transformers
AAAI 2025
arXiv
38
citations
VidLanKD: Improving Language Understanding via Video-Distilled Knowledge Transfer
NeurIPS 2021
arXiv
34
citations
Numerical Pruning for Efficient Autoregressive Models
AAAI 2025
arXiv
23
citations
Carve3D: Improving Multi-view Reconstruction Consistency for Diffusion Models with RL Finetuning
CVPR 2024
arXiv
22
citations
RayZer: A Self-supervised Large View Synthesis Model
ICCV 2025
arXiv
21
citations
VEGGIE: Instructional Editing and Reasoning Video Concepts with Grounded Generation
ICCV 2025
arXiv
18
citations
Compound Text-Guided Prompt Tuning via Image-Adaptive Cues
AAAI 2024
arXiv
14
citations
RelitLRM: Generative Relightable Radiance for Large Reconstruction Models
ICLR 2025
arXiv
11
citations
MegaSynth: Scaling Up 3D Scene Reconstruction with Synthesized Data
CVPR 2025
arXiv
10
citations
Gaussian Mixture Flow Matching Models
ICML 2025
arXiv
10
citations
Adaptive Few-shot Prompting for Machine Translation with Pre-trained Language Models
AAAI 2025
arXiv
8
citations
Turbo3D: Ultra-fast Text-to-3D Generation
CVPR 2025
arXiv
7
citations
Generating 3D-Consistent Videos from Unposed Internet Photos
CVPR 2025
arXiv
4
citations
Buffer Anytime: Zero-Shot Video Depth and Normal from Image Priors
CVPR 2025
arXiv
4
citations
Recover and Match: Open-Vocabulary Multi-Label Recognition through Knowledge-Constrained Optimal Transport
CVPR 2025
arXiv
1
citations
DiffTell: A High-Quality Dataset for Describing Image Manipulation Changes
ICCV 2025
0
citations
Efficient Federated Incomplete Multi-View Clustering
ICML 2025
0
citations
Large-scale Multi-view Tensor Clustering with Implicit Linear Kernels
CVPR 2025
0
citations
Building Vision-Language Models on Solid Foundations with Masked Distillation
CVPR 2024
0
citations