α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Baoxiong Jia
Baoxiong Jia
20
papers
1,339
total citations
papers (20)
An Embodied Generalist Agent in 3D World
ICML 2024
arXiv
305
citations
Diffusion-Based Generation, Optimization, and Planning in 3D Scenes
CVPR 2023
arXiv
296
citations
EgoTaskQA: Understanding Human Tasks in Egocentric Videos
NEURIPS 2022
arXiv
99
citations
PhyScene: Physically Interactable 3D Scene Synthesis for Embodied AI
CVPR 2024
arXiv
93
citations
Abstract Spatial-Temporal Reasoning via Probabilistic Abduction and Execution
CVPR 2021
arXiv
82
citations
Move as You Say Interact as You Can: Language-guided Human Motion Generation with Scene Affordance
CVPR 2024
arXiv
81
citations
ARNOLD: A Benchmark for Language-Grounded Task Learning with Continuous States in Realistic 3D Scenes
ICCV 2023
arXiv
72
citations
LEMMA: A Multi-view Dataset for LEarning Multi-agent Multi-task Activities
ECCV 2020
arXiv
65
citations
ACRE: Abstract Causal REasoning Beyond Covariation
CVPR 2021
arXiv
55
citations
Learning Algebraic Representation for Systematic Generalization in Abstract Reasoning
ECCV 2022
arXiv
45
citations
Building Interactable Replicas of Complex Articulated Objects via Gaussian Splatting
ICLR 2025
arXiv
39
citations
Move to Understand a 3D Scene: Bridging Visual Grounding and Exploration for Efficient and Versatile Embodied Navigation
ICCV 2025
arXiv
28
citations
Unveiling the Mist over 3D Vision-Language Understanding: Object-centric Evaluation with Chain-of-Analysis
CVPR 2025
arXiv
18
citations
METASCENES: Towards Automated Replica Creation for Real-world 3D Scans
CVPR 2025
arXiv
13
citations
GWM: Towards Scalable Gaussian World Models for Robotic Manipulation
ICCV 2025
arXiv
13
citations
SceneWeaver: All-in-One 3D Scene Synthesis with an Extensible and Self-Reflective Agent
NEURIPS 2025
arXiv
11
citations
MOVIS: Enhancing Multi-Object Novel View Synthesis for Indoor Scenes
CVPR 2025
arXiv
9
citations
Masked Point-Entity Contrast for Open-Vocabulary 3D Scene Understanding
CVPR 2025
arXiv
6
citations
ProBio: A Protocol-guided Multimodal Dataset for Molecular Biology Lab
NEURIPS 2023
arXiv
5
citations
X-VoE: Measuring eXplanatory Violation of Expectation in Physical Events
ICCV 2023
arXiv
4
citations