α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Xiaojian Ma
Xiaojian Ma
11
papers
819
total citations
papers (11)
An Embodied Generalist Agent in 3D World
ICML 2024
arXiv
305
citations
3D-VisTA: Pre-trained Transformer for 3D Vision and Text Alignment
ICCV 2023
arXiv
216
citations
Bongard-HOI: Benchmarking Few-Shot Visual Reasoning for Human-Object Interactions
CVPR 2022
arXiv
51
citations
CLOVA: A Closed-LOop Visual Assistant with Tool Usage and Update
CVPR 2024
arXiv
47
citations
Open-World Multi-Task Control Through Goal-Aware Representation Learning and Adaptive Horizon Prediction
CVPR 2023
arXiv
46
citations
Unsupervised Foreground Extraction via Deep Region Competition
NEURIPS 2021
arXiv
45
citations
Multi-modal Agent Tuning: Building a VLM-Driven Agent for Efficient Tool Usage
ICLR 2025
arXiv
38
citations
Move to Understand a 3D Scene: Bridging Visual Grounding and Exploration for Efficient and Versatile Embodied Navigation
ICCV 2025
arXiv
28
citations
Bongard-OpenWorld: Few-Shot Reasoning for Free-form Visual Concepts in the Real World
ICLR 2024
arXiv
19
citations
Embodied VideoAgent: Persistent Memory from Egocentric Videos and Embodied Sensors Enables Dynamic Scene Understanding
ICCV 2025
arXiv
12
citations
ROCKET-1: Mastering Open-World Interaction with Visual-Temporal Context Prompting
CVPR 2025
arXiv
12
citations