"3d perception" Papers

10 papers found

ForeSight: Multi-View Streaming Joint Object Detection and Trajectory Forecasting

Sandro Papais, Letian Wang, Brian Cheong et al.

ICCV 2025, arXiv:2508.07089

Language-Image Models with 3D Understanding

Jang Hyun Cho, Boris Ivanovic, Yulong Cao et al.

ICLR 2025, arXiv:2405.03685
27 citations

OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual Reasoning

Shihao Wang, Zhiding Yu, Xiaohui Jiang et al.

CVPR 2025, arXiv:2504.04348
86 citations

PMA: Towards Parameter-Efficient Point Cloud Understanding via Point Mamba Adapter

Yaohua Zha, Yanzi Wang, Hang Guo et al.

CVPR 2025, arXiv:2505.20941
3 citations

PointMAC: Meta-Learned Adaptation for Robust Test-Time Point Cloud Completion

Linlian Jiang, Rui Ma, Li Gu et al.

NeurIPS 2025, arXiv:2510.10365

SpatialReasoner: Towards Explicit and Generalizable 3D Spatial Reasoning

Wufei Ma, Yu-Cheng Chou, Qihao Liu et al.

NeurIPS 2025, arXiv:2504.20024
23 citations

UniDrive: Towards Universal Driving Perception Across Camera Configurations

Ye Li, Wenzhao Zheng, Xiaonan Huang et al.

ICLR 2025, arXiv:2410.13864
4 citations

3D-VLA: A 3D Vision-Language-Action Generative World Model

Haoyu Zhen, Xiaowen Qiu, Peihao Chen et al.

ICML 2024, arXiv:2403.09631
233 citations

M-BEV: Masked BEV Perception for Robust Autonomous Driving

Siran Chen, Yue Ma, Yu Qiao et al.

AAAI 2024, arXiv:2312.12144
19 citations

UniM2AE: Multi-modal Masked Autoencoders with Unified 3D Representation for 3D Perception in Autonomous Driving

Jian Zou, Tianyu Huang, Guanglei Yang et al.

ECCV 2024
17 citations