"3d perception" Papers
10 papers found
ForeSight: Multi-View Streaming Joint Object Detection and Trajectory Forecasting
Sandro Papais, Letian Wang, Brian Cheong et al.
ICCV 2025, arXiv:2508.07089
Language-Image Models with 3D Understanding
Jang Hyun Cho, Boris Ivanovic, Yulong Cao et al.
ICLR 2025, arXiv:2405.03685
27 citations
OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual Reasoning
Shihao Wang, Zhiding Yu, Xiaohui Jiang et al.
CVPR 2025, arXiv:2504.04348
86 citations
PMA: Towards Parameter-Efficient Point Cloud Understanding via Point Mamba Adapter
Yaohua Zha, Yanzi Wang, Hang Guo et al.
CVPR 2025, arXiv:2505.20941
3 citations
PointMAC: Meta-Learned Adaptation for Robust Test-Time Point Cloud Completion
Linlian Jiang, Rui Ma, Li Gu et al.
NeurIPS 2025, arXiv:2510.10365
SpatialReasoner: Towards Explicit and Generalizable 3D Spatial Reasoning
Wufei Ma, Yu-Cheng Chou, Qihao Liu et al.
NeurIPS 2025, arXiv:2504.20024
23 citations
UniDrive: Towards Universal Driving Perception Across Camera Configurations
Ye Li, Wenzhao Zheng, Xiaonan Huang et al.
ICLR 2025, arXiv:2410.13864
4 citations
3D-VLA: A 3D Vision-Language-Action Generative World Model
Haoyu Zhen, Xiaowen Qiu, Peihao Chen et al.
ICML 2024, arXiv:2403.09631
233 citations
M-BEV: Masked BEV Perception for Robust Autonomous Driving
Siran Chen, Yue Ma, Yu Qiao et al.
AAAI 2024, arXiv:2312.12144
19 citations
UniM2AE: Multi-modal Masked Autoencoders with Unified 3D Representation for 3D Perception in Autonomous Driving
Jian Zou, Tianyu Huang, Guanglei Yang et al.
ECCV 2024
17 citations