by Yanpeng Zhou Papers
3 papers found
Conference
4D-VLA: Spatiotemporal Vision-Language-Action Pretraining with Cross-Scene Calibration
Jiahui Zhang, Yurui Chen, Yueming Xu et al.
NEURIPS 2025oralarXiv:2506.22242
18
citations
From Flatland to Space: Teaching Vision-Language Models to Perceive and Reason in 3D
Jiahui Zhang, Yurui Chen, Yueming Xu et al.
NEURIPS 2025arXiv:2503.22976
40
citations
UniGS: Unified Language-Image-3D Pretraining with Gaussian Splatting
Haoyuan Li, Yanpeng Zhou, Tao Tang et al.
ICLR 2025arXiv:2502.17860
5
citations