Papers by Ze Huang
3 papers found
Conference
4D-VLA: Spatiotemporal Vision-Language-Action Pretraining with Cross-Scene Calibration
Jiahui Zhang, Yurui Chen, Yueming Xu et al.
NeurIPS 2025 (oral), arXiv:2506.22242
18 citations
From Flatland to Space: Teaching Vision-Language Models to Perceive and Reason in 3D
Jiahui Zhang, Yurui Chen, Yueming Xu et al.
NeurIPS 2025, arXiv:2503.22976
40 citations
WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation
Jiachen Lu, Ze Huang, Zeyu Yang et al.
ECCV 2024, arXiv:2312.02934
76 citations