α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Pengxiang Ding
Pengxiang Ding
9
papers
261
total citations
papers (9)
Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference
AAAI 2025
arXiv
110
citations
VLAS: Vision-Language-Action Model with Speech Instructions for Customized Robot Manipulation
ICLR 2025
arXiv
43
citations
ReinboT: Amplifying Robot Visual-Language Manipulation with Reinforcement Learning
ICML 2025
arXiv
24
citations
CARP: Visuomotor Policy Learning via Coarse-to-Fine Autoregressive Prediction
ICCV 2025
arXiv
24
citations
SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning
NEURIPS 2025
arXiv
21
citations
GEVRM: Goal-Expressive Video Generation Model For Robust Visual Manipulation
ICLR 2025
arXiv
17
citations
Expressive Forecasting of 3D Whole-Body Human Motions
AAAI 2024
arXiv
9
citations
PiTe: Pixel-Temporal Alignment for Large Video-Language Model
ECCV 2024
arXiv
9
citations
Score-Based Diffusion Policy Compatible with Reinforcement Learning via Optimal Transport
ICML 2025
arXiv
4
citations