α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Jiangmiao Pang
Jiangmiao Pang
28
papers
3,401
total citations
papers (28)
Observation-Centric SORT: Rethinking SORT for Robust Multi-Object Tracking
CVPR 2023
arXiv
744
citations
Quasi-Dense Similarity Learning for Multiple Object Tracking
CVPR 2021
arXiv
450
citations
K-Net: Towards Unified Image Segmentation
NEURIPS 2021
arXiv
448
citations
PointLLM: Empowering Large Language Models to Understand Point Clouds
ECCV 2024
arXiv
295
citations
Seesaw Loss for Long-Tailed Instance Segmentation
CVPR 2021
arXiv
274
citations
Dense Distinct Query for End-to-End Object Detection
CVPR 2023
arXiv
223
citations
Side-Aware Boundary Localization for More Precise Object Detection
ECCV 2020
arXiv
153
citations
EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI
CVPR 2024
arXiv
130
citations
LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D Capabilities
ICCV 2025
127
citations
Video K-Net: A Simple, Strong, and Unified Baseline for Video Segmentation
CVPR 2022
arXiv
111
citations
Unified Human-Scene Interaction via Prompted Chain-of-Contacts
ICLR 2024
arXiv
101
citations
Monocular 3D Object Detection with Depth from Motion
ECCV 2022
arXiv
69
citations
Aether: Geometric-Aware Unified World Modeling
ICCV 2025
arXiv
50
citations
MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training
CVPR 2023
arXiv
37
citations
OV-PARTS: Towards Open-Vocabulary Part Segmentation
NEURIPS 2023
arXiv
36
citations
GenNBV: Generalizable Next-Best-View Policy for Active 3D Reconstruction
CVPR 2024
arXiv
35
citations
Tube-Link: A Flexible Cross Tube Framework for Universal Video Segmentation
ICCV 2023
arXiv
28
citations
RoboGround: Robotic Manipulation with Grounded Vision-Language Priors
CVPR 2025
arXiv
17
citations
Dense Siamese Network for Dense Unsupervised Learning
ECCV 2022
arXiv
16
citations
EgoExoBench: A Benchmark for First- and Third-person View Video Understanding in MLLMs
NEURIPS 2025
arXiv
11
citations
GENMANIP: LLM-driven Simulation for Generalizable Instruction-Following Manipulation
CVPR 2025
arXiv
8
citations
ObjectGS: Object-aware Scene Reconstruction and Scene Understanding via Gaussian Splatting
ICCV 2025
arXiv
7
citations
Rethinking the Embodied Gap in Vision-and-Language Navigation: A Holistic Study of Physical and Visual Disparities
ICCV 2025
arXiv
7
citations
OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding
NEURIPS 2025
arXiv
7
citations
GLEAM: Learning Generalizable Exploration Policy for Active Mapping in Complex 3D Indoor Scene
ICCV 2025
arXiv
6
citations
VFlowOpt: A Token Pruning Framework for LMMs with Visual Information Flow-Guided Optimization
ICCV 2025
arXiv
5
citations
A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning
CVPR 2025
arXiv
4
citations
LabUtopia: High-Fidelity Simulation and Hierarchical Benchmark for Scientific Embodied Agents
NEURIPS 2025
arXiv
2
citations