α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Shizhe Chen
Shizhe Chen
17
papers
1,752
total citations
papers (17)
Fine-Grained Video-Text Retrieval With Hierarchical Graph Reasoning
CVPR 2020
arXiv
361
citations
History Aware Multimodal Transformer for Vision-and-Language Navigation
NEURIPS 2021
arXiv
317
citations
Say As You Wish: Fine-Grained Control of Image Caption Generation With Abstract Scene Graphs
CVPR 2020
arXiv
242
citations
Think Global, Act Local: Dual-Scale Graph Transformer for Vision-and-Language Navigation
CVPR 2022
arXiv
213
citations
Airbert: In-Domain Pretraining for Vision-and-Language Navigation
ICCV 2021
arXiv
170
citations
Language Conditioned Spatial Relation Reasoning for 3D Object Grounding
NEURIPS 2022
arXiv
133
citations
Elaborative Rehearsal for Zero-Shot Action Recognition
ICCV 2021
arXiv
108
citations
Learning from Unlabeled 3D Environments for Vision-and-Language Navigation
ECCV 2022
arXiv
59
citations
gSDF: Geometry-Driven Signed Distance Functions for 3D Hand-Object Reconstruction
CVPR 2023
arXiv
59
citations
Towards Diverse Paragraph Captioning for Untrimmed Videos
CVPR 2021
arXiv
42
citations
SUGAR: Pre-training 3D Visual Representations for Robotics
CVPR 2024
arXiv
37
citations
NextBestPath: Efficient 3D Mapping of Unseen Environments
ICLR 2025
arXiv
4
citations
HORT: Monocular Hand-held Objects Reconstruction with Transformers
ICCV 2025
arXiv
4
citations
Explore and Tell: Embodied Visual Captioning in 3D Environments
ICCV 2023
arXiv
3
citations
Few-Shot Action Recognition with Hierarchical Matching and Contrastive Learning
ECCV 2022
0
citations
VRDFormer: End-to-End Video Visual Relation Detection With Transformers
CVPR 2022
0
citations
Sketch, Ground, and Refine: Top-Down Dense Video Captioning
CVPR 2021
0
citations