α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Hartwig Adam
Hartwig Adam
1
Affiliations
Affiliations
Google DeepMind
21
papers
3,514
total citations
papers (21)
Axial-DeepLab: Stand-Alone Axial-Attention for Panoptic Segmentation
ECCV 2020
arXiv
789
citations
Panoptic-DeepLab: A Simple, Strong, and Fast Baseline for Bottom-Up Panoptic Segmentation
CVPR 2020
arXiv
662
citations
MaX-DeepLab: End-to-End Panoptic Segmentation With Mask Transformers
CVPR 2021
arXiv
597
citations
VideoPoet: A Large Language Model for Zero-Shot Video Generation
ICML 2024
arXiv
420
citations
VIP-DeepLab: Learning Visual Perception With Depth-Aware Video Panoptic Segmentation
CVPR 2021
arXiv
165
citations
Naive-Student: Leveraging Semi-Supervised Learning in Video Sequences for Urban Scene Segmentation
ECCV 2020
arXiv
118
citations
Fashionpedia: Ontology, Segmentation, and an Attribute Localization Dataset
ECCV 2020
arXiv
110
citations
CMT-DeepLab: Clustering Mask Transformers for Panoptic Segmentation
CVPR 2022
arXiv
105
citations
View-Invariant Probabilistic Embedding for Human Pose
ECCV 2020
arXiv
88
citations
Adaptive Transformers for Robust Few-Shot Cross-Domain Face Anti-Spoofing
ECCV 2022
arXiv
79
citations
VideoPrism: A Foundational Visual Encoder for Video Understanding
ICML 2024
arXiv
73
citations
MnasFPN: Learning Latency-Aware Pyramid Architecture for Object Detection on Mobile Devices
CVPR 2020
arXiv
57
citations
Improving Zero-Shot Generalization and Robustness of Multi-Modal Models
CVPR 2023
arXiv
56
citations
TubeFormer-DeepLab: Video Mask Transformer
CVPR 2022
arXiv
50
citations
Learning View-Disentangled Human Pose Representation by Contrastive Cross-View Mutual Information Maximization
CVPR 2021
arXiv
34
citations
Alternating Gradient Descent and Mixture-of-Experts for Integrated Multimodal Perception
NEURIPS 2023
arXiv
26
citations
Distilling Vision-Language Models on Millions of Videos
CVPR 2024
arXiv
21
citations
k-Means Mask Transformer
ECCV 2022
arXiv
19
citations
Contextualized Spatio-Temporal Contrastive Learning With Self-Supervision
CVPR 2022
arXiv
18
citations
Exploring Fine-Grained Audiovisual Categorization with the SSW60 Dataset
ECCV 2022
arXiv
15
citations
Unified Visual Relationship Detection with Vision and Language Models
ICCV 2023
arXiv
12
citations