α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Hisham Cholakkal
Hisham Cholakkal
25
papers
1,229
total citations
papers (25)
GLaMM: Pixel Grounding Large Multimodal Model
CVPR 2024
arXiv
411
citations
SipMask: Spatial Information Preservation for Fast Image and Video Instance Segmentation
ECCV 2020
arXiv
188
citations
Person Image Synthesis via Denoising Diffusion Model
CVPR 2023
arXiv
153
citations
Rethinking Transformers Pre-training for Multi-Spectral Satellite Imagery
CVPR 2024
arXiv
83
citations
PSTR: End-to-End One-Step Person Search With Transformers
CVPR 2022
arXiv
73
citations
Handwriting Transformers
ICCV 2021
arXiv
62
citations
D2-Net: Weakly-Supervised Action Localization via Discriminative Embeddings and Denoised Activations
ICCV 2021
arXiv
61
citations
All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages
CVPR 2025
arXiv
44
citations
Discriminative Co-Saliency and Background Mining Transformer for Co-Salient Object Detection
CVPR 2023
arXiv
35
citations
DoodleFormer: Creative Sketch Drawing with Transformers
ECCV 2022
arXiv
33
citations
Video Instance Segmentation via Multi-Scale Spatio-Temporal Split Attention Transformer
ECCV 2022
arXiv
18
citations
Semi-supervised Open-World Object Detection
AAAI 2024
arXiv
15
citations
Multi-grained Temporal Prototype Learning for Few-shot Video Object Segmentation
ICCV 2023
arXiv
12
citations
Handling Data Heterogeneity via Architectural Design for Federated Visual Recognition
NEURIPS 2023
arXiv
9
citations
3D Indoor Instance Segmentation in an Open-World
NEURIPS 2023
arXiv
8
citations
PARIS3D: Reasoning-based 3D Part Segmentation Using Large Multimodal Model
ECCV 2024
arXiv
7
citations
CONDA: Condensed Deep Association Learning for Co-Salient Object Detection.
ECCV 2024
arXiv
7
citations
Generative Multiplane Neural Radiance for 3D-Aware Image Generation
ICCV 2023
arXiv
5
citations
TAViS: Text-bridged Audio-Visual Segmentation with Foundation Models
ICCV 2025
arXiv
3
citations
Adapting In-Domain Few-Shot Segmentation to New Domains without Source Domain Retraining
ICCV 2025
arXiv
2
citations
Bidirectional Reciprocative Information Communication for Few-Shot Semantic Segmentation
ICML 2024
0
citations
DEFT: Decompositional Efficient Fine-Tuning for Text-to-Image Models
NEURIPS 2025
arXiv
0
citations
Fixing Localization Errors to Improve Image Classification
ECCV 2020
0
citations
D2Det: Towards High Quality Object Detection and Instance Segmentation
CVPR 2020
0
citations
Count- and Similarity-aware R-CNN for Pedestrian Detection
ECCV 2020
0
citations