Muzammal Naseer
23 papers
1,868 total citations
papers (23)
A Self-supervised Approach for Adversarial Robustness | CVPR 2020 | arXiv | 340 citations
GeoChat: Grounded Large Vision-Language Model for Remote Sensing | CVPR 2024 | arXiv | 319 citations
Self-regulating Prompts: Foundational Model Adaptation without Forgetting | ICCV 2023 | arXiv | 318 citations
Vita-CLIP: Video and Text Adaptive CLIP via Multimodal Prompting | CVPR 2023 | arXiv | 112 citations
Self-Supervised Video Transformer | CVPR 2022 | arXiv | 110 citations
PromptCAL: Contrastive Affinity Learning via Auxiliary Prompts for Generalized Novel Category Discovery | CVPR 2023 | arXiv | 106 citations
Orthogonal Projection Loss | ICCV 2021 | arXiv | 90 citations
On Generating Transferable Targeted Perturbations | ICCV 2021 | arXiv | 89 citations
Rethinking Transformers Pre-training for Multi-Spectral Satellite Imagery | CVPR 2024 | arXiv | 83 citations
FLIP: Cross-domain Face Anti-spoofing with Language Guidance | ICCV 2023 | arXiv | 77 citations
LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts | ICLR 2024 | arXiv | 56 citations
CLIP2Protect: Protecting Facial Privacy Using Text-Guided Makeup via Adversarial Latent Search | CVPR 2023 | arXiv | 51 citations
Learning to Prompt with Text Only Supervision for Vision-Language Models | AAAI 2025 | arXiv | 42 citations
Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition | ICCV 2023 | arXiv | 28 citations
Composed Video Retrieval via Enriched Context and Discriminative Embeddings | CVPR 2024 | arXiv | 21 citations
STEREO: A Two-Stage Framework for Adversarially Robust Concept Erasing from Text-to-Image Diffusion Models | CVPR 2025 | arXiv | 14 citations
DyCON: Dynamic Uncertainty-aware Consistency and Contrastive Learning for Semi-supervised Medical Image Segmentation | CVPR 2025 | arXiv | 8 citations
Video-Panda: Parameter-efficient Alignment for Encoder-free Video-Language Models | CVPR 2025 | arXiv | 2 citations
STING-BEE: Towards Vision-Language Model for Real-World X-ray Baggage Security Inspection | CVPR 2025 | arXiv | 2 citations
Vision-Language Neural Graph Featurization for Extracting Retinal Lesions | ICCV 2025 | 0 citations
S3A: Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment | AAAI 2024 | 0 citations
VideoGrounding-DINO: Towards Open-Vocabulary Spatio-Temporal Video Grounding | CVPR 2024 | 0 citations
MixANT: Observation-dependent Memory Propagation for Stochastic Dense Action Anticipation | ICCV 2025 | arXiv | 0 citations