α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Michael Backes
Michael Backes
7
papers
52
total citations
papers (7)
Can't Steal? Cont-Steal! Contrastive Stealing Attacks Against Image Encoders
CVPR 2023
arXiv
46
citations
Captured by Captions: On Memorization and its Mitigation in CLIP Models
ICLR 2025
arXiv
4
citations
Finding and Reactivating Post-Trained LLMs' Hidden Safety Mechanisms
NEURIPS 2025
1
citations
Generating Less Certain Adversarial Examples Improves Robust Generalization
ICLR 2025
arXiv
1
citations
Hate in Plain Sight: On the Risks of Moderating AI-Generated Hateful Illusions
ICCV 2025
arXiv
0
citations
Position: TrustLLM: Trustworthiness in Large Language Models
ICML 2024
0
citations
Provably Cost-Sensitive Adversarial Defense via Randomized Smoothing
ICML 2025
arXiv
0
citations