Francesco Croce
11 papers
2,052 total citations
papers (11)
Square Attack: a query-efficient black-box adversarial attack via random search
ECCV 2020
arXiv
1,191 citations
Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks
ICLR 2025
arXiv
401 citations
Diffusion Visual Counterfactual Explanations
NeurIPS 2022
arXiv
101 citations
Revisiting Adversarial Training for ImageNet: Architectures, Training and Generalization across Threat Models
NeurIPS 2023
arXiv
96 citations
Robust CLIP: Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models
ICML 2024
arXiv
88 citations
Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning
ICML 2024
arXiv
88 citations
OS-Harm: A Benchmark for Measuring Safety of Computer Use Agents
NeurIPS 2025
arXiv
25 citations
Is In-Context Learning Sufficient for Instruction Following in LLMs?
ICLR 2025
arXiv
22 citations
Seasoning Model Soups for Robustness to Adversarial and Natural Distribution Shifts
CVPR 2023
arXiv
22 citations
Towards Reliable Evaluation and Fast Training of Robust Semantic Segmentation Models
ECCV 2024
arXiv
12 citations
Selective induction Heads: How Transformers Select Causal Structures in Context
ICLR 2025
arXiv
6 citations