"model security" Papers
8 papers found
Conference
Activation Gradient based Poisoned Sample Detection Against Backdoor Attacks
Danni Yuan, Mingda Zhang, Shaokui Wei et al.
ICLR 2025arXiv:2312.06230
11
citations
CL-Attack: Textual Backdoor Attacks via Cross-Lingual Triggers
Jingyi Zheng, Tianyi Hu, Tianshuo Cong et al.
AAAI 2025paperarXiv:2412.19037
12
citations
Prompt2Perturb (P2P): Text-Guided Diffusion-Based Adversarial Attack on Breast Ultrasound Images
Yasamin Medghalchi, Moein Heidari, Clayton Allard et al.
CVPR 2025arXiv:2412.09910
4
citations
Rethinking Byzantine Robustness in Federated Recommendation from Sparse Aggregation Perspective
Zhongjian Zhang, Mengmei Zhang, Xiao Wang et al.
AAAI 2025paperarXiv:2501.03301
3
citations
Backdoor Attacks via Machine Unlearning
Zihao Liu, Tianhao Wang, Mengdi Huai et al.
AAAI 2024paperarXiv:2510.13322
Concept Arithmetics for Circumventing Concept Inhibition in Diffusion Models
Vitali Petsiuk, Kate Saenko
ECCV 2024arXiv:2404.13706
8
citations
Defense against Backdoor Attack on Pre-trained Language Models via Head Pruning and Attention Normalization
Xingyi Zhao, Depeng Xu, Shuhan Yuan
ICML 2024
Elijah: Eliminating Backdoors Injected in Diffusion Models via Distribution Shift
Shengwei An, Sheng-Yen Chou, Kaiyuan Zhang et al.
AAAI 2024paperarXiv:2312.00050
43
citations