CAMH: Advancing Model Hijacking Attack in Machine Learning

0 citations

arXiv:2408.13741

#2074 of 3028 papers in AAAI 2025

Top Authors

Xing He, Jiahao Chen, Yuwen Pu, Qingming Li, Chunyi Zhou, Yingcai Wu, Jinbao Li, Shouling Ji

Topics

model hijacking attacks, adversarial machine learning, synchronized training layers, dual-loop optimization, security vulnerabilities, pre-trained models, task performance balance, category-agnostic attacks

Abstract

In the burgeoning domain of machine learning, reliance on third-party training services and the adoption of pre-trained models have surged. However, this reliance introduces vulnerabilities to model hijacking attacks, in which adversaries manipulate a model into performing unintended tasks without the model owner's knowledge. Such attacks raise significant security and ethical concerns; for example, an ordinary image classifier could be turned into a tool for detecting faces in pornographic content. This paper introduces Category-Agnostic Model Hijacking (CAMH), a novel model hijacking attack that addresses three challenges: class number mismatch, data distribution divergence, and performance balance between the original and hijacking tasks. CAMH incorporates synchronized training layers, random noise optimization, and a dual-loop optimization approach to execute the hijacking task effectively with minimal impact on the original task's performance. We evaluate CAMH across multiple benchmark datasets and network architectures, demonstrating strong attack effectiveness with minimal degradation of original-task performance.
