Learning Robust and Privacy-Preserving Representations via Information Theory

4 citations · #1023 of 3028 papers in AAAI 2025

Abstract

Machine learning models are vulnerable to both security attacks (e.g., adversarial examples) and privacy attacks (e.g., private attribute inference). We take a first step toward mitigating both types of attack while maintaining task utility. In particular, we propose an information-theoretic framework that achieves these goals through the lens of representation learning, i.e., by learning representations that are robust to both adversarial examples and attribute-inference adversaries. We also derive novel theoretical results under our framework, including an inherent trade-off between adversarial robustness/utility and attribute privacy, and a guaranteed bound on attribute privacy leakage against attribute-inference adversaries.
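
The abstract describes the framework only at a high level. As a purely illustrative sketch, the snippet below shows one common way such information-theoretic objectives are instantiated in practice: an encoder is trained with a task loss, a robustness loss on adversarially perturbed (FGSM) inputs, and an adversarial attribute-inference head whose success serves as a variational proxy for the mutual information I(Z; S) between the representation Z and the private attribute S. Everything here (the toy architecture, the FGSM step, and the weights lam_priv and lam_adv) is an assumption for illustration, not the paper's actual construction.

```python
# Illustrative sketch only: a min-max objective in the spirit of the
# abstract, NOT the authors' method. Learn Z that (i) supports the task,
# (ii) leaks little about a private attribute S, (iii) resists FGSM noise.
import torch
import torch.nn as nn
import torch.nn.functional as F

encoder = nn.Sequential(nn.Linear(32, 64), nn.ReLU(), nn.Linear(64, 16))
task_head = nn.Linear(16, 2)   # predicts the task label Y from Z
attr_head = nn.Linear(16, 2)   # adversary: infers private attribute S from Z

opt_main = torch.optim.Adam(
    list(encoder.parameters()) + list(task_head.parameters()), lr=1e-3)
opt_attr = torch.optim.Adam(attr_head.parameters(), lr=1e-3)

lam_priv, lam_adv, eps = 1.0, 1.0, 0.1  # hypothetical trade-off weights / FGSM budget

def fgsm(x, y):
    # One-step adversarial example (FGSM) as a cheap robustness proxy.
    x = x.clone().requires_grad_(True)
    loss = F.cross_entropy(task_head(encoder(x)), y)
    (grad,) = torch.autograd.grad(loss, x)
    return (x + eps * grad.sign()).detach()

for step in range(200):
    x = torch.randn(64, 32)         # toy inputs
    y = torch.randint(0, 2, (64,))  # task labels
    s = torch.randint(0, 2, (64,))  # private attribute

    # 1) Train the attribute-inference adversary on the current (frozen) Z.
    z = encoder(x).detach()
    attr_loss = F.cross_entropy(attr_head(z), s)
    opt_attr.zero_grad(); attr_loss.backward(); opt_attr.step()

    # 2) Train encoder/task head: task loss + robustness loss on FGSM
    #    inputs, minus a privacy reward for fooling the adversary (a common
    #    variational proxy for minimizing I(Z; S)).
    z = encoder(x)
    task_loss = F.cross_entropy(task_head(z), y)
    robust_loss = F.cross_entropy(task_head(encoder(fgsm(x, y))), y)
    leak_loss = F.cross_entropy(attr_head(z), s)  # adversary's success on Z
    loss = task_loss + lam_adv * robust_loss - lam_priv * leak_loss
    opt_main.zero_grad(); loss.backward(); opt_main.step()
```

The sign flip on leak_loss makes the encoder and the attribute head play a min-max game; the trade-off the abstract mentions shows up directly as the tension between task_loss (plus robust_loss) and lam_priv * leak_loss.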

Citation History

Jan 28, 2026: 0 citations
Feb 13, 2026: 4 citations