Robustness of Autoencoders for Anomaly Detection Under Adversarial Impact

Adam Goodge, Bryan Hooi, See Kiong Ng, Wee Siong Ng

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence
Main track. Pages 1244-1250. https://doi.org/10.24963/ijcai.2020/173

Detecting anomalies is an important task in a wide variety of applications and domains. Deep learning methods have achieved state-of-the-art performance in anomaly detection in recent years, with unsupervised methods being particularly popular. However, deep learning methods can be fragile to small perturbations in the input data, which an adversary can exploit to deliberately hinder model performance; this is known as an adversarial attack. This phenomenon has been widely studied in the context of supervised image classification since its discovery, but such studies are sorely lacking for the anomaly detection setting. Moreover, the plethora of defense mechanisms that have been proposed are often not applicable to unsupervised anomaly detection models. In this work, we study the effect of adversarial attacks on the performance of anomaly-detecting autoencoders using real data from a cyber-physical system (CPS) testbed with intervals of controlled, physical attacks as anomalies. An adversary would attempt to disguise these points as normal through adversarial perturbations. To combat this, we propose the Approximate Projection Autoencoder (APAE), which incorporates two defenses against such attacks into a general autoencoder. One of these involves a novel technique to improve robustness under adversarial impact by optimising latent representations for better reconstruction outputs.
Keywords:
Data Mining: Mining Spatial, Temporal Data
Machine Learning Applications: Applications of Unsupervised Learning
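To make the latent-optimisation idea in the abstract concrete, below is a minimal sketch, not the authors' implementation: a plain PyTorch autoencoder whose latent code is refined by a few gradient steps that reduce reconstruction error before the anomaly score is computed. The layer sizes, feature dimension, step count, and learning rate are all illustrative assumptions.

```python
import torch
import torch.nn as nn

# Minimal sketch (assumed architecture, not from the paper): a small
# fully-connected autoencoder for multivariate CPS sensor readings.
class Autoencoder(nn.Module):
    def __init__(self, in_dim=51, latent_dim=8):
        super().__init__()
        self.encoder = nn.Sequential(nn.Linear(in_dim, 32), nn.ReLU(), nn.Linear(32, latent_dim))
        self.decoder = nn.Sequential(nn.Linear(latent_dim, 32), nn.ReLU(), nn.Linear(32, in_dim))

    def forward(self, x):
        return self.decoder(self.encoder(x))

def refined_anomaly_score(model, x, steps=5, lr=1e-2):
    """Encode x, then take a few gradient steps on the latent code z to lower
    the reconstruction error, and score with the refined reconstruction.
    Hyperparameters (steps, lr) are illustrative, not the paper's values."""
    z = model.encoder(x).detach().requires_grad_(True)
    opt = torch.optim.SGD([z], lr=lr)
    for _ in range(steps):
        opt.zero_grad()
        loss = ((model.decoder(z) - x) ** 2).mean()
        loss.backward()
        opt.step()
    with torch.no_grad():
        # Per-sample reconstruction error used as the anomaly score.
        return ((model.decoder(z) - x) ** 2).mean(dim=-1)

# Usage: score a batch of (hypothetical) 51-dimensional sensor readings.
# model = Autoencoder(in_dim=51)
# scores = refined_anomaly_score(model, torch.randn(16, 51))
```

The intuition is that a genuinely normal point, even after an adversarial perturbation, can still be reconstructed well once its latent code is adjusted, whereas a true anomaly remains poorly reconstructed, so the refined score is harder to fool.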