Curriculum Adversarial Training

Qi-Zhi Cai; Chang Liu; Dawn Song

Curriculum Adversarial Training

Qi-Zhi Cai, Chang Liu, Dawn Song

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence

Main track. Pages 3740-3747. https://doi.org/10.24963/ijcai.2018/520

PDF BibTeX

Recently, deep learning has been applied to many security-sensitive applications, such as facial authentication. The existence of adversarial examples hinders such applications. The state-of-the-art result on defense shows that adversarial training can be applied to train a robust model on MNIST against adversarial examples; but it fails to achieve a high empirical worst-case accuracy on a more complex task, such as CIFAR-10 and SVHN. In our work, we propose curriculum adversarial training (CAT) to resolve this issue. The basic idea is to develop a curriculum of adversarial examples generated by attacks with a wide range of strengths. With two techniques to mitigate the catastrophic forgetting and the generalization issues, we demonstrate that CAT can improve the prior art's empirical worst-case accuracy by a large margin of 25% on CIFAR-10 and 35% on SVHN. At the same, the model's performance on non-adversarial inputs is comparable to the state-of-the-art models.

Keywords:

Multidisciplinary Topics and Applications: Security and Privacy

Machine Learning: Deep Learning