Counterexample-Guided Data Augmentation

Counterexample-Guided Data Augmentation

Tommaso Dreossi, Shromona Ghosh, Xiangyu Yue, Kurt Keutzer, Alberto Sangiovanni-Vincentelli, Sanjit A. Seshia

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence
Main track. Pages 2071-2078. https://doi.org/10.24963/ijcai.2018/286

We present a novel framework for augmenting data sets for machine learning based on counterexamples. Counterexamples are misclassified examples that have important properties for retraining and improving the model. Key components of our framework include a \textit{counterexample generator}, which produces data items that are misclassified by the model and error tables, a novel data structure that stores information pertaining to misclassifications. Error tables can be used to explain the model's vulnerabilities and are used to efficiently generate counterexamples for augmentation. We show the efficacy of the proposed framework by comparing it to classical augmentation techniques on a case study of object detection in autonomous driving based on deep neural networks.
Keywords:
Machine Learning: Neural Networks
Machine Learning: Deep Learning
Multidisciplinary Topics and Applications: Validation and Verification
Computer Vision: Recognition: Detection, Categorization, Indexing, Matching, Retrieval, Semantic Interpretation
Computer Vision: Big Data and Large Scale Methods
Computer Vision: Statistical Methods and Machine Learning