Learning Groupwise Explanations for Black-Box Models

Learning Groupwise Explanations for Black-Box Models

Jingyue Gao, Xiting Wang, Yasha Wang, Yulan Yan, Xing Xie

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence
Main Track. Pages 2396-2402. https://doi.org/10.24963/ijcai.2021/330

We study two user demands that are important during the exploitation of explanations in practice: 1) understanding the overall model behavior faithfully with limited cognitive load and 2) predicting the model behavior accurately on unseen instances. We illustrate that the two user demands correspond to two major sub-processes in the human cognitive process and propose a unified framework to fulfill them simultaneously. Given a local explanation method, our framework jointly 1) learns a limited number of groupwise explanations that interpret the model behavior on most instances with high fidelity and 2) specifies the region where each explanation applies. Experiments on six datasets demonstrate the effectiveness of our method.
Keywords:
Machine Learning: Explainable/Interpretable Machine Learning