Binary Classification from Positive Data with Skewed Confidence

Kazuhiko Shinoda; Hirotaka Kaji; Masashi Sugiyama

doi:10.24963/ijcai.2020/460

Binary Classification from Positive Data with Skewed Confidence

Kazuhiko Shinoda, Hirotaka Kaji, Masashi Sugiyama

Short video

Long video

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence

Main track. Pages 3328-3334. https://doi.org/10.24963/ijcai.2020/460

PDF BibTeX

Positive-confidence (Pconf) classification [Ishida et al., 2018] is a promising weakly-supervised learning method which trains a binary classifier only from positive data equipped with confidence. However, in practice, the confidence may be skewed by bias arising in an annotation process. The Pconf classifier cannot be properly learned with skewed confidence, and consequently, the classification performance might be deteriorated. In this paper, we introduce the parameterized model of the skewed confidence, and propose the method for selecting the hyperparameter which cancels out the negative impact of the skewed confidence under the assumption that we have the misclassification rate of positive samples as a prior knowledge. We demonstrate the effectiveness of the proposed method through a synthetic experiment with simple linear models and benchmark problems with neural network models. We also apply our method to drivers’ drowsiness prediction to show that it works well with a real-world problem where confidence is obtained based on manual annotation.

Keywords:

Machine Learning Applications: Applications of Unsupervised Learning

Humans and AI: Personalization and User Modeling

Multidisciplinary Topics and Applications: Transportation