Binary Classification from Positive Data with Skewed Confidence

Kazuhiko Shinoda, Hirotaka Kaji, Masashi Sugiyama

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence
Main track. Pages 3328-3334. https://doi.org/10.24963/ijcai.2020/460

Positive-confidence (Pconf) classification [Ishida et al., 2018] is a promising weakly supervised learning method that trains a binary classifier only from positive data equipped with confidence. In practice, however, the confidence may be skewed by bias arising in the annotation process. A Pconf classifier cannot be learned properly from skewed confidence, and its classification performance may deteriorate as a result. In this paper, we introduce a parameterized model of the skewed confidence and propose a method for selecting the hyperparameter that cancels out the negative impact of the skew, under the assumption that the misclassification rate of positive samples is available as prior knowledge. We demonstrate the effectiveness of the proposed method through a synthetic experiment with simple linear models and through benchmark problems with neural network models. We also apply our method to drivers’ drowsiness prediction to show that it works well on a real-world problem where confidence is obtained from manual annotation.
Keywords:
Machine Learning Applications: Applications of Unsupervised Learning
Humans and AI: Personalization and User Modeling
Multidisciplinary Topics and Applications: Transportation
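
Illustrative sketch:
For readers unfamiliar with Pconf classification, the following minimal sketch shows the empirical Pconf risk of Ishida et al. [2018], in which each positive sample x_i with confidence r_i = p(y=+1 | x_i) contributes loss(g(x_i)) + ((1 - r_i) / r_i) * loss(-g(x_i)). The logistic loss, the function names, and in particular the power-law distortion `power_skew` are assumptions made for illustration only; the latter is a hypothetical stand-in and not necessarily the parameterized skew model proposed in this paper.

```python
import numpy as np


def logistic_loss(z):
    # log(1 + exp(-z)), computed in a numerically stable way.
    return np.logaddexp(0.0, -z)


def pconf_empirical_risk(scores, confidence):
    """Empirical Pconf risk (up to the constant class-prior factor).

    scores:     classifier outputs g(x_i) on positive samples only
    confidence: r_i = p(y=+1 | x_i), each in (0, 1]
    """
    scores = np.asarray(scores, dtype=float)
    confidence = np.asarray(confidence, dtype=float)
    if np.any(confidence <= 0.0) or np.any(confidence > 1.0):
        raise ValueError("confidence values must lie in (0, 1]")
    # Weight on the "negative" loss term: (1 - r) / r.
    weight = (1.0 - confidence) / confidence
    return float(np.mean(logistic_loss(scores) + weight * logistic_loss(-scores)))


def power_skew(confidence, k):
    # HYPOTHETICAL distortion r -> r**k, used only to illustrate how an
    # annotation bias changes the objective; not taken from the paper.
    return np.asarray(confidence, dtype=float) ** k


if __name__ == "__main__":
    g = np.array([1.5, 0.2, -0.3])        # made-up classifier scores
    r_true = np.array([0.9, 0.6, 0.4])    # made-up true confidences
    r_skewed = power_skew(r_true, 0.5)    # overconfident annotator (k < 1)
    print("risk with true confidence:  ", pconf_empirical_risk(g, r_true))
    print("risk with skewed confidence:", pconf_empirical_risk(g, r_skewed))
```

Because the weight (1 - r) / r depends directly on the confidence, any systematic distortion of r changes the objective being minimized, which is the failure mode the abstract describes.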