Online Semi-supervised Multi-label Classification with Label Compression and Local Smooth Regression

Online Semi-supervised Multi-label Classification with Label Compression and Local Smooth Regression

Peiyan Li, Honglian Wang, Christian Böhm, Junming Shao

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence
Main track. Pages 1359-1365. https://doi.org/10.24963/ijcai.2020/189

Online semi-supervised multi-label classification serves a practical yet challenging task since only a small number of labeled instances are available in real streaming environments. However, the mainstream of existing online classification techniques are focused on the single-label case, while only a few multi-label stream classification algorithms exist, and they are mainly trained on labeled instances. In this paper, we present a novel Online Semi-supervised Multi-Label learning algorithm (OnSeML) based on label compression and local smooth regression, which allows real-time multi-label predictions in a semi-supervised setting and is robust to evolving label distributions. Specifically, to capture the high-order label relationship and to build a compact target space for regression, OnSeML compresses the label set into a low-dimensional space by a fixed orthogonal label encoder. Then a locally defined regression function for each incoming instance is obtained with a closed-form solution. Targeting the evolving label distribution problem, we propose an adaptive decoding scheme to adequately integrate newly arriving labeled data. Extensive experiments provide empirical evidence for the effectiveness of our approach.
Keywords:
Data Mining: Classification, Semi-Supervised Learning
Data Mining: Mining Data Streams