Multi-label Feature Selection via Global Relevance and Redundancy Optimization

Multi-label Feature Selection via Global Relevance and Redundancy Optimization

Jia Zhang, Yidong Lin, Min Jiang, Shaozi Li, Yong Tang, Kay Chen Tan

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence
Main track. Pages 2512-2518. https://doi.org/10.24963/ijcai.2020/348

Information theoretical based methods have attracted a great attention in recent years, and gained promising results to deal with multi-label data with high dimensionality. However, most of the existing methods are either directly transformed from heuristic single-label feature selection methods or inefficient in exploiting labeling information. Thus, they may not be able to get an optimal feature selection result shared by multiple labels. In this paper, we propose a general global optimization framework, in which feature relevance, label relevance (i.e., label correlation), and feature redundancy are taken into account, thus facilitating multi-label feature selection. Moreover, the proposed method has an excellent mechanism for utilizing inherent properties of multi-label learning. Specially, we provide a formulation to extend the proposed method with label-specific features. Empirical studies on twenty multi-label data sets reveal the effectiveness and efficiency of the proposed method. Our implementation of the proposed method is available online at: https://jiazhang-ml.pub/GRRO-master.zip.
Keywords:
Machine Learning: Feature Selection; Learning Sparse Models
Data Mining: Feature Extraction, Selection and Dimensionality Reduction