Hierarchical Active Learning with Group Proportion Feedback

Zhipeng Luo, Milos Hauskrecht

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence
Main track. Pages 2532-2538. https://doi.org/10.24963/ijcai.2018/351

In practice, learning classification models often requires substantial human annotation effort, in which annotators assign class labels to individual data instances. Because this process can be time-consuming and costly, finding effective ways to reduce the annotation cost is critical for building such models. In this work we address this problem by exploring a new approach that actively learns classification models from groups, which are subpopulations of instances, and from human feedback on those groups. Each group is labeled with a number in the interval [0,1] that represents a human estimate of the proportion of instances in the subpopulation that carry one of the class labels. To form the groups to be annotated, we develop a hierarchical active learning framework that divides the whole population into smaller subpopulations, which allows us to gradually learn more refined models from the subpopulations and their class proportion labels. Our extensive experiments on numerous datasets show that our method is competitive and outperforms existing approaches for reducing the human annotation cost.
Keywords:
Machine Learning: Active Learning
Machine Learning: Classification
Machine Learning: Cost-Sensitive Learning
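
The idea in the abstract, querying an annotator for class proportions of groups and refining the groups hierarchically, can be illustrated with a toy sketch. This is not the authors' algorithm: the one-dimensional feature, the median split rule, the size-weighted impurity score, the simulated annotator, and all names (`Group`, `annotate`, `split`, `uncertainty`, `refine`) are assumptions made only for illustration.

```python
from dataclasses import dataclass
from typing import List, Tuple

# Hypothetical instance format: (feature value, hidden true label in {0, 1}).
Instance = Tuple[float, int]


@dataclass
class Group:
    members: List[Instance]
    proportion: float  # annotator's estimate of the fraction of positives


def annotate(members: List[Instance]) -> float:
    """Simulated annotator (assumption): returns the exact positive fraction."""
    return sum(label for _, label in members) / len(members)


def split(group: Group) -> List[Group]:
    """Illustrative split rule: cut at the median feature value, query both halves."""
    ordered = sorted(group.members)
    mid = len(ordered) // 2
    halves = [ordered[:mid], ordered[mid:]]
    return [Group(h, annotate(h)) for h in halves if h]


def uncertainty(group: Group) -> float:
    """Size-weighted impurity: large groups with proportion near 0.5 score highest."""
    p = group.proportion
    return len(group.members) * p * (1.0 - p)


def refine(data: List[Instance], budget: int) -> List[Group]:
    """Spend at most `budget` proportion queries refining the most uncertain group."""
    leaves = [Group(data, annotate(data))]  # the root query costs one unit
    queries = 1
    while queries + 2 <= budget:  # each split costs two further queries
        target = max(leaves, key=uncertainty)
        if uncertainty(target) == 0.0:
            break  # every remaining group is pure; nothing left to refine
        leaves.remove(target)
        leaves.extend(split(target))
        queries += 2
    return leaves


# Toy data: instances with feature >= 0.5 are positive.
data = [(x / 10, 1 if x >= 5 else 0) for x in range(10)]
leaves = refine(data, budget=3)
```

With this toy data and a budget of three queries, the root group (proportion 0.5) is split once into two pure groups, so each instance can be assigned its leaf group's proportion as a class probability. A real system would replace the simulated annotator with human estimates, which may be noisy.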