On Gleaning Knowledge from Multiple Domains for Active Learning

On Gleaning Knowledge from Multiple Domains for Active Learning

Zengmao Wang, Bo Du, Lefei Zhang, Liangpei Zhang, Ruimin Hu, Dacheng Tao

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence
Main track. Pages 3013-3019. https://doi.org/10.24963/ijcai.2017/420

How can a doctor diagnose new diseases with little historical knowledge, which are emerging over time? Active learning is a promising way to address the problem by querying the most informative samples. Since the diagnosed cases for new disease are very limited, gleaning knowledge from other domains (classical prescriptions) to prevent the bias of active leaning would be vital for accurate diagnosis. In this paper, a framework that attempts to glean knowledge from multiple domains for active learning by querying the most uncertain and representative samples from the target domain and calculating the importance weights for re-weighting the source data in a single unified formulation is proposed. The weights are optimized by both a supervised classifier and distribution matching between the source domain and target domain with maximum mean discrepancy. Besides, a multiple domains active learning method is designed based on the proposed framework as an example. The proposed method is verified with newsgroups and handwritten digits data recognition tasks, where it outperforms the state-of-the-art methods.
Keywords:
Machine Learning: Active Learning
Machine Learning: Classification
Machine Learning: Machine Learning