Human-in-the-loop Active Covariance Learning for Improving Prediction in Small Data Sets

Homayun Afrabandpey, Tomi Peltola, Samuel Kaski

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence
Main track. Pages 1959-1966. https://doi.org/10.24963/ijcai.2019/271

Learning predictive models from small high-dimensional data sets is a key problem in high-dimensional statistics. Expert knowledge elicitation can help, and a strong line of work focuses on directly eliciting informative prior distributions for parameters. This either requires considerable statistical expertise or is laborious, as the emphasis has been on the accuracy rather than the efficiency of the process. Another line of work queries the expert about the importance of features one at a time, assuming the features to be independent and hence missing covariance information. In contrast, we propose eliciting expert knowledge about pairwise feature similarities, to borrow statistical strength in the predictions, and using sequential decision making techniques to minimize the effort of the expert. Empirical results demonstrate improved predictive performance on both simulated and real data in high-dimensional linear regression tasks, where we learn the covariance structure with a Gaussian process based on sequential elicitation.
Keywords:
Machine Learning: Active Learning
Humans and AI: Human-Computer Interaction
Machine Learning: Probabilistic Machine Learning