AutoML for Outlier Detection with Optimal Transport Distances

AutoML for Outlier Detection with Optimal Transport Distances

Prabhant Singh, Joaquin Vanschoren

Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence
Demo Track. Pages 7175-7178. https://doi.org/10.24963/ijcai.2023/843

Automated machine learning (AutoML) has been widely researched and adopted for supervised problems, but progress in unsupervised settings has been limited. We propose `"LOTUS", a novel framework to automate outlier detection based on meta-learning. Our premise is that the selection of the optimal outlier detection technique depends on the inherent properties of the data distribution. We leverage optimal transport to find the dataset with the most similar underlying distribution, and then apply the outlier detection techniques that proved to work best for that data distribution. We evaluate the robustness of our framework and find that it outperforms all state-of-the-art automated outlier detection tools. This approach can also be easily generalized to automate other unsupervised settings.
Keywords:
Data Mining: DM: Anomaly/outlier detection
Machine Learning: ML: Automated machine learning