Descriptive Clustering: ILP and CP Formulations with Applications
Descriptive Clustering: ILP and CP Formulations with Applications
Thi-Bich-Hanh Dao, Chia-Tung Kuo, S. S. Ravi, Christel Vrain, Ian Davidson
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence
Main track. Pages 1263-1269.
https://doi.org/10.24963/ijcai.2018/176
In many settings just finding a good clustering is insufficient and
an explanation of the clustering is required. If the features used
to perform the clustering are interpretable then methods such as
conceptual clustering can be used. However, in many applications
this is not the case particularly for image, graph and other complex
data. Here we explore the setting where a set of interpretable
discrete tags for each instance is available. We formulate the
descriptive clustering problem as a bi-objective optimization
to simultaneously find compact clusters using the features and to
describe them using the tags. We present our formulation in a
declarative platform and show it can be integrated into a standard
iterative algorithm to find all Pareto optimal solutions to the two
objectives. Preliminary results demonstrate the utility of our
approach on real data sets for images and electronic health care
records and that it outperforms single objective and multi-view clustering
baselines.
Keywords:
Constraints and SAT: Modeling;Formulation
Constraints and SAT: Constraints and Data Mining ; Machine Learning
Machine Learning: Clustering