Affinity Learning for Mixed Data Clustering

Nan Li; Longin Jan Latecki

Affinity Learning for Mixed Data Clustering

Nan Li, Longin Jan Latecki

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence

Main track. Pages 2173-2179. https://doi.org/10.24963/ijcai.2017/302

PDF BibTeX

In this paper, we propose a novel affinity learning based framework for mixed data clustering, which includes: how to process data with mixed-type attributes, how to learn affinities between data points, and how to exploit the learned affinities for clustering. In the proposed framework, each original data attribute is represented with several abstract objects defined according to the specific data type and values. Each attribute value is transformed into the initial affinities between the data point and the abstract objects of attribute. We refine these affinities and infer the unknown affinities between data points by taking into account the interconnections among the attribute values of all data points. The inferred affinities between data points can be exploited for clustering. Alternatively, the refined affinities between data points and the abstract objects of attributes can be transformed into new data features for clustering. Experimental results on many real world data sets demonstrate that the proposed framework is effective for mixed data clustering.

Keywords:

Machine Learning: Data Mining

Machine Learning: Unsupervised Learning