Kno.e.sis Publications

A Clustering Comparison Measure Using Density Profiles and its Application to the Discovery of Alternate Clusterings

Eric Bae
James Bailey
Guozhu Dong, Wright State University - Main Campus

Document Type

Article

Publication Date

11-2010

Abstract

Data clustering is a fundamental and very popular method of data analysis. Its subjective nature, however, means that different clustering algorithms or different parameter settings can produce widely varying and sometimes conflicting results. This has led to the use of clustering comparison measures to quantify the degree of similarity between alternative clusterings. Existing measures, though, can be limited in their ability to assess similarity and sometimes generate unintuitive results. They also cannot be applied to compare clusterings that contain different data points, an activity which is important for scenarios such as data stream analysis. In this paper, we introduce a new clustering similarity measure, known as ADCO, which aims to address some limitations of existing measures by allowing greater flexibility of comparison via the use of density profiles to characterize a clustering. In particular, it adopts a ‘data mining style’ philosophy to clustering comparison, whereby two clusterings are considered to be more similar if they are likely to give rise to similar types of prediction models. Furthermore, we show that this new measure can be applied as a highly effective objective function within a new algorithm, known as MAXIMUS, for generating alternate clusterings.

Comments

Attached is the unpublished, peer-reviewed authors' version of the article. The final publisher's version of the article is available at http://dx.doi.org/10.1007/s10618-009-0164-z.

Repository Citation

Bae, E., Bailey, J., & Dong, G. (2010). A Clustering Comparison Measure Using Density Profiles and its Application to the Discovery of Alternate Clusterings. Data Mining and Knowledge Discovery, 21(3), 427-471.
https://corescholar.libraries.wright.edu/knoesis/423

DOI

10.1007/s10618-009-0164-z

Included in

Other Computer Sciences Commons
