Performance evaluation of similarity measures for K-means clustering algorithm

D.  Usman; S.F. Sani

doi:10.4314/bajopas.v12i2.21

download PDF

Published:

Feb 12, 2021

DOI:

10.4314/bajopas.v12i2.21

Keywords:

k-means clustering similarity measures squared euclidean distance manhattan distance

Issue

Vol. 12 No. 2 (2020)

Section

Articles

Copyright is owned by The Faculty of Science, Bayero University

D. Usman

S.F. Sani

Abstract

Clustering is a useful technique that organizes a large quantity of unordered datasets into a small number of meaningful and coherent clusters. Every clustering method is based on the index of similarity or dissimilarity between data points. However, the true intrinsic structure of the data could be correctly described by the similarity formula defined and embedded in the clustering criterion function. This paper uses squared Euclidean distance and Manhattan distance to investigates the best method for measuring similarity between data objects in sparse and high-dimensional domain which is fast, capable of providing high quality clustering result and consistent. The performances of these two methods were reported with simulated high dimensional datasets.

Bayero Journal of Pure and Applied Sciences
Journal / Bayero Journal of Pure and Applied Sciences / Vol. 12 No. 2 (2020) / Articles

Published:

DOI:

Keywords:

Performance evaluation of similarity measures for K-means clustering algorithm

D. Usman

S.F. Sani

Abstract

Journal Identifiers

Article Sidebar

Published:

DOI:

Keywords:

Article Details

Main Article Content

D. Usman

S.F. Sani

Abstract

Journal Identifiers