Performances of K-Means Clustering Algorithm with Different Distance Metrics

Ghazal, T M, Hussain, M Z, Said, R A, Nadeem, A, Hasan, M K, Ahmad, M and Khan, M A (2021) Performances of K-Means Clustering Algorithm with Different Distance Metrics. Intelligent Automation & Soft Computing, 30 (2). pp. 735-742. ISSN 2326-005X

Abstract

Clustering is the process of grouping data based on similar properties. It categorizes a set of data into groups (clusters) whose elements share similarities: elements in the same cluster must be more similar to one another than to elements of different clusters. This similarity can be expressed as a distance measure. One of the most popular clustering algorithms is K-means, which measures the distance between every point of the dataset and the cluster centroids in order to find similar data objects and assign each to its nearest cluster. A range of distance metrics can be used to calculate point-to-point distances. In this research, the K-means clustering algorithm is evaluated with three different distance metrics in terms of execution time, across different datasets and different numbers of clusters. The results indicate that the Manhattan distance metric achieves the best results in most cases. These results also demonstrate that the choice of distance metric can affect the execution time and the number of clusters created by the K-means algorithm.
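
The kind of comparison the abstract describes can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the synthetic data, the Minkowski order p = 3, and the choice to update centroids with the arithmetic mean under every metric are all assumptions made for illustration. The three metrics are taken from the keywords listed for this item (Euclidean, Manhattan, Minkowski).

```python
import random
import time


def minkowski(a, b, p):
    """Minkowski distance of order p between two equal-length points."""
    return sum(abs(x - y) ** p for x, y in zip(a, b)) ** (1.0 / p)


def manhattan(a, b):
    """Manhattan distance: Minkowski distance with p = 1."""
    return minkowski(a, b, 1)


def euclidean(a, b):
    """Euclidean distance: Minkowski distance with p = 2."""
    return minkowski(a, b, 2)


def kmeans(points, k, distance, max_iter=100, seed=0):
    """Lloyd-style K-means with a pluggable distance function.

    Assumption: centroids are updated with the arithmetic mean for every
    metric, which is only the exact minimizer for Euclidean distance.
    """
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # k distinct data points as initial centroids
    labels = None
    for _ in range(max_iter):
        # Assignment step: each point goes to its nearest centroid.
        new_labels = [
            min(range(k), key=lambda j: distance(pt, centroids[j]))
            for pt in points
        ]
        if new_labels == labels:  # assignments stopped changing
            break
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for j in range(k):
            members = [pt for pt, lab in zip(points, labels) if lab == j]
            if members:  # leave an empty cluster's centroid where it is
                centroids[j] = tuple(sum(c) / len(members) for c in zip(*members))
    return labels, centroids


if __name__ == "__main__":
    # Hypothetical synthetic data: two Gaussian blobs in the plane.
    rng = random.Random(1)
    data = [(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(200)]
    data += [(rng.gauss(8, 1), rng.gauss(8, 1)) for _ in range(200)]

    metrics = {
        "euclidean": euclidean,
        "manhattan": manhattan,
        "minkowski (p=3, assumed)": lambda a, b: minkowski(a, b, 3),
    }
    for name, dist in metrics.items():
        start = time.perf_counter()
        labels, _ = kmeans(data, 2, dist)
        elapsed = time.perf_counter() - start
        print(f"{name}: {elapsed:.4f} s, {len(set(labels))} non-empty clusters")
```

Timings from a sketch like this depend on the data, the initial centroids, and the machine, so they say nothing about the paper's reported results; the point is only to show where the distance metric enters the assignment step and how execution time could be measured per metric.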

Affiliation: Skyline University College
SUC Author(s): Ghazal, T M ORCID: https://orcid.org/0000-0003-0672-7924
All Author(s): Ghazal, T M, Hussain, M Z, Said, R A, Nadeem, A, Hasan, M K, Ahmad, M and Khan, M A
Item Type: Article
Uncontrolled Keywords: K-means clustering; distance metrics; Euclidean distance; Manhattan distance; Minkowski distance
Subjects: B Information Technology > BA Information Systems
Divisions: Skyline University College > School of IT
Depositing User: Mr Veeramani Rasu
Date Deposited: 11 Feb 2022 12:27
Last Modified: 18 Jan 2024 07:17
URI: https://research.skylineuniversity.ac.ae/id/eprint/81
Publisher URL: https://doi.org/10.32604/iasc.2021.019067
Publisher OA policy: https://v2.sherpa.ac.uk/id/publication/37361