University of Bahrain
Scientific Journals

Outlier Handling in Clustering: A Comparative Experiment of K-Means, Robust Trimmed K-Means, and K-Means Least Trimmed Squared

Show simple item record

dc.contributor.author Estella, Tricia
dc.contributor.author Andrita Intan Ghayatrie, Nadzla
dc.contributor.author Wibowo, Antoni
dc.date.accessioned 2024-03-16T13:46:27Z
dc.date.available 2024-03-16T13:46:27Z
dc.date.issued 2024-03-14
dc.identifier.issn 2210-142X
dc.identifier.uri https://journal.uob.edu.bh:443/handle/123456789/5522
dc.description.abstract The presence of outliers in data often disrupts modeling results, especially in population clustering and behavioral analysis. Although there are various clustering algorithms that are robust to outliers, such as DBSCAN and t-SNE, K-Means still has challenges in dealing with them. This paper introduces an optimized K-Means LTS algorithm, which incorporates Least Trimmed Square technique to reduce outliers after clustering process. The outlier trimming process occurs after the clustering process, allowing for trimming within each cluster. This algorithm will be compared with K-Means and Robust Trimmed K-Means (RTKM), which both use outlier pruning. The comparison of these three algorithms will consider performance metrics using Silhouette Score and Davies-Bouldin Index, also the run time processes. As a result, K-Means LTS is consistently shown to perform better than K-Means and RTKM when implemented on ten various datasets. In the future, there may be further developments related to determining the best percentage for trimming outliers. en_US
dc.language.iso en en_US
dc.publisher University of Bahrain en_US
dc.subject Clustering; Least Trimmed Squares; K-Means; Robust clustering; Noisy data; Outliers en_US
dc.title Outlier Handling in Clustering: A Comparative Experiment of K-Means, Robust Trimmed K-Means, and K-Means Least Trimmed Squared en_US
dc.identifier.doi http://dx.doi.org/10.12785/ijcds/160175
dc.volume 16 en_US
dc.issue 1 en_US
dc.pagestart 1029 en_US
dc.pageend 1039 en_US
dc.contributor.authorcountry Indonesia en_US
dc.contributor.authorcountry Indonesia en_US
dc.contributor.authorcountry Indonesia en_US
dc.contributor.authoraffiliation Master of Information Technology, BINUS Graduate Program, BINUS University en_US
dc.contributor.authoraffiliation Master of Information Technology, BINUS Graduate Program, BINUS University en_US
dc.contributor.authoraffiliation Master of Information Technology, BINUS Graduate Program, BINUS University en_US
dc.source.title International Journal of Computing and Digital Systems en_US
dc.abbreviatedsourcetitle IJCDS en_US


Files in this item

This item appears in the following Issue(s)

Show simple item record

All Journals


Advanced Search

Browse

Administrator Account