dc.contributor.author |
Estella, Tricia |
|
dc.contributor.author |
Andrita Intan Ghayatrie, Nadzla |
|
dc.contributor.author |
Wibowo, Antoni |
|
dc.date.accessioned |
2024-03-16T13:46:27Z |
|
dc.date.available |
2024-03-16T13:46:27Z |
|
dc.date.issued |
2024-03-14 |
|
dc.identifier.issn |
2210-142X |
|
dc.identifier.uri |
https://journal.uob.edu.bh:443/handle/123456789/5522 |
|
dc.description.abstract |
The presence of outliers in data often disrupts modeling results, especially in population clustering and behavioral analysis.
Although there are various clustering algorithms that are robust to outliers, such as DBSCAN and t-SNE, K-Means still has challenges
in dealing with them. This paper introduces an optimized K-Means LTS algorithm, which incorporates Least Trimmed Square technique
to reduce outliers after clustering process. The outlier trimming process occurs after the clustering process, allowing for trimming within
each cluster. This algorithm will be compared with K-Means and Robust Trimmed K-Means (RTKM), which both use outlier pruning.
The comparison of these three algorithms will consider performance metrics using Silhouette Score and Davies-Bouldin Index, also the
run time processes. As a result, K-Means LTS is consistently shown to perform better than K-Means and RTKM when implemented on
ten various datasets. In the future, there may be further developments related to determining the best percentage for trimming outliers. |
en_US |
dc.language.iso |
en |
en_US |
dc.publisher |
University of Bahrain |
en_US |
dc.subject |
Clustering; Least Trimmed Squares; K-Means; Robust clustering; Noisy data; Outliers |
en_US |
dc.title |
Outlier Handling in Clustering: A Comparative Experiment of K-Means, Robust Trimmed K-Means, and K-Means Least Trimmed Squared |
en_US |
dc.identifier.doi |
http://dx.doi.org/10.12785/ijcds/160175 |
|
dc.volume |
16 |
en_US |
dc.issue |
1 |
en_US |
dc.pagestart |
1029 |
en_US |
dc.pageend |
1039 |
en_US |
dc.contributor.authorcountry |
Indonesia |
en_US |
dc.contributor.authorcountry |
Indonesia |
en_US |
dc.contributor.authorcountry |
Indonesia |
en_US |
dc.contributor.authoraffiliation |
Master of Information Technology, BINUS Graduate Program, BINUS University |
en_US |
dc.contributor.authoraffiliation |
Master of Information Technology, BINUS Graduate Program, BINUS University |
en_US |
dc.contributor.authoraffiliation |
Master of Information Technology, BINUS Graduate Program, BINUS University |
en_US |
dc.source.title |
International Journal of Computing and Digital Systems |
en_US |
dc.abbreviatedsourcetitle |
IJCDS |
en_US |