loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Authors: Afees Adegoke Odebode ; Allan Tucker ; Mahir Arzoky and Stepehen Swift

Affiliation: Brunel University, London, U.K

Keyword(s): Ensemble Clustering, Subset Selection, Cluster Analysis, Number of Clusters.

Abstract: This research estimates the optimal number of clusters in a dataset using a novel ensemble technique - a preferred alternative to relying on the output of a single clustering. Combining clusterings from different algorithms can lead to a more stable and robust solution, often unattainable by any single clustering solution. Technically, we created subsets of ensembles as possible estimates; and evaluated them using a quality metric to obtain the best subset. We tested our method on publicly available datasets of varying types, sources and clustering difficulty to establish the accuracy and performance of our approach against eight standard methods. Our method outperforms all the techniques in the number of clusters estimated correctly. Due to the exhaustive nature of the initial algorithm, it is slow as the number of ensembles or the solution space increases; hence, we have provided an updated version based on the single-digit difference of Gray code that runs in linear time in terms of the subset size. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 3.15.197.123

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Odebode, A.; Tucker, A.; Arzoky, M. and Swift, S. (2022). Estimating the Optimal Number of Clusters from Subsets of Ensembles. In Proceedings of the 11th International Conference on Data Science, Technology and Applications - DATA; ISBN 978-989-758-583-8; ISSN 2184-285X, SciTePress, pages 383-391. DOI: 10.5220/0011275000003269

@conference{data22,
author={Afees Adegoke Odebode. and Allan Tucker. and Mahir Arzoky. and Stepehen Swift.},
title={Estimating the Optimal Number of Clusters from Subsets of Ensembles},
booktitle={Proceedings of the 11th International Conference on Data Science, Technology and Applications - DATA},
year={2022},
pages={383-391},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0011275000003269},
isbn={978-989-758-583-8},
issn={2184-285X},
}

TY - CONF

JO - Proceedings of the 11th International Conference on Data Science, Technology and Applications - DATA
TI - Estimating the Optimal Number of Clusters from Subsets of Ensembles
SN - 978-989-758-583-8
IS - 2184-285X
AU - Odebode, A.
AU - Tucker, A.
AU - Arzoky, M.
AU - Swift, S.
PY - 2022
SP - 383
EP - 391
DO - 10.5220/0011275000003269
PB - SciTePress