loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Authors: Diego Vallejo-Huanga 1 ; 2 ; Cèsar Ferri 2 and Fernando Martínez-Plumed 2

Affiliations: 1 IDEIAGEOCA Research Group, Universidad Politécnica Salesiana, Quito, Ecuador ; 2 VRAIN, Universitat Politècnica de València, Valencia, Spain

Keyword(s): Size-Constrained Clustering, K-MedoidsSC, CSCLP, Interactive Web Application, R Shiny, User Experience.

Abstract: Size-constrained clustering addresses a fundamental need in many real-world applications by ensuring that clusters adhere to user-specified size limits, whether to balance groups or to satisfy domain-specific requirements. In this paper, we present ClustSize, an interactive web platform that implements two advanced algorithms: K-MedoidsSC and CSCLP, to perform real-time clustering of tabular data under strict size constraints. Developed in R Studio using the Shiny framework and deployed on Shinyapps.io, ClustSize not only enforces precise cluster cardinalities, but also facilitates dynamic parameter tuning and visualisation for enhanced user exploration. We comprehensive validate its performance through comprehensive benchmarking, also evaluating runtime, RAM usage, load, and stress conditions, and gather usability insights via user surveys. Post-deployment evaluations confirm that both algorithms consistently produce clusters that exactly meet the specified size limits, and that the system reliably supports up to 50 concurrent users and maintains functionality under stress, processing approximately 90 requests in 5 seconds. These results highlight the potential of integrating advanced size-constrained clustering into interactive web platforms for practical data analysis. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 216.73.216.40

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Vallejo-Huanga, D., Ferri, C. and Martínez-Plumed, F. (2025). ClustSize: An Algorithmic Framework for Size-Constrained Clustering. In Proceedings of the 14th International Conference on Data Science, Technology and Applications - Volume 1: DATA; ISBN 978-989-758-758-0; ISSN 2184-285X, SciTePress, pages 481-490. DOI: 10.5220/0013558900003967

@conference{data25,
author={Diego Vallejo{-}Huanga and Cèsar Ferri and Fernando Martínez{-}Plumed},
title={ClustSize: An Algorithmic Framework for Size-Constrained Clustering},
booktitle={Proceedings of the 14th International Conference on Data Science, Technology and Applications - Volume 1: DATA},
year={2025},
pages={481-490},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0013558900003967},
isbn={978-989-758-758-0},
issn={2184-285X},
}

TY - CONF

JO - Proceedings of the 14th International Conference on Data Science, Technology and Applications - Volume 1: DATA
TI - ClustSize: An Algorithmic Framework for Size-Constrained Clustering
SN - 978-989-758-758-0
IS - 2184-285X
AU - Vallejo-Huanga, D.
AU - Ferri, C.
AU - Martínez-Plumed, F.
PY - 2025
SP - 481
EP - 490
DO - 10.5220/0013558900003967
PB - SciTePress