Authors: André Lourenço 1 and Ana Fred 2

Affiliations: 1 Instituto de Telecomunicacoes, Instituto Superior de Engenharia de Lisboa, Portugal; 2 Instituto de Telecomunicacoes, Instituto Superior Tecnico, Portugal

Abstract: We address the problem of clustering string patterns from an ensemble methods perspective. In this approach, different partitionings of the data are combined in an attempt to find a better and more robust partition. This study covers the different phases of the approach: the generation of the partitions (the clustering ensemble), their combination, and the validation of the combined result. For the generation phase we consider both different clustering algorithms (hierarchical agglomerative and partitional approaches) and different similarity measures (string matching, structural resemblance). The focus of the paper is the validation and selection of the final data partition. To that end, an information-theoretic measure, in conjunction with a variance analysis using bootstrapping, is used to quantitatively measure the consistency between the partitions and the combined results, and to choose the best result without additional information. Experimental results on a real data set (contour images) show that this approach can choose the best partition among alternative solutions in an unsupervised way, as validated by measuring consistency with the ground truth information.
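The validation idea in the abstract can be illustrated with a minimal sketch. It assumes that the information-theoretic measure is a normalized mutual information (NMI) between label vectors, and that the variance analysis resamples the objects with replacement. Both are assumptions made for illustration, since the abstract does not specify them. The function names (`nmi`, `bootstrap_consistency`) and parameters (`n_boot`, `seed`) are hypothetical and not taken from the paper.

```python
import math
import random
from collections import Counter


def entropy(labels):
    """Shannon entropy (natural log) of a partition given as a label list."""
    n = len(labels)
    return -sum((c / n) * math.log(c / n) for c in Counter(labels).values())


def mutual_information(a, b):
    """Mutual information between two partitions of the same objects."""
    n = len(a)
    count_a, count_b = Counter(a), Counter(b)
    joint = Counter(zip(a, b))
    return sum(
        (c / n) * math.log(c * n / (count_a[x] * count_b[y]))
        for (x, y), c in joint.items()
    )


def nmi(a, b):
    """NMI normalized by the mean entropy (one common choice, assumed here)."""
    ha, hb = entropy(a), entropy(b)
    if ha == 0.0 and hb == 0.0:
        return 1.0  # both partitions are a single cluster: treated as identical
    return 2.0 * mutual_information(a, b) / (ha + hb)


def bootstrap_consistency(candidate, ensemble, n_boot=100, seed=0):
    """Mean and sample variance, over bootstrap resamples of the objects,
    of the average NMI between a candidate partition and an ensemble.
    Assumed resampling scheme; not necessarily the paper's procedure."""
    rng = random.Random(seed)
    n = len(candidate)
    scores = []
    for _ in range(n_boot):
        idx = [rng.randrange(n) for _ in range(n)]
        cand = [candidate[i] for i in idx]
        avg = sum(nmi(cand, [p[i] for i in idx]) for p in ensemble) / len(ensemble)
        scores.append(avg)
    mean = sum(scores) / n_boot
    var = sum((s - mean) ** 2 for s in scores) / (n_boot - 1)
    return mean, var
```

Under these assumptions, a candidate partition with a high mean and a low variance would be preferred among alternative combined results, with no reference to ground truth labels.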

CC BY-NC-ND 4.0

Paper citation in several formats:
Lourenço, A. and Fred, A. (2007). String Patterns: From Single Clustering to Ensemble Methods and Validation. In Proceedings of the 7th International Workshop on Pattern Recognition in Information Systems (ICEIS 2007) - PRIS; ISBN 978-972-8865-93-1, SciTePress, pages 39-48. DOI: 10.5220/0002438400390048

@conference{pris07,
author={André Louren\c{c}o and Ana Fred},
title={String Patterns: From Single Clustering to Ensemble Methods and Validation},
booktitle={Proceedings of the 7th International Workshop on Pattern Recognition in Information Systems (ICEIS 2007) - PRIS},
year={2007},
pages={39-48},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002438400390048},
isbn={978-972-8865-93-1},
}

TY - CONF
JO - Proceedings of the 7th International Workshop on Pattern Recognition in Information Systems (ICEIS 2007) - PRIS
TI - String Patterns: From Single Clustering to Ensemble Methods and Validation
SN - 978-972-8865-93-1
AU - Lourenço, A.
AU - Fred, A.
PY - 2007
SP - 39
EP - 48
DO - 10.5220/0002438400390048
PB - SciTePress