loading
Papers

Research.Publish.Connect.

Paper

Paper Unlock

Authors: João M. N. Duarte 1 ; Ana L. N. Fred 2 and F. Jorge F. Duarte 3

Affiliations: 1 Instituto de Telecomunicações, Instituto Superior Técnico and Polytechnic of Porto, Portugal ; 2 Instituto de Telecomunicações and Instituto Superior Técnico, Portugal ; 3 Polytechnic of Porto, Portugal

ISBN: 978-989-8565-75-4

Keyword(s): Clustering Validation, Constrained Data Clustering.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; Clustering and Classification Methods ; Computational Intelligence ; Data Reduction and Quality Assessment ; Evolutionary Computing ; Knowledge Discovery and Information Retrieval ; Knowledge-Based Systems ; Machine Learning ; Soft Computing ; Symbolic Systems

Abstract: Much attention is being given to the incorporation of constraints into data clustering, mainly expressed in the form of must-link and cannot-link constraints between pairs of domain objects. However, its inclusion in the important clustering validation process was so far disregarded. In this work, we integrate the use of constraints in clustering validation. We propose three approaches to accomplish it: produce a weighted validity score considering a traditional validity index and the constraint satisfaction ratio; learn a new distance function or feature space representation which better suits the constraints, and use it with a validation index; and a combination of the previous. Experimental results in 14 synthetic and real data sets have shown that including the information provided by the constraints increases the performance of the clustering validation process in selecting the best number of clusters.

PDF ImageFull Text

Download
CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 18.207.134.98

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
M. N. Duarte, J.; L. N. Fred, A. and F. Duarte, F. (2013). Data Clustering Validation using Constraints.In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval and the International Conference on Knowledge Management and Information Sharing - Volume 1: KDIR, (IC3K 2013) ISBN 978-989-8565-75-4, pages 17-27. DOI: 10.5220/0004543800170027

@conference{kdir13,
author={João M. N. Duarte. and Ana L. N. Fred. and F. Jorge F. Duarte.},
title={Data Clustering Validation using Constraints},
booktitle={Proceedings of the International Conference on Knowledge Discovery and Information Retrieval and the International Conference on Knowledge Management and Information Sharing - Volume 1: KDIR, (IC3K 2013)},
year={2013},
pages={17-27},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004543800170027},
isbn={978-989-8565-75-4},
}

TY - CONF

JO - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval and the International Conference on Knowledge Management and Information Sharing - Volume 1: KDIR, (IC3K 2013)
TI - Data Clustering Validation using Constraints
SN - 978-989-8565-75-4
AU - M. N. Duarte, J.
AU - L. N. Fred, A.
AU - F. Duarte, F.
PY - 2013
SP - 17
EP - 27
DO - 10.5220/0004543800170027

Login or register to post comments.

Comments on this Paper: Be the first to review this paper.