CLUSTERING OF HETEROGENEOUSLY TYPED DATA WITH SOFT COMPUTING

Angel Kuri-Morales; Luis Enrique Cortes-Berrueco; Daniel Trejo-Baños

Research.Publish.Connect.

*Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Country:
Subject:

Advanced Search Affiliations Search

If you're looking for an exact phrase use quotation marks on text fields.

Proceedings

Proceedings Search *Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

Papers

Papers Search *Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

Authors

Authors Search *Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

Advanced Search

Paper

CLUSTERING OF HETEROGENEOUSLY TYPED DATA WITH SOFT COMPUTING

Topics: Clustering and Classification Methods; Information Extraction; Machine Learning; Pre-Processing and Post-Processing for Data Mining

In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 0IC3K, 491-494, 2011 , Paris, France

Authors: Angel Kuri-Morales ¹ ; Luis Enrique Cortes-Berrueco ² and Daniel Trejo-Baños ²

Affiliations: ¹ Instituto Tecnológico Autónomo de México, Mexico ; ² Universidad Nacional Autónoma de México, Mexico

Keyword(s): Clustering, Categorical variables, Soft computing, Data mining.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; Clustering and Classification Methods ; Computational Intelligence ; Evolutionary Computing ; Information Extraction ; Knowledge Discovery and Information Retrieval ; Knowledge-Based Systems ; Machine Learning ; Pre-Processing and Post-Processing for Data Mining ; Soft Computing ; Symbolic Systems

Abstract: The problem of finding clusters in arbitrary sets of data has been attempted using different approaches. In most cases, the use of metrics in order to determine the adequateness of the said clusters is assumed. That is, the criteria yielding a measure of quality of the clusters depends on the distance between the elements of each cluster. Typically, one considers a cluster to be adequately characterized if the elements within a cluster are close to one another while, simultaneously, they appear to be far from those of different clusters. This intuitive approach fails if the variables of the elements of a cluster are not amenable to distance measurements, i.e., if the vectors of such elements cannot be quantified. This case arises frequently in real world applications where several variables correspond to categories. The usual tendency is to assign arbitrary numbers to every category: to encode the categories. This, however, may result in spurious patterns: relationships between the v ariables which are not really there at the offset. It is evident that there is no truly valid assignment which may ensure a universally valid numerical value to this kind of variables. But there is a strategy which guarantees that the encoding will, in general, not bias the results. In this paper we explore such strategy. We discuss the theoretical foundations of our approach and prove that this is the best strategy in terms of the statistical behaviour of the sampled data. We also show that, when applied to a complex real world problem, it allows us to generalize soft computing methods to find the number and characteristics of a set of clusters. (More)

CC BY-NC-ND 4.0

Guest: Register as new SciTePress user now for free.

SciTePress user: please login.

My Papers

You are not signed in, therefore limits apply to your IP address 3.137.188.201

In the current month:

Recent papers: 100 available of 100 total

2⁺ years older papers: 200 available of 200 total

Paper citation in several formats:

Kuri-Morales, A.; Cortes-Berrueco, L. and Trejo-Baños, D. (2011). CLUSTERING OF HETEROGENEOUSLY TYPED DATA WITH SOFT COMPUTING. In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval (IC3K 2011) - KDIR; ISBN 978-989-8425-79-9; ISSN 2184-3228, SciTePress, pages 491-494. DOI: 10.5220/0003690304990502

@conference{kdir11,
author={Angel Kuri{-}Morales. and Luis Enrique Cortes{-}Berrueco. and Daniel Trejo{-}Baños.},
title={CLUSTERING OF HETEROGENEOUSLY TYPED DATA WITH SOFT COMPUTING},
booktitle={Proceedings of the International Conference on Knowledge Discovery and Information Retrieval (IC3K 2011) - KDIR},
year={2011},
pages={491-494},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003690304990502},
isbn={978-989-8425-79-9},
issn={2184-3228},
}

TY - CONF

JO - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval (IC3K 2011) - KDIR
TI - CLUSTERING OF HETEROGENEOUSLY TYPED DATA WITH SOFT COMPUTING
SN - 978-989-8425-79-9
IS - 2184-3228
AU - Kuri-Morales, A.
AU - Cortes-Berrueco, L.
AU - Trejo-Baños, D.
PY - 2011
SP - 491
EP - 494
DO - 10.5220/0003690304990502
PB - SciTePress