USING A CLUSTERING ALGORITHM FOR DOMAIN RELATED ONTOLOGY CONSTRUCTION

Hongyan Yi; V. J. Rayward-Smith

doi:10.5220/0002331803360341

USING A CLUSTERING ALGORITHM FOR DOMAIN RELATED ONTOLOGY CONSTRUCTION

Hongyan Yi, V. J. Rayward-Smith

2009

Abstract

Fisher’s clustering algorithm is exploited to build a cluster hierarchy. Then this methodology is used to automatically generate the taxonomies of the nominal attribute values for a real world database. An ontology for a specific analysis task is finally constructed, which reflects some interesting behaviour of real data. Although this semi-automatically constructed ontology may be different from the widely accepted one for the same domain, it may indicate the true character of the data from the statistical point of view and have a semantic interpretation as well as being more suitable for the specific data mining application.

References

Asuncion, A. and Newman, D. J. (2007). UCI machine learning repository. University of California, Irvine, School of Information and Computer Sciences.
Fisher, W. D. (1958). On grouping for maximum homogeneity. Journal of the American Statistical Association, 53:789-798.
Genesereth, M. R. and Nilsson, N. J. (1987). Logical Foundation of Artificial Intelligence. Kauffman, Los Altos, California.
Gruber, T. R. (1993). A translation approach to portable ontology specifications. In Knowledge Acquisition, volume 5, pages 199-220.
Hartigan, J. A. (1975). Clustering Algorithms. New York: John Wiley & Sons, Inc. Pages 130-142.
Kaufman, L. and Rousseeuw, P. J. (1990). Finding Groups in Data: An Introduction to Cluster Analysis. New York: John Wiley & Sons, Inc.
Khan, L. and Luo, F. (2002). Ontology construction for information selection. In Proc. of 14th IEEE International Conference on Tools with Artificial Intelligence.
McQueen, J. B. (1967). Some methods for classification and analysis of multivariate observations. In Proc. of the 5th Berkeley Symposium on Mathematical Statistics and Probability, volume 1, pages 281-297, Berkeley.
Tan, P. N., Steinbach, M., and Kumar, V. (2006). Introduction to Data Mining. Pearson Education, Boston.
Yi, H. (2009). The Construction and Exploitation of Attribute-Value Taxonomies in Data Mining. PhD thesis, University of East Anglia, to be submitted.
Yi, H. Y., Iglesia, B. d. l., and Rayward-Smith, V. J. (2005). Using concept taxonomies for effective tree induction. In Computational Intelligence and Security International Conference (CIS 2005), volume LNAI 3802, pages 1011-1016.

Download

Paper Citation

in Harvard Style

Yi H. and J. Rayward-Smith V. (2009). USING A CLUSTERING ALGORITHM FOR DOMAIN RELATED ONTOLOGY CONSTRUCTION . In Proceedings of the International Conference on Knowledge Engineering and Ontology Development - Volume 1: KEOD, (IC3K 2009) ISBN 978-989-674-012-2, pages 336-341. DOI: 10.5220/0002331803360341

in Bibtex Style

@conference{keod09,
author={Hongyan Yi and V. J. Rayward-Smith},
title={USING A CLUSTERING ALGORITHM FOR DOMAIN RELATED ONTOLOGY CONSTRUCTION},
booktitle={Proceedings of the International Conference on Knowledge Engineering and Ontology Development - Volume 1: KEOD, (IC3K 2009)},
year={2009},
pages={336-341},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002331803360341},
isbn={978-989-674-012-2},
}

in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Knowledge Engineering and Ontology Development - Volume 1: KEOD, (IC3K 2009)
TI - USING A CLUSTERING ALGORITHM FOR DOMAIN RELATED ONTOLOGY CONSTRUCTION
SN - 978-989-674-012-2
AU - Yi H.
AU - J. Rayward-Smith V.
PY - 2009
SP - 336
EP - 341
DO - 10.5220/0002331803360341