Authors: Hiba Khalid 1 ; Esteban Zimanyi 2 and Robert Wrembel 3

Affiliations: 1 Department of Code, ULB, Brussels, Belgium, and Department of Informatics, PUT, Poznan, Poland ; 2 Department of Code, ULB, Brussels, Belgium ; 3 Department of Informatics, PUT, Poznan, Poland

ISBN: 978-989-758-318-6

ISSN: 2184-285X

Keyword(s): Fuzzy Sets, Fuzzy Union, Crisp Sets, Metadata, Data Integration, Knowledge Base.

Related Ontology Subjects/Areas/Topics: Applications ; Artificial Intelligence ; Business Analytics ; Business Intelligence ; Collaboration and e-Services ; Data Analytics ; Data Engineering ; Data Management and Quality ; e-Business ; Enterprise Information Systems ; Information Integration ; Integration/Interoperability ; Knowledge Discovery and Information Retrieval ; Knowledge Engineering and Ontology Development ; Knowledge-Based Systems ; Ontologies and the Semantic Web ; Software Engineering ; Symbolic Systems ; Text Analytics

Abstract: The problem of data integration is one of the most debated issues in the general field of data management. Data integration is typically accompanied by conflict management. The root of the problem lies in the differences between data sources and in the probability with which each data source corresponds to another. Metadata is another important, yet highly overlooked, concept in these research areas. In this paper we propose leveraging metadata as a binding source in the integration process. The research technique relies on exploiting textual metadata from different sources, using fuzzy logic as a coherence measure. A framework methodology has been devised for understanding the power of textual metadata. The framework operates on multiple data sources; a data source set can typically contain n datasets. When two data sources are considered, they can be titled primary and secondary. The primary source is the accepting data source and thus contains more enriched metadata. The secondary sources are the sources requesting integration and are also guided by textual data summaries, keywords, analysis reports, etc. The Fuzzy MD framework finds similarities between primary and secondary metadata sources using fuzzy matching and string exploration. The model then provides the probable answer for each set's association with the primary accepting source. The framework relies on the origin of words and relative associativity rather than on the common approach of manual metadata enrichment. This not only resolves the argument over manual metadata enrichment, it also provides an implicit solution for generating metadata from scratch as part of the integration and analysis process.
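The matching idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' Fuzzy MD implementation: the function names (`term_similarity`, `membership`, `association`, `fuzzy_union`), the use of Python's standard `difflib.SequenceMatcher` as the string similarity, and the averaging step are all assumptions chosen for illustration. The only piece taken from standard fuzzy set theory is the union, defined as the element-wise maximum of membership degrees.

```python
from difflib import SequenceMatcher
from typing import Dict, Sequence


def term_similarity(a: str, b: str) -> float:
    """Case-insensitive string similarity in [0, 1] (assumed measure)."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def membership(term: str, primary_terms: Sequence[str]) -> float:
    """Degree to which a secondary term belongs to the primary metadata:
    the score of its best match among the primary terms."""
    return max((term_similarity(term, p) for p in primary_terms), default=0.0)


def association(secondary_terms: Sequence[str],
                primary_terms: Sequence[str]) -> float:
    """Hypothetical association score of a secondary source with the
    primary source: the mean membership of its metadata terms."""
    if not secondary_terms:
        return 0.0
    total = sum(membership(t, primary_terms) for t in secondary_terms)
    return total / len(secondary_terms)


def fuzzy_union(mu_a: Dict[str, float], mu_b: Dict[str, float]) -> Dict[str, float]:
    """Standard fuzzy union: element-wise maximum of membership degrees."""
    keys = mu_a.keys() | mu_b.keys()
    return {k: max(mu_a.get(k, 0.0), mu_b.get(k, 0.0)) for k in keys}


if __name__ == "__main__":
    # Illustrative metadata keywords; not data from the paper.
    primary = ["customer name", "order date", "unit price"]
    secondary = ["cust_name", "date of order", "price"]
    print(round(association(secondary, primary), 3))
```

Under these assumptions, a secondary source whose keywords closely resemble the primary keywords scores near 1, and an unrelated source scores near 0. Any threshold for accepting a source for integration would have to be chosen separately.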

Paper citation in several formats:
Khalid, H.; Zimanyi, E. and Wrembel, R. (2018). Fuzzy Metadata Strategies for Enhanced Data Integration. In Proceedings of the 7th International Conference on Data Science, Technology and Applications - Volume 1: DATA, ISBN 978-989-758-318-6, ISSN 2184-285X, pages 83-90. DOI: 10.5220/0006905200830090

author={Hiba Khalid and Esteban Zimanyi and Robert Wrembel},
title={Fuzzy Metadata Strategies for Enhanced Data Integration},
booktitle={Proceedings of the 7th International Conference on Data Science, Technology and Applications - Volume 1: DATA},


JO - Proceedings of the 7th International Conference on Data Science, Technology and Applications - Volume 1: DATA
TI - Fuzzy Metadata Strategies for Enhanced Data Integration
SN - 978-989-758-318-6
AU - Khalid, H.
AU - Zimanyi, E.
AU - Wrembel, R.
PY - 2018
SP - 83
EP - 90
DO - 10.5220/0006905200830090
