Knowledge Discovery from ISAD, Digital Archive Data, into ArchOnto, a CIDOC-CRM based Linked Model

Dora Melo, Irene Pimenta Rodrigues, Inês Koch

2020

Abstract

This paper presents an automatic semantic migration prototype based on Knowledge Discovery from Digital Archive Data for ontology population in the domain of Archives metadata, ISAD(G). Natural Language Processing (NLP) techniques are used for language processing and Semantic Web techniques for querying and updating the Ontology ArchOnto, a CIDOC-CRM (Conceptual Reference Model) extension. This work is done in the context of project EPISA (Entity and Property Inference for Semantic Archives) where the Portuguese National Archives, Torre do Tombo (ANTT) is one of the partners. The data model and description vocabularies we adopted are built upon the CIDOC-CRM standard, an ontology, developed for museums by the International Committee for Documentation (CIDOC) of the International Council of Museums (ICOM). A detailed example of a baptism document metadata migration is presented to highlight the challenges on the natural language interpretation and the ontology representation.

Download


Paper Citation


in Harvard Style

Melo D., Rodrigues I. and Koch I. (2020). Knowledge Discovery from ISAD, Digital Archive Data, into ArchOnto, a CIDOC-CRM based Linked Model. In Proceedings of the 12th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2020) - Volume 2: KEOD; ISBN 978-989-758-474-9, SciTePress, pages 197-204. DOI: 10.5220/0010134101970204


in Bibtex Style

@conference{keod20,
author={Dora Melo and Irene Pimenta Rodrigues and Inês Koch},
title={Knowledge Discovery from ISAD, Digital Archive Data, into ArchOnto, a CIDOC-CRM based Linked Model},
booktitle={Proceedings of the 12th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2020) - Volume 2: KEOD},
year={2020},
pages={197-204},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010134101970204},
isbn={978-989-758-474-9},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 12th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2020) - Volume 2: KEOD
TI - Knowledge Discovery from ISAD, Digital Archive Data, into ArchOnto, a CIDOC-CRM based Linked Model
SN - 978-989-758-474-9
AU - Melo D.
AU - Rodrigues I.
AU - Koch I.
PY - 2020
SP - 197
EP - 204
DO - 10.5220/0010134101970204
PB - SciTePress