
Paper

Authors: Davide Varagnolo 1 ; Guilherme Antas 2 ; Mariana Ramos 2 ; Sara Amaral 2 ; Dora Melo 2, 3 and Irene Pimenta Rodrigues 2

Affiliations: 1 Department of Informatics, University of Évora, Portugal ; 2 NOVA Laboratory for Computer Science and Informatics, NOVA LINCS, Portugal ; 3 Polytechnic of Coimbra, Coimbra Business School—ISCAC, Coimbra, Portugal

Keyword(s): Natural Language Processing, Knowledge Representation, Knowledge Discovery, Semantic Web, Archives Linked Data Semantic Representation.

Abstract: This paper presents a method for extracting information from ISAD(G) elements that contain semi-structured text descriptions. Natural language processing is performed in the GATE environment by defining the set of JAPE rules needed to process the text and extract the intended information. The information extraction processes are evaluated using a sample of 800 records for each type of information and a dataset built manually for each type of information considered, such as baptisms, passport requisitions, testaments, etc. Implementing several automatic information extraction processes enables the CIDOC-CRM knowledge base to be populated automatically with new linked events and entities. The information, migrated from DigitArq and extracted from text descriptions represented in CIDOC-CRM, is explored through SPARQL queries, enabling new visualisations of the archival records and the retrieval of information collected in different records from different archives.

CC BY-NC-ND 4.0

Paper citation in several formats:
Varagnolo, D.; Antas, G.; Ramos, M.; Amaral, S.; Melo, D. and Rodrigues, I. (2022). Evaluating and Exploring Text Fields Information Extraction into CIDOC-CRM. In Proceedings of the 14th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2022) - KEOD; ISBN 978-989-758-614-9; ISSN 2184-3228, SciTePress, pages 177-184. DOI: 10.5220/0011550700003335

@conference{keod22,
author={Davide Varagnolo and Guilherme Antas and Mariana Ramos and Sara Amaral and Dora Melo and Irene Pimenta Rodrigues},
title={Evaluating and Exploring Text Fields Information Extraction into CIDOC-CRM},
booktitle={Proceedings of the 14th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2022) - KEOD},
year={2022},
pages={177-184},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0011550700003335},
isbn={978-989-758-614-9},
issn={2184-3228},
}

TY - CONF

JO - Proceedings of the 14th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2022) - KEOD
TI - Evaluating and Exploring Text Fields Information Extraction into CIDOC-CRM
SN - 978-989-758-614-9
IS - 2184-3228
AU - Varagnolo, D.
AU - Antas, G.
AU - Ramos, M.
AU - Amaral, S.
AU - Melo, D.
AU - Rodrigues, I.
PY - 2022
SP - 177
EP - 184
DO - 10.5220/0011550700003335
PB - SciTePress