Extracting Event-related Information from a Corpus Regarding Soil Industrial Pollution

Chuanming Dong, Chuanming Dong, Philippe Gambette, Catherine Dominguès

2021

Abstract

We study the extraction and reorganization of event-related information in texts regarding industrial pollution. The object is to build a memory of polluted sites that gathers the information about industrial events from various databases and corpora. An industrial event is described through several features as the event trigger, the industrial activity, the institution, the pollutant, etc. In order to efficiently collect information from a large corpus, it is necessary to automatize the information extraction process. To this end, we manually annotated a part of a corpus about soil industrial pollution, then we used it to train information extraction models with deep learning methods. The models we trained achieve 0.76 F-score on event feature extraction. We intend to improve the models and then use them on other text resources to enrich the polluted sites memory with extracted information about industrial events.

Download


Paper Citation


in Harvard Style

Dong C., Gambette P. and Dominguès C. (2021). Extracting Event-related Information from a Corpus Regarding Soil Industrial Pollution. In Proceedings of the 13th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2021) - Volume 1: KDIR; ISBN 978-989-758-533-3, SciTePress, pages 217-224. DOI: 10.5220/0010656700003064


in Bibtex Style

@conference{kdir21,
author={Chuanming Dong and Philippe Gambette and Catherine Dominguès},
title={Extracting Event-related Information from a Corpus Regarding Soil Industrial Pollution},
booktitle={Proceedings of the 13th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2021) - Volume 1: KDIR},
year={2021},
pages={217-224},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010656700003064},
isbn={978-989-758-533-3},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 13th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2021) - Volume 1: KDIR
TI - Extracting Event-related Information from a Corpus Regarding Soil Industrial Pollution
SN - 978-989-758-533-3
AU - Dong C.
AU - Gambette P.
AU - Dominguès C.
PY - 2021
SP - 217
EP - 224
DO - 10.5220/0010656700003064
PB - SciTePress