Topic-OPA: A Topic Ontology for Modeling Topics of Old Press Articles

Mirna El Ghosh, Cecilia Zanni-Merk, Nicolas Delestre, Jean-Philippe Kotowicz, Habib Abdulrab


Topic ontologies are recently gaining much importance in several domains. Their purpose is to identify the themes necessary to describe the knowledge structure of an application domain. Meanwhile, their development from scratch is hard and time consuming task. This paper discusses the development a topic-specific ontology, named Topic-OPA, for modeling topics of old press articles. Topic-OPA is extracted from the open knowledge graph Wikidata by the application of a SPARQL-based fully automatic approach. The development process of Topic-OPA depends mainly on a set of disambiguated named entities representing the articles. Each named entity is unambiguously identified by a Wikidata URI. In contrast to existent topic ontologies, which are limited to taxonomies, the structure of Topic-OPA is composed of hierarchical and non-hierarchical schemes. The domain application of this work is the old french newspaper Le Matin. Finally, an evaluation process is performed to assess the structure quality of Topic-OPA.


Paper Citation