loading
Documents

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Kazuo Hara 1 ; Ikumi Suzuki 1 ; Kousaku Okubo 1 and Isamu Muto 2

Affiliations: 1 National Institute of Genetics, Japan ; 2 BITS. Co. and Ltd., Japan

ISBN: 978-989-758-048-2

Keyword(s): Semi-Automated Information Extraction, Cohesive Text, Itemized Text.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; Information Extraction ; Knowledge Discovery and Information Retrieval ; Knowledge-Based Systems ; Mining Text and Semi-Structured Data ; Pre-Processing and Post-Processing for Data Mining ; Symbolic Systems

Abstract: Anatomical knowledge written in a textbook is almost completely unreusable computationally, because it is embedded in a cohesive discourse. In discourse contexts, the frequent use of cohesive ties such as reference expressions and coordinated phrases not only troubles the function of automated systems (i.e., natural language parsers) to extract knowledge from the resulting complicated sentences, but also affects the identification of mentions of anatomical named entities (NEs). We propose to revamp the prose style of anatomical textbooks by transforming cohesive discourse into itemized text, which can be accomplished by annotating reference expressions and coordinating conjunctions. Then, automatically, each anaphor will be replaced by its antecedent in each reference expression, and the conjoined elements are distributed to sentences duplicated for each coordinating conjunction connecting phrases. We demonstrate that, compared to the original text, the transformed one is easy for mac hines to process and hence convenient as a way of identifying mentions of anatomical NEs and their relations. Since the transformed text is human readable as well, we believe our approach provides a promising new model for language resources accessible by both human and machine, improving the computational reusability of textbooks. (More)

PDF ImageFull Text

Download
CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 3.83.192.109

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Hara, K.; Hara, K.; Suzuki, I.; Suzuki, I.; Okubo, K. and Muto, I. (2014). Annotating Cohesive Statements of Anatomical Knowledge Toward Semi-automated Information Extraction.In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2014) ISBN 978-989-758-048-2, pages 342-347. DOI: 10.5220/0005132303420347

@conference{kdir14,
author={Kazuo Hara. and Kazuo Hara. and Ikumi Suzuki. and Ikumi Suzuki. and Kousaku Okubo. and Isamu Muto.},
title={Annotating Cohesive Statements of Anatomical Knowledge Toward Semi-automated Information Extraction},
booktitle={Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2014)},
year={2014},
pages={342-347},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005132303420347},
isbn={978-989-758-048-2},
}

TY - CONF

JO - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2014)
TI - Annotating Cohesive Statements of Anatomical Knowledge Toward Semi-automated Information Extraction
SN - 978-989-758-048-2
AU - Hara, K.
AU - Hara, K.
AU - Suzuki, I.
AU - Suzuki, I.
AU - Okubo, K.
AU - Muto, I.
PY - 2014
SP - 342
EP - 347
DO - 10.5220/0005132303420347

Login or register to post comments.

Comments on this Paper: Be the first to review this paper.