Construct Semantic Type of “Gene-mutation-disease” Relation by Computer-aided Curation from Biomedical Literature

Dongsheng Zhao, Fan Tong, Zheheng Luo

2019

Abstract

Background: Current semantic type of “gene-mutation-disease” relation lacks fine-grained classification and corresponding relation signal words, which limits its usage in relation extraction from biomedical literature using text mining approach. Methods: We propose a computer-aided curation pipeline in which open relation extraction, signal word clustering, relation type mapping are used to analyze biomedical abstracts for semantic type of “gene-mutation-disease” construction. Coverage metrics are used to evaluate the defined relation type while ClinVar is chosen as a target to test our semantic type’s usability and performance on guiding relation extraction from biomedical literature. Results: We have constructed a 5-layer and 16-category semantic type of “gene-mutation-disease” relation with a vocabulary list containing 58 commonly used relation signal words. The vocabulary list has coverage of 95.08% and the semantic type has coverage of 94.12%. From 25 abstracts linked to 30 ClinVar records, 15 relations are correctly mapped and 8 novel relations are discovered additionally. Conclusion: The results show that our semantic type can cover the main relations between “gene”, “mutation” and “disease” and can achieve good performance on guiding relation extraction from biomedical text even using relatively out-of-date dictionary-based text mining methods.

Download


Paper Citation


in Harvard Style

Zhao D., Tong F. and Luo Z. (2019). Construct Semantic Type of “Gene-mutation-disease” Relation by Computer-aided Curation from Biomedical Literature. In Proceedings of the 12th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2019) - Volume 3: BIOINFORMATICS; ISBN 978-989-758-353-7, SciTePress, pages 123-130. DOI: 10.5220/0007688101230130


in Bibtex Style

@conference{bioinformatics19,
author={Dongsheng Zhao and Fan Tong and Zheheng Luo},
title={Construct Semantic Type of “Gene-mutation-disease” Relation by Computer-aided Curation from Biomedical Literature},
booktitle={Proceedings of the 12th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2019) - Volume 3: BIOINFORMATICS},
year={2019},
pages={123-130},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0007688101230130},
isbn={978-989-758-353-7},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 12th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2019) - Volume 3: BIOINFORMATICS
TI - Construct Semantic Type of “Gene-mutation-disease” Relation by Computer-aided Curation from Biomedical Literature
SN - 978-989-758-353-7
AU - Zhao D.
AU - Tong F.
AU - Luo Z.
PY - 2019
SP - 123
EP - 130
DO - 10.5220/0007688101230130
PB - SciTePress