loading
Documents

Research.Publish.Connect.

Paper

Authors: Boshu Ru 1 ; Charles Warner-Hillard 1 ; Yong Ge 1 and Lixia Yao 2

Affiliations: 1 University of North Carolina at Charlotte, United States ; 2 Mayo Clinic and University of North Carolina at Charlotte, United States

ISBN: 978-989-758-213-4

Keyword(s): Social Media, Drug Repositioning, Machine Learning, Patient-Reported Outcomes.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; Biomedical Engineering ; Data Mining ; Databases and Information Systems Integration ; Enterprise Information Systems ; Health Information Systems ; Pattern Recognition and Machine Learning ; Sensor Networks ; Signal Processing ; Soft Computing

Abstract: Drug repositioning reduces safety risk and development cost, compared to developing new drugs. Computational approaches have examined biological, chemical, literature, and electronic health record data for systematic drug repositioning. In this work, we built an entire computational pipeline to investigate the feasibility of mining a new data source – the fast-growing online patient forum data for identifying and verifying drug-repositioning hypotheses. We curated a gold-standard dataset based on filtered drug reviews from WebMD. Among 15,714 sentences, 447 mentioned novel desirable drug usages that were not listed as known drug indications by WebMD and thus were defined as serendipitous drug usages. We then constructed 347 features using text-mining methods and drug knowledge. Finally we built SVM, random forest and AdaBoost.M1 classifiers and evaluated their classification performance. Our best model achieved an AUC score of 0.937 on the independent test dataset, with precision equa l to 0.811 and recall equal to 0.476. It successfully predicted serendipitous drug usages, including metformin and bupropion for obesity, tramadol for depression and ondansetron for irritable bowel syndrome with diarrhea. Machine learning methods make this new data source feasible for studying drug repositioning. Our future efforts include constructing more informative features, developing more effective methods to handle imbalance data, and verifying prediction results using other existing methods. (More)

PDF ImageFull Text

Download
CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 3.80.4.76

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Ru, B.; Warner-Hillard, C.; Ge, Y. and Yao, L. (2017). Identifying Serendipitous Drug Usages in Patient Forum Data - A Feasibility Study.In Proceedings of the 10th International Joint Conference on Biomedical Engineering Systems and Technologies - Volume 5: HEALTHINF, (BIOSTEC 2017) ISBN 978-989-758-213-4, pages 106-118. DOI: 10.5220/0006145201060118

@conference{healthinf17,
author={Boshu Ru. and Charles Warner{-}Hillard. and Yong Ge. and Lixia Yao.},
title={Identifying Serendipitous Drug Usages in Patient Forum Data - A Feasibility Study},
booktitle={Proceedings of the 10th International Joint Conference on Biomedical Engineering Systems and Technologies - Volume 5: HEALTHINF, (BIOSTEC 2017)},
year={2017},
pages={106-118},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006145201060118},
isbn={978-989-758-213-4},
}

TY - CONF

JO - Proceedings of the 10th International Joint Conference on Biomedical Engineering Systems and Technologies - Volume 5: HEALTHINF, (BIOSTEC 2017)
TI - Identifying Serendipitous Drug Usages in Patient Forum Data - A Feasibility Study
SN - 978-989-758-213-4
AU - Ru, B.
AU - Warner-Hillard, C.
AU - Ge, Y.
AU - Yao, L.
PY - 2017
SP - 106
EP - 118
DO - 10.5220/0006145201060118

Login or register to post comments.

Comments on this Paper: Be the first to review this paper.