Generating SD-Rules in the SPECIALIST Lexical Tools - Optimization for Suffix Derivation Rule Set

Chris J. Lu, Destinee Tormey, Lynn McCreedy, Allen C. Browne

Abstract

Suffix derivations (SDs) are used with query expansion in concept mapping as an effective Natural Language Processing (NLP) technique to improve recall without sacrificing precision. A systematic approach was proposed to generate derivations in the SPECIALIST Lexical Tools in which SD candidate rules were used to retrieve SD-pairs from the SPECIALIST Lexicon (Lu et al., 2012). Good SD candidate rules are gathered as SD-Rules in Lexical Tools for generating SDs that are not known to the Lexicon. This paper describes a methodology to select an optimized SD-Rule set that meets our requirement of 95\% system precision with best system performance from SD candidate rules. The results of the latest three releases of Lexical Tools show: 1) system precision and recall of selected SD-Rules are above 95\%. 2) a consistency between a computational linguistic approach and traditional linguistic knowledge for selecting the best Parent-Child rules. 3) a consistent approach yielding similar SD-Rule sets and system performance. Ultimately, it results in better precision and recall for NLP applications using Lexical Tools derivational related flow components.

References

  1. Aho, A.V., Ullman, J.D., and Hopcroft, J.E., 1983. Data Structure and Algorithms. Addison Wesley, pages 163- 169.
  2. Aronson, A.R. and Lang, F.M., 2010. An Overview of MetaMap: Historical Perspective and Recent Advances. JAMIA, Vol. 17, pages 229-236.
  3. Divita, G., Zeng, Q.T., Gundlapalli, A.V., Duvall, S., Nebeker, J., and Samore, M.H., 2014. Sophia: An Expedient UMLS Concept Extraction Annotator. In proceeding of AMIA 2014 Annual Symposium, pages 467-476, Washington, DC, USA, November 15-19.
  4. Dorland, W.A., 2003. Dorland's Illustrated Medical Dictionary, 30th edition, W. B. Saunders Company. Philadelphia, Pa, page 763.
  5. Fung, K.W., McDonald, C., and Srinivasan, S., 2010. The UMLS-CORE Project: A Study of the Problem List Terminologies Used in Large Healthcare Institutions. JAMIA, Vol. 17, pages 675-680.
  6. Gaussier, E., 1999. Unsupervised learning of derivational morphology from inflectional lexicons. In: Kehler A and Stolcke A, eds, ACL workshop on Unsupervised Methods in Natural Language Learning, College Park, MD. June.
  7. Grabar, N. and Zweigenbaum, P., 2000. A General Method for Sifting Linguistic Knowledge from Structured Terminologies. In proceeding of AMIA 2000 Annual Symposium, page. 310-314, Los Angeles, CA, USA, November 4-8.
  8. Grabar, N. and Zweigenbaum, P., 1999. Language Independent Automatic Acquisition of Morphological Knowledge from Synonym Pairs. In Proceeding of AMIA 1999 Annual Symposium, page. 77-81, Washington, DC, USA, November 6-10.
  9. Lu, C.J., McCreedy, L., Tormey, D., and Browne, A.C., 2012. A Systematic Approach for Automatically Generating Derivational Variants in Lexical Tools Based on the SPECIALIST Lexicon. IEEE IT Professional Magazine, May/June, pages 36-42.
  10. Lu, C.J., Tormey, D., McCreedy, L., and Browne, A.C., 2013. Implementing Comprehensive Derivational Features in Lexical Tools Using a Systematical Approach. In proceeding of AMIA 2013 Annual Symposium, Wash., DC, USA, Nov. 16-20, page 904.
  11. McCray, A.T., Aronson, A.R., Browne, A.C., Rindflesch, T.C., Razi, A., and Srinivasan, S., 1993. UMLS Knowledge for Biomedical Language Processing. Bull. Medical Library Assoc., vol. 81, no. 2, page 184-194.
  12. National Library of Medicine, Lexical Tools 2014, Optimizing 2014 SD-Rule Set - Add SD-Rules from Other Suggestions. Available from: <http://lsg3. nlm.nih.gov/LexSysGroup/Projects/lvg/2014/docs/desi gnDoc/UDF/derivations/SD-Rules-Opti/ex-addsuggest.html>. (24 October 2013).
  13. National Library of Medicine, Lexical Tools 2015, Optimizing 2015 SD-Rule Set - Optimum Log. Available from: <http://lsg3.nlm.nih.gov/LexSysGroup /Projects/lvg/2015/docs/designDoc/UDF/derivations/S D-Rules-Opti/Ex-2015/optiLog.html>. (9 September 2014).
  14. National Library of Medicine, Lexical Tools 2016, Optimizing 2016 SD-Rule Set - Optimum Log. Available from: <http://lsg3.nlm.nih.gov/LexSysGroup /Projects/lvg/2016/docs/designDoc/UDF/derivations/S D-Rules-Opti/Ex-2016/optiLog.html>. (17 September 2015).
  15. Pacak, M.G., Norton, L.M., and Dunham, G.S., 1980. Morphosemantic Analysis of - ITIS Forms in Medical Language. J. Methods of Information in Medicine, vol. 19, no. 2, page 99-105.
  16. Wolff. S., Automatic Coding of Medical Vocabulary. 1987. Medical Information Processing - Computer Management of Narrative Data. Addison Wesley, Reading Mass, page 145-162.
Download


Paper Citation


in Harvard Style

Lu C., Tormey D., McCreedy L. and Browne A. (2016). Generating SD-Rules in the SPECIALIST Lexical Tools - Optimization for Suffix Derivation Rule Set . In Proceedings of the 9th International Joint Conference on Biomedical Engineering Systems and Technologies - Volume 5: HEALTHINF, (BIOSTEC 2016) ISBN 978-989-758-170-0, pages 353-358. DOI: 10.5220/0005731303530358


in Bibtex Style

@conference{healthinf16,
author={Chris J. Lu and Destinee Tormey and Lynn McCreedy and Allen C. Browne},
title={Generating SD-Rules in the SPECIALIST Lexical Tools - Optimization for Suffix Derivation Rule Set},
booktitle={Proceedings of the 9th International Joint Conference on Biomedical Engineering Systems and Technologies - Volume 5: HEALTHINF, (BIOSTEC 2016)},
year={2016},
pages={353-358},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005731303530358},
isbn={978-989-758-170-0},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 9th International Joint Conference on Biomedical Engineering Systems and Technologies - Volume 5: HEALTHINF, (BIOSTEC 2016)
TI - Generating SD-Rules in the SPECIALIST Lexical Tools - Optimization for Suffix Derivation Rule Set
SN - 978-989-758-170-0
AU - Lu C.
AU - Tormey D.
AU - McCreedy L.
AU - Browne A.
PY - 2016
SP - 353
EP - 358
DO - 10.5220/0005731303530358