Healthc Inform Res.  2018 Oct;24(4):376-380. 10.4258/hir.2018.24.4.376.

Design and Construction of a NLP Based Knowledge Extraction Methodology in the Medical Domain Applied to Clinical Information

Affiliations
  • 1Technological University of Panama, Panama City, Panama. miguel.vargas@utp.ac.pa

Abstract


OBJECTIVES
This research presents the design and development of a software architecture using natural language processing tools and the use of an ontology of knowledge as a knowledge base.
METHODS
The software extracts, manages and represents the knowledge of a text in natural language. A corpus of more than 200 medical domain documents from the general medicine and palliative care areas was validated, demonstrating relevant knowledge elements for physicians.
RESULTS
Indicators for precision, recall and F-measure were applied. An ontology was created called the knowledge elements of the medical domain to manipulate patient information, which can be read or accessed from any other software platform.
CONCLUSIONS
The developed software architecture extracts the medical knowledge of the clinical histories of patients from two different corpora. The architecture was validated using the metrics of information extraction systems.

Keyword

Knowledge; Knowledge Management; Natural Language Processing; Information Extraction

MeSH Terms

Humans
Information Storage and Retrieval
Knowledge Bases
Knowledge Management
Natural Language Processing
Palliative Care

Figure

  • Figure 1 Phases of the proposed architecture. NLP: natural language processing, JAPE: Java Annotation Pattern Engine.

  • Figure 2 Excerpt of a rule code written in JAPE (Java Annotation Pattern Engine) for the extraction of annotations.

  • Figure 3 Excerpt from the hierarchy of classes of the domain ontology.

  • Figure 4 Excerpt from the created ontology.

  • Figure 5 Segment of the Clinical Document Architecture file in XML format.


Cited by  1 articles

ANNO: A General Annotation Tool for Bilingual Clinical Note Information Extraction
Kye Hwa Lee, Hyunsung Lee, Jin-Hyeok Park, Yi-Jun Kim, Youngho Lee
Healthc Inform Res. 2022;28(1):89-94.    doi: 10.4258/hir.2022.28.1.89.


Reference

1. Blaya JA, Fraser HS, Holt B. E-health technologies show promise in developing countries. Health Aff (Millwood). 2010; 29(2):244–251.
2. Terzieva M. Project knowledge management: how organizations learn from experience. Procedia Technol. 2004; 16:1086–1095.
Article
3. Maedche A, Motik B, Stojanovic L, Studer R, Volz R. Ontologies for enterprise knowledge manage ment. IEEE Intell Syst. 2003; 18(2):26–33.
Article
4. Studer R, Benjamins VR, Fensel D. Knowledge engineering: principles and methods. Data Knowl Eng. 1998; 25(1):161–198.
5. Legaz-Garcia Mdel C, Menarguez-Tortosa M, Fernandez-Breis JT, Chute CG, Tao C. Transformation of standardized clinical models based on OWL technologies: from CEM to OpenEHR archetypes. J Am Med Inform Assoc. 2015; 22(3):536–544.
6. Friedman C, Rindflesch TC, Corn M. Natural language processing: state of the art and prospects for significant progress, a workshop sponsored by the National Library of Medicine. J Biomed Inform. 2013; 46(5):765–773.
Article
7. Ruiz-Martinez JM, Valencia-Garcia R, Fernandez-Breis JT, Garcia-Sanchez F, Martinez-Bejar R. Ontology learning from biomedical natural language documents using UMLS. Expert Syst Appl. 2011; 38(10):12365–12378.
Article
8. Inniss TR, Lee JR, Light M, Grassi MA, Thomas G, Williams AB. Towards applying text mining and natural language processing for biomedical ontology acquisition. In : Proceedings of the 1st International Workshop on Text Mining in Bioinformatics; 2006 Nov 10; Arlington, VA. p. 7–14.
9. Manine AP, Alphonse E, Bessieres P. Information extraction as an ontology population task and its application to genic interactions. In : Proceedings of the 20th IEEE International Conference on Tools with Artificial Intelligence; 2008 Nov 3-5; Dayton, OH. p. 74–81.
10. Quesada-Martinez M, Mikroyannidi E, Fernandez-Breis JT, Stevens R. Approaching the axiomatic enrichment of the Gene Ontology from a lexical perspective. Artif Intell Med. 2015; 65(1):35–48.
Article
11. Thakker D, Osman T, Lakin P. GATE JAPE grammar tutorial [Internet]. Sheffield: The University of Sheffield;2009. cited at 2018 Oct 1. Available from: https://gate.ac.uk/sale/thakker-jape-tutorial/GATE%20JAPE%20manual.pdf.
Article
12. IJntema W, Sangers J, Hogenboom F, Frasincar F. A lexico-semantic pattern language for learning ontology instances from text. Web Semant. 2012; 15:37–50.
13. Wyner AZ, Schneider J, Atkinson K, Bench-Capon TJ. Semi-automated argumentative analysis of online product reviews. COMMA. 2012; 245:43–50.
14. Lee CH, Wang SH. An information fusion approach to integrate image annotation and text mining methods for geographic knowledge discovery. Expert Syst Appl. 2012; 39(10):8954–8967.
Article
15. Horridge M, Knublauch H, Rector A, Stevens R, Wroe C. A practical guide to building OWL ontologies using the Protege-OWL plugin and CO-ODE tools, Edition 1.0 [Internet]. Manchester: The University of Manchester;2014. cited at 2018 Oct 1. Available from: http://mowlpower.cs.man.ac.uk/protegeowltutorial/resources/ProtegeOWLTutorialP3_ v1_0.pdf.
Article
16. Fernandez-Breis JT, Chiba H, Legaz-Garcia Mdel C, Uchiyama I. The orthology ontology: development and applications. J Biomed Semantics. 2016; 7(1):34.
17. Lupse O, Vida M, Stoicu-Tivadar L, Stoicu-Tivadar V. Using HL7 CDA and CCD standards to improve communication between healthcare information systems. In : Proceedings of 2011 IEEE 9th International Symposium on Intelligent Systems and Informatics (SISY); 2011 Sep 8-10; Subotica, Serbia. p. 453–457.
Full Text Links
  • HIR
Actions
Cited
CITED
export Copy
Close
Share
  • Twitter
  • Facebook
Similar articles
Copyright © 2024 by Korean Association of Medical Journal Editors. All rights reserved.     E-mail: koreamed@kamje.or.kr