Cargando…
Named Entity Recognition in Electronic Health Records: A Methodological Review
OBJECTIVES: A substantial portion of the data contained in Electronic Health Records (EHR) is unstructured, often appearing as free text. This format restricts its potential utility in clinical decision-making. Named entity recognition (NER) methods address the challenge of extracting pertinent info...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Korean Society of Medical Informatics
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10651400/ https://www.ncbi.nlm.nih.gov/pubmed/37964451 http://dx.doi.org/10.4258/hir.2023.29.4.286 |
_version_ | 1785135988297695232 |
---|---|
author | Durango, María C. Torres-Silva, Ever A. Orozco-Duque, Andrés |
author_facet | Durango, María C. Torres-Silva, Ever A. Orozco-Duque, Andrés |
author_sort | Durango, María C. |
collection | PubMed |
description | OBJECTIVES: A substantial portion of the data contained in Electronic Health Records (EHR) is unstructured, often appearing as free text. This format restricts its potential utility in clinical decision-making. Named entity recognition (NER) methods address the challenge of extracting pertinent information from unstructured text. The aim of this study was to outline the current NER methods and trace their evolution from 2011 to 2022. METHODS: We conducted a methodological literature review of NER methods, with a focus on distinguishing the classification models, the types of tagging systems, and the languages employed in various corpora. RESULTS: Several methods have been documented for automatically extracting relevant information from EHRs using natural language processing techniques such as NER and relation extraction (RE). These methods can automatically extract concepts, events, attributes, and other data, as well as the relationships between them. Most NER studies conducted thus far have utilized corpora in English or Chinese. Additionally, the bidirectional encoder representation from transformers using the BIO tagging system architecture is the most frequently reported classification scheme. We discovered a limited number of papers on the implementation of NER or RE tasks in EHRs within a specific clinical domain. CONCLUSIONS: EHRs play a pivotal role in gathering clinical information and could serve as the primary source for automated clinical decision support systems. However, the creation of new corpora from EHRs in specific clinical domains is essential to facilitate the swift development of NER and RE models applied to EHRs for use in clinical practice. |
format | Online Article Text |
id | pubmed-10651400 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Korean Society of Medical Informatics |
record_format | MEDLINE/PubMed |
spelling | pubmed-106514002023-10-01 Named Entity Recognition in Electronic Health Records: A Methodological Review Durango, María C. Torres-Silva, Ever A. Orozco-Duque, Andrés Healthc Inform Res Review Article OBJECTIVES: A substantial portion of the data contained in Electronic Health Records (EHR) is unstructured, often appearing as free text. This format restricts its potential utility in clinical decision-making. Named entity recognition (NER) methods address the challenge of extracting pertinent information from unstructured text. The aim of this study was to outline the current NER methods and trace their evolution from 2011 to 2022. METHODS: We conducted a methodological literature review of NER methods, with a focus on distinguishing the classification models, the types of tagging systems, and the languages employed in various corpora. RESULTS: Several methods have been documented for automatically extracting relevant information from EHRs using natural language processing techniques such as NER and relation extraction (RE). These methods can automatically extract concepts, events, attributes, and other data, as well as the relationships between them. Most NER studies conducted thus far have utilized corpora in English or Chinese. Additionally, the bidirectional encoder representation from transformers using the BIO tagging system architecture is the most frequently reported classification scheme. We discovered a limited number of papers on the implementation of NER or RE tasks in EHRs within a specific clinical domain. CONCLUSIONS: EHRs play a pivotal role in gathering clinical information and could serve as the primary source for automated clinical decision support systems. However, the creation of new corpora from EHRs in specific clinical domains is essential to facilitate the swift development of NER and RE models applied to EHRs for use in clinical practice. Korean Society of Medical Informatics 2023-10 2023-10-31 /pmc/articles/PMC10651400/ /pubmed/37964451 http://dx.doi.org/10.4258/hir.2023.29.4.286 Text en © 2023 The Korean Society of Medical Informatics https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) ) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Review Article Durango, María C. Torres-Silva, Ever A. Orozco-Duque, Andrés Named Entity Recognition in Electronic Health Records: A Methodological Review |
title | Named Entity Recognition in Electronic Health Records: A Methodological Review |
title_full | Named Entity Recognition in Electronic Health Records: A Methodological Review |
title_fullStr | Named Entity Recognition in Electronic Health Records: A Methodological Review |
title_full_unstemmed | Named Entity Recognition in Electronic Health Records: A Methodological Review |
title_short | Named Entity Recognition in Electronic Health Records: A Methodological Review |
title_sort | named entity recognition in electronic health records: a methodological review |
topic | Review Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10651400/ https://www.ncbi.nlm.nih.gov/pubmed/37964451 http://dx.doi.org/10.4258/hir.2023.29.4.286 |
work_keys_str_mv | AT durangomariac namedentityrecognitioninelectronichealthrecordsamethodologicalreview AT torressilvaevera namedentityrecognitioninelectronichealthrecordsamethodologicalreview AT orozcoduqueandres namedentityrecognitioninelectronichealthrecordsamethodologicalreview |