Cargando…

Enriching the international clinical nomenclature with Chinese daily used synonyms and concept recognition in physician notes

BACKGROUND: It has been shown that the entities in everyday clinical text are often expressed in a way that varies from how they are expressed in the nomenclature. Owing to lots of synonyms, abbreviations, medical jargons or even misspellings in the daily used physician notes in clinical information...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Rui, Liu, Jialin, Huang, Yong, Wang, Miye, Shi, Qingke, Chen, Jun, Zeng, Zhi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5414139/
https://www.ncbi.nlm.nih.gov/pubmed/28464923
http://dx.doi.org/10.1186/s12911-017-0455-z
_version_ 1783233303887740928
author Zhang, Rui
Liu, Jialin
Huang, Yong
Wang, Miye
Shi, Qingke
Chen, Jun
Zeng, Zhi
author_facet Zhang, Rui
Liu, Jialin
Huang, Yong
Wang, Miye
Shi, Qingke
Chen, Jun
Zeng, Zhi
author_sort Zhang, Rui
collection PubMed
description BACKGROUND: It has been shown that the entities in everyday clinical text are often expressed in a way that varies from how they are expressed in the nomenclature. Owing to lots of synonyms, abbreviations, medical jargons or even misspellings in the daily used physician notes in clinical information system (CIS), the terminology without enough synonyms may not be adequately suitable for the task of Chinese clinical term recognition. METHODS: This paper demonstrates a validated system to retrieve the Chinese term of clinical finding (CTCF) from CIS and map them to the corresponding concepts of international clinical nomenclature, such as SNOMED CT. The system focuses on the SNOMED CT with Chinese synonyms enrichment (SCCSE). The literal similarity and the diagnosis-related similarity metrics were used for concept mapping. Two CTCF recognition methods, the rule- and terminology-based approach (RTBA) and the conditional random field machine learner (CRF), were adopted to identify the concepts in physician notes. The system was validated against the history of present illness annotated by clinical experts. The RTBA and CRF could be combined to predict new CTCFs besides SCCSE persistently. RESULTS: Around 59,000 CTCF candidates were accepted as valid and 39,000 of them occurred at least once in the history of present illness. 3,729 of them were accordant with the description in referenced Chinese clinical nomenclature, which could cross map to other international nomenclature such as SNOMED CT. With the hybrid similarity metrics, another 7,454 valid CTCFs (synonyms) were succeeded in concept mapping. For CTCF recognition in physician notes, a series of experiments were performed to find out the best CRF feature set, which gained an F-score of 0.887. The RTBA achieved a better F-score of 0.919 by the CTCF dictionary created in this research. CONCLUSIONS: This research demonstrated that it is feasible to help the SNOMED CT with Chinese synonyms enrichment based on physician notes in CIS. With continuous maintenance of SCCSE, the CTCFs could be precisely retrieved from free text, and the CTCFs arranged in semantic hierarchy of SNOMED CT could greatly improve the meaningful use of electronic health record in China. The methodology is also useful for clinical synonyms enrichment in other languages.
format Online
Article
Text
id pubmed-5414139
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-54141392017-05-03 Enriching the international clinical nomenclature with Chinese daily used synonyms and concept recognition in physician notes Zhang, Rui Liu, Jialin Huang, Yong Wang, Miye Shi, Qingke Chen, Jun Zeng, Zhi BMC Med Inform Decis Mak Research Article BACKGROUND: It has been shown that the entities in everyday clinical text are often expressed in a way that varies from how they are expressed in the nomenclature. Owing to lots of synonyms, abbreviations, medical jargons or even misspellings in the daily used physician notes in clinical information system (CIS), the terminology without enough synonyms may not be adequately suitable for the task of Chinese clinical term recognition. METHODS: This paper demonstrates a validated system to retrieve the Chinese term of clinical finding (CTCF) from CIS and map them to the corresponding concepts of international clinical nomenclature, such as SNOMED CT. The system focuses on the SNOMED CT with Chinese synonyms enrichment (SCCSE). The literal similarity and the diagnosis-related similarity metrics were used for concept mapping. Two CTCF recognition methods, the rule- and terminology-based approach (RTBA) and the conditional random field machine learner (CRF), were adopted to identify the concepts in physician notes. The system was validated against the history of present illness annotated by clinical experts. The RTBA and CRF could be combined to predict new CTCFs besides SCCSE persistently. RESULTS: Around 59,000 CTCF candidates were accepted as valid and 39,000 of them occurred at least once in the history of present illness. 3,729 of them were accordant with the description in referenced Chinese clinical nomenclature, which could cross map to other international nomenclature such as SNOMED CT. With the hybrid similarity metrics, another 7,454 valid CTCFs (synonyms) were succeeded in concept mapping. For CTCF recognition in physician notes, a series of experiments were performed to find out the best CRF feature set, which gained an F-score of 0.887. The RTBA achieved a better F-score of 0.919 by the CTCF dictionary created in this research. CONCLUSIONS: This research demonstrated that it is feasible to help the SNOMED CT with Chinese synonyms enrichment based on physician notes in CIS. With continuous maintenance of SCCSE, the CTCFs could be precisely retrieved from free text, and the CTCFs arranged in semantic hierarchy of SNOMED CT could greatly improve the meaningful use of electronic health record in China. The methodology is also useful for clinical synonyms enrichment in other languages. BioMed Central 2017-05-02 /pmc/articles/PMC5414139/ /pubmed/28464923 http://dx.doi.org/10.1186/s12911-017-0455-z Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Zhang, Rui
Liu, Jialin
Huang, Yong
Wang, Miye
Shi, Qingke
Chen, Jun
Zeng, Zhi
Enriching the international clinical nomenclature with Chinese daily used synonyms and concept recognition in physician notes
title Enriching the international clinical nomenclature with Chinese daily used synonyms and concept recognition in physician notes
title_full Enriching the international clinical nomenclature with Chinese daily used synonyms and concept recognition in physician notes
title_fullStr Enriching the international clinical nomenclature with Chinese daily used synonyms and concept recognition in physician notes
title_full_unstemmed Enriching the international clinical nomenclature with Chinese daily used synonyms and concept recognition in physician notes
title_short Enriching the international clinical nomenclature with Chinese daily used synonyms and concept recognition in physician notes
title_sort enriching the international clinical nomenclature with chinese daily used synonyms and concept recognition in physician notes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5414139/
https://www.ncbi.nlm.nih.gov/pubmed/28464923
http://dx.doi.org/10.1186/s12911-017-0455-z
work_keys_str_mv AT zhangrui enrichingtheinternationalclinicalnomenclaturewithchinesedailyusedsynonymsandconceptrecognitioninphysiciannotes
AT liujialin enrichingtheinternationalclinicalnomenclaturewithchinesedailyusedsynonymsandconceptrecognitioninphysiciannotes
AT huangyong enrichingtheinternationalclinicalnomenclaturewithchinesedailyusedsynonymsandconceptrecognitioninphysiciannotes
AT wangmiye enrichingtheinternationalclinicalnomenclaturewithchinesedailyusedsynonymsandconceptrecognitioninphysiciannotes
AT shiqingke enrichingtheinternationalclinicalnomenclaturewithchinesedailyusedsynonymsandconceptrecognitioninphysiciannotes
AT chenjun enrichingtheinternationalclinicalnomenclaturewithchinesedailyusedsynonymsandconceptrecognitioninphysiciannotes
AT zengzhi enrichingtheinternationalclinicalnomenclaturewithchinesedailyusedsynonymsandconceptrecognitioninphysiciannotes