Cargando…

Lab indicators standardization method for the regional healthcare platform: a case study on heart failure

BACKGROUND: Laboratory indicator test results in electronic health records have been applied to many clinical big data analysis. However, it is quite common that the same laboratory examination item (i.e., lab indicator) is presented using different names in Chinese due to the translation problem an...

Descripción completa

Detalles Bibliográficos
Autores principales: Liang, Ming, Zhang, ZhiXing, Zhang, JiaYing, Ruan, Tong, Ye, Qi, He, Ping
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7739485/
https://www.ncbi.nlm.nih.gov/pubmed/33323114
http://dx.doi.org/10.1186/s12911-020-01324-6
_version_ 1783623340596920320
author Liang, Ming
Zhang, ZhiXing
Zhang, JiaYing
Ruan, Tong
Ye, Qi
He, Ping
author_facet Liang, Ming
Zhang, ZhiXing
Zhang, JiaYing
Ruan, Tong
Ye, Qi
He, Ping
author_sort Liang, Ming
collection PubMed
description BACKGROUND: Laboratory indicator test results in electronic health records have been applied to many clinical big data analysis. However, it is quite common that the same laboratory examination item (i.e., lab indicator) is presented using different names in Chinese due to the translation problem and the habit problem of various hospitals, which results in distortion of analysis results. METHODS: A framework with a recall model and a binary classification model is proposed, which could reduce the alignment scale and improve the accuracy of lab indicator normalization. To reduce alignment scale, tf-idf is used for candidate selection. To assure the accuracy of output, we utilize enhanced sequential inference model for binary classification. And active learning is applied with a selection strategy which is proposed for reducing annotation cost. RESULTS: Since our indicator standardization method mainly focuses on Chinese indicator inconsistency, we perform our experiment on Shanghai Hospital Development Center and select clinical data from 8 hospitals. The method achieves a F1-score 92.08[Formula: see text] in our final binary classification. As for active learning, the new strategy proposed performs better than random baseline and could outperform the result trained on full data with only 43[Formula: see text] training data. A case study on heart failure clinic analysis conducted on the sub-dataset collected from SHDC shows that our proposed method is practical in the application with good performance. CONCLUSION: This work demonstrates that the structure we proposed can be effectively applied to lab indicator normalization. And active learning is also suitable for this task for cost reduction. Such a method is also valuable in data cleaning, data mining, text extracting and entity alignment.
format Online
Article
Text
id pubmed-7739485
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-77394852020-12-17 Lab indicators standardization method for the regional healthcare platform: a case study on heart failure Liang, Ming Zhang, ZhiXing Zhang, JiaYing Ruan, Tong Ye, Qi He, Ping BMC Med Inform Decis Mak Research BACKGROUND: Laboratory indicator test results in electronic health records have been applied to many clinical big data analysis. However, it is quite common that the same laboratory examination item (i.e., lab indicator) is presented using different names in Chinese due to the translation problem and the habit problem of various hospitals, which results in distortion of analysis results. METHODS: A framework with a recall model and a binary classification model is proposed, which could reduce the alignment scale and improve the accuracy of lab indicator normalization. To reduce alignment scale, tf-idf is used for candidate selection. To assure the accuracy of output, we utilize enhanced sequential inference model for binary classification. And active learning is applied with a selection strategy which is proposed for reducing annotation cost. RESULTS: Since our indicator standardization method mainly focuses on Chinese indicator inconsistency, we perform our experiment on Shanghai Hospital Development Center and select clinical data from 8 hospitals. The method achieves a F1-score 92.08[Formula: see text] in our final binary classification. As for active learning, the new strategy proposed performs better than random baseline and could outperform the result trained on full data with only 43[Formula: see text] training data. A case study on heart failure clinic analysis conducted on the sub-dataset collected from SHDC shows that our proposed method is practical in the application with good performance. CONCLUSION: This work demonstrates that the structure we proposed can be effectively applied to lab indicator normalization. And active learning is also suitable for this task for cost reduction. Such a method is also valuable in data cleaning, data mining, text extracting and entity alignment. BioMed Central 2020-12-15 /pmc/articles/PMC7739485/ /pubmed/33323114 http://dx.doi.org/10.1186/s12911-020-01324-6 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Liang, Ming
Zhang, ZhiXing
Zhang, JiaYing
Ruan, Tong
Ye, Qi
He, Ping
Lab indicators standardization method for the regional healthcare platform: a case study on heart failure
title Lab indicators standardization method for the regional healthcare platform: a case study on heart failure
title_full Lab indicators standardization method for the regional healthcare platform: a case study on heart failure
title_fullStr Lab indicators standardization method for the regional healthcare platform: a case study on heart failure
title_full_unstemmed Lab indicators standardization method for the regional healthcare platform: a case study on heart failure
title_short Lab indicators standardization method for the regional healthcare platform: a case study on heart failure
title_sort lab indicators standardization method for the regional healthcare platform: a case study on heart failure
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7739485/
https://www.ncbi.nlm.nih.gov/pubmed/33323114
http://dx.doi.org/10.1186/s12911-020-01324-6
work_keys_str_mv AT liangming labindicatorsstandardizationmethodfortheregionalhealthcareplatformacasestudyonheartfailure
AT zhangzhixing labindicatorsstandardizationmethodfortheregionalhealthcareplatformacasestudyonheartfailure
AT zhangjiaying labindicatorsstandardizationmethodfortheregionalhealthcareplatformacasestudyonheartfailure
AT ruantong labindicatorsstandardizationmethodfortheregionalhealthcareplatformacasestudyonheartfailure
AT yeqi labindicatorsstandardizationmethodfortheregionalhealthcareplatformacasestudyonheartfailure
AT heping labindicatorsstandardizationmethodfortheregionalhealthcareplatformacasestudyonheartfailure