Cargando…

Natural language processing for identification of hypertrophic cardiomyopathy patients from cardiac magnetic resonance reports

BACKGROUND: Cardiac magnetic resonance (CMR) imaging is important for diagnosis and risk stratification of hypertrophic cardiomyopathy (HCM) patients. However, collection of information from large numbers of CMR reports by manual review is time-consuming, error-prone and costly. Natural language pro...

Descripción completa

Detalles Bibliográficos
Autores principales:	Dewaswala, Nakeya, Chen, David, Bhopalwala, Huzefa, Kaggal, Vinod C., Murphy, Sean P., Bos, J. Martijn, Geske, Jeffrey B., Gersh, Bernard J., Ommen, Steve R., Araoz, Philip A., Ackerman, Michael J., Arruda-Olson, Adelaide M.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2022
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9580188/ https://www.ncbi.nlm.nih.gov/pubmed/36258218 http://dx.doi.org/10.1186/s12911-022-02017-y

_version_	1784812337289494528
author	Dewaswala, Nakeya Chen, David Bhopalwala, Huzefa Kaggal, Vinod C. Murphy, Sean P. Bos, J. Martijn Geske, Jeffrey B. Gersh, Bernard J. Ommen, Steve R. Araoz, Philip A. Ackerman, Michael J. Arruda-Olson, Adelaide M.
author_facet	Dewaswala, Nakeya Chen, David Bhopalwala, Huzefa Kaggal, Vinod C. Murphy, Sean P. Bos, J. Martijn Geske, Jeffrey B. Gersh, Bernard J. Ommen, Steve R. Araoz, Philip A. Ackerman, Michael J. Arruda-Olson, Adelaide M.
author_sort	Dewaswala, Nakeya
collection	PubMed
description	BACKGROUND: Cardiac magnetic resonance (CMR) imaging is important for diagnosis and risk stratification of hypertrophic cardiomyopathy (HCM) patients. However, collection of information from large numbers of CMR reports by manual review is time-consuming, error-prone and costly. Natural language processing (NLP) is an artificial intelligence method for automated extraction of information from narrative text including text in CMR reports in electronic health records (EHR). Our objective was to assess whether NLP can accurately extract diagnosis of HCM from CMR reports. METHODS: An NLP system with two tiers was developed for information extraction from narrative text in CMR reports; the first tier extracted information regarding HCM diagnosis while the second extracted categorical and numeric concepts for HCM classification. We randomly allocated 200 HCM patients with CMR reports from 2004 to 2018 into training (100 patients with 185 CMR reports) and testing sets (100 patients with 206 reports). RESULTS: NLP algorithms demonstrated very high performance compared to manual annotation. The algorithm to extract HCM diagnosis had accuracy of 0.99. The accuracy for categorical concepts included HCM morphologic subtype 0.99, systolic anterior motion of the mitral valve 0.96, mitral regurgitation 0.93, left ventricular (LV) obstruction 0.94, location of obstruction 0.92, apical pouch 0.98, LV delayed enhancement 0.93, left atrial enlargement 0.99 and right atrial enlargement 0.98. Accuracy for numeric concepts included maximal LV wall thickness 0.96, LV mass 0.99, LV mass index 0.98, LV ejection fraction 0.98 and right ventricular ejection fraction 0.99. CONCLUSIONS: NLP identified and classified HCM from CMR narrative text reports with very high performance. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-022-02017-y.
format	Online Article Text
id	pubmed-9580188
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-95801882022-10-20 Natural language processing for identification of hypertrophic cardiomyopathy patients from cardiac magnetic resonance reports Dewaswala, Nakeya Chen, David Bhopalwala, Huzefa Kaggal, Vinod C. Murphy, Sean P. Bos, J. Martijn Geske, Jeffrey B. Gersh, Bernard J. Ommen, Steve R. Araoz, Philip A. Ackerman, Michael J. Arruda-Olson, Adelaide M. BMC Med Inform Decis Mak Research BACKGROUND: Cardiac magnetic resonance (CMR) imaging is important for diagnosis and risk stratification of hypertrophic cardiomyopathy (HCM) patients. However, collection of information from large numbers of CMR reports by manual review is time-consuming, error-prone and costly. Natural language processing (NLP) is an artificial intelligence method for automated extraction of information from narrative text including text in CMR reports in electronic health records (EHR). Our objective was to assess whether NLP can accurately extract diagnosis of HCM from CMR reports. METHODS: An NLP system with two tiers was developed for information extraction from narrative text in CMR reports; the first tier extracted information regarding HCM diagnosis while the second extracted categorical and numeric concepts for HCM classification. We randomly allocated 200 HCM patients with CMR reports from 2004 to 2018 into training (100 patients with 185 CMR reports) and testing sets (100 patients with 206 reports). RESULTS: NLP algorithms demonstrated very high performance compared to manual annotation. The algorithm to extract HCM diagnosis had accuracy of 0.99. The accuracy for categorical concepts included HCM morphologic subtype 0.99, systolic anterior motion of the mitral valve 0.96, mitral regurgitation 0.93, left ventricular (LV) obstruction 0.94, location of obstruction 0.92, apical pouch 0.98, LV delayed enhancement 0.93, left atrial enlargement 0.99 and right atrial enlargement 0.98. Accuracy for numeric concepts included maximal LV wall thickness 0.96, LV mass 0.99, LV mass index 0.98, LV ejection fraction 0.98 and right ventricular ejection fraction 0.99. CONCLUSIONS: NLP identified and classified HCM from CMR narrative text reports with very high performance. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-022-02017-y. BioMed Central 2022-10-18 /pmc/articles/PMC9580188/ /pubmed/36258218 http://dx.doi.org/10.1186/s12911-022-02017-y Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle	Research Dewaswala, Nakeya Chen, David Bhopalwala, Huzefa Kaggal, Vinod C. Murphy, Sean P. Bos, J. Martijn Geske, Jeffrey B. Gersh, Bernard J. Ommen, Steve R. Araoz, Philip A. Ackerman, Michael J. Arruda-Olson, Adelaide M. Natural language processing for identification of hypertrophic cardiomyopathy patients from cardiac magnetic resonance reports
title	Natural language processing for identification of hypertrophic cardiomyopathy patients from cardiac magnetic resonance reports
title_full	Natural language processing for identification of hypertrophic cardiomyopathy patients from cardiac magnetic resonance reports
title_fullStr	Natural language processing for identification of hypertrophic cardiomyopathy patients from cardiac magnetic resonance reports
title_full_unstemmed	Natural language processing for identification of hypertrophic cardiomyopathy patients from cardiac magnetic resonance reports
title_short	Natural language processing for identification of hypertrophic cardiomyopathy patients from cardiac magnetic resonance reports
title_sort	natural language processing for identification of hypertrophic cardiomyopathy patients from cardiac magnetic resonance reports
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9580188/ https://www.ncbi.nlm.nih.gov/pubmed/36258218 http://dx.doi.org/10.1186/s12911-022-02017-y
work_keys_str_mv	AT dewaswalanakeya naturallanguageprocessingforidentificationofhypertrophiccardiomyopathypatientsfromcardiacmagneticresonancereports AT chendavid naturallanguageprocessingforidentificationofhypertrophiccardiomyopathypatientsfromcardiacmagneticresonancereports AT bhopalwalahuzefa naturallanguageprocessingforidentificationofhypertrophiccardiomyopathypatientsfromcardiacmagneticresonancereports AT kaggalvinodc naturallanguageprocessingforidentificationofhypertrophiccardiomyopathypatientsfromcardiacmagneticresonancereports AT murphyseanp naturallanguageprocessingforidentificationofhypertrophiccardiomyopathypatientsfromcardiacmagneticresonancereports AT bosjmartijn naturallanguageprocessingforidentificationofhypertrophiccardiomyopathypatientsfromcardiacmagneticresonancereports AT geskejeffreyb naturallanguageprocessingforidentificationofhypertrophiccardiomyopathypatientsfromcardiacmagneticresonancereports AT gershbernardj naturallanguageprocessingforidentificationofhypertrophiccardiomyopathypatientsfromcardiacmagneticresonancereports AT ommenstever naturallanguageprocessingforidentificationofhypertrophiccardiomyopathypatientsfromcardiacmagneticresonancereports AT araozphilipa naturallanguageprocessingforidentificationofhypertrophiccardiomyopathypatientsfromcardiacmagneticresonancereports AT ackermanmichaelj naturallanguageprocessingforidentificationofhypertrophiccardiomyopathypatientsfromcardiacmagneticresonancereports AT arrudaolsonadelaidem naturallanguageprocessingforidentificationofhypertrophiccardiomyopathypatientsfromcardiacmagneticresonancereports

Natural language processing for identification of hypertrophic cardiomyopathy patients from cardiac magnetic resonance reports

Ejemplares similares