Electronic medical record-based deep data cleaning and phenotyping improve the diagnostic validity and mortality assessment of infective endocarditis: medical big data initiative of CMUH

Bibliographic Details
Main Authors: Chiang, Hsiu-Yin; Liang, Li-Ying; Lin, Che-Chen; Chen, Yi-Jin; Wu, Min-Yen; Chen, Sheng-Hsuan; Wu, Pin-Hua; Kuo, Chin-Chi; Chi, Chih-Yu
Format: Online Article Text
Language: English
Published: China Medical University 2021
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8823496/
https://www.ncbi.nlm.nih.gov/pubmed/35223412
http://dx.doi.org/10.37796/2211-8039.1267
_version_ 1784646814439309312
author Chiang, Hsiu-Yin
Liang, Li-Ying
Lin, Che-Chen
Chen, Yi-Jin
Wu, Min-Yen
Chen, Sheng-Hsuan
Wu, Pin-Hua
Kuo, Chin-Chi
Chi, Chih-Yu
author_sort Chiang, Hsiu-Yin
collection PubMed
description BACKGROUND: International Classification of Diseases (ICD) code–based claims databases are often used to study infective endocarditis (IE). However, the quality of ICD coding can influence the reliability of IE research. The impact of complementing the ICD-only approach with data extracted from electronic medical records (EMRs) has yet to be explored. METHODS: We selected adult patients with discharge ICD codes for IE (ICD-9: 421, 112.81, 036.42, 098.84, 115.04, 115.14, 115.94, 424.9; ICD-10: I33, I38, I39) during 2005–2016 at China Medical University Hospital. Data were extracted on the basis of the modified Duke criteria to establish a reference group comprising patients with definite or possible IE. Clinical characteristics and in-hospital mortality were compared between ICD-identified and Duke-confirmed cases. The positive predictive value (PPV) was used to quantify the IE identification performance of various phenotyping algorithms. RESULTS: A total of 593 patients with discharge ICD codes for IE were identified, of whom only 56.7% met the modified Duke criteria. The crude in-hospital mortality rates for Duke-confirmed and Duke-rejected IE were 24.4% and 8.2%, respectively. The adjusted in-hospital mortality for ICD-identified IE was lower than that for Duke-confirmed IE by a difference of 5.1%. The best PPV (0.90, 95% CI 0.86–0.93) was achieved when major components of the Duke criteria (positive blood culture and vegetation) were integrated with ICD codes. CONCLUSION: Integrating EMR data can considerably improve the accuracy of ICD-only approaches in phenotyping IE, which can improve the validity of EMR-based studies and their applications, including real-time surveillance and clinical decision support.
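The PPV reported in the abstract is the share of algorithm-flagged patients who are confirmed by the reference standard (here, the modified Duke criteria). The following is a minimal sketch of that calculation, not the authors' code. The count of 336 confirmed cases is back-calculated from the reported 593 patients and 56.7%, and the Wilson score interval is an assumption, since the abstract does not say how its confidence intervals were computed. The function names are hypothetical.

```python
from math import sqrt


def ppv(true_positives: int, false_positives: int) -> float:
    """Positive predictive value: confirmed cases / all flagged cases."""
    flagged = true_positives + false_positives
    if flagged == 0:
        raise ValueError("no flagged cases; PPV is undefined")
    return true_positives / flagged


def wilson_interval(successes: int, n: int, z: float = 1.959964):
    """Wilson score interval for a proportion (z defaults to a 95% level).

    Assumption: the abstract does not name its CI method; Wilson is used
    here only as a common choice for proportions.
    """
    if n == 0:
        raise ValueError("n must be positive")
    p = successes / n
    denom = 1 + z * z / n
    centre = (p + z * z / (2 * n)) / denom
    half = z * sqrt(p * (1 - p) / n + z * z / (4 * n * n)) / denom
    return centre - half, centre + half


# ICD-only approach: 593 patients flagged by discharge codes.
# 336 confirmed is back-calculated from the reported 56.7% (an assumption).
icd_flagged = 593
duke_confirmed = 336

icd_only_ppv = ppv(duke_confirmed, icd_flagged - duke_confirmed)
low, high = wilson_interval(duke_confirmed, icd_flagged)
print(f"ICD-only PPV: {icd_only_ppv:.3f} (95% CI {low:.3f} to {high:.3f})")
```

The same two functions apply to any combined algorithm, such as ICD codes plus positive blood culture plus vegetation, once the flagged and confirmed counts for that algorithm are known. Those counts are not given in the abstract.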
format Online
Article
Text
id pubmed-8823496
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher China Medical University
record_format MEDLINE/PubMed
spelling pubmed-8823496 2022-02-25 Biomedicine (Taipei) Original Article China Medical University 2021-09-01 /pmc/articles/PMC8823496/ /pubmed/35223412 http://dx.doi.org/10.37796/2211-8039.1267 Text en © the Author(s). This is an open access article under the CC BY license (https://creativecommons.org/licenses/by/4.0/).
title Electronic medical record-based deep data cleaning and phenotyping improve the diagnostic validity and mortality assessment of infective endocarditis: medical big data initiative of CMUH
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8823496/
https://www.ncbi.nlm.nih.gov/pubmed/35223412
http://dx.doi.org/10.37796/2211-8039.1267