Electronic medical record-based deep data cleaning and phenotyping improve the diagnostic validity and mortality assessment of infective endocarditis: medical big data initiative of CMUH
BACKGROUND: International Classification of Diseases (ICD) code–based claims databases are often used to study infective endocarditis (IE). However, the quality of ICD coding can influence the reliability of IE research. The impact of complementing the ICD-only approach with data extracted from elec...
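Main authors: | Chiang, Hsiu-Yin; Liang, Li-Ying; Lin, Che-Chen; Chen, Yi-Jin; Wu, Min-Yen; Chen, Sheng-Hsuan; Wu, Pin-Hua; Kuo, Chin-Chi; Chi, Chih-Yu |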
---|---|
Format: | Online Article Text |
Language: | English |
Published: | China Medical University 2021 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8823496/ https://www.ncbi.nlm.nih.gov/pubmed/35223412 http://dx.doi.org/10.37796/2211-8039.1267 |
_version_ | 1784646814439309312 |
---|---|
author | Chiang, Hsiu-Yin Liang, Li-Ying Lin, Che-Chen Chen, Yi-Jin Wu, Min-Yen Chen, Sheng-Hsuan Wu, Pin-Hua Kuo, Chin-Chi Chi, Chih-Yu |
author_sort | Chiang, Hsiu-Yin |
collection | PubMed |
description | BACKGROUND: International Classification of Diseases (ICD) code–based claims databases are often used to study infective endocarditis (IE). However, the quality of ICD coding can influence the reliability of IE research. The impact of complementing the ICD-only approach with data extracted from electronic medical records (EMRs) has yet to be explored. METHODS: We selected the information of adult patients with discharge ICD codes for IE (ICD-9: 421, 112.81, 036.42, 098.84, 115.04, 115.14, 115.94, 424.9; ICD-10: I33, I38, I39) during 2005–2016 in China Medical University Hospital. Data extraction was conducted on the basis of the modified Duke criteria to establish a reference group comprising patients with definite or possible IE. Clinical characteristics and in-hospital mortality were compared between ICD-identified and Duke-confirmed cases. The positive predictive value (PPV) was used to quantify the IE identification performance of various phenotyping algorithms. RESULTS: A total of 593 patients with discharge ICD codes for IE were identified, of whom only 56.7% met the modified Duke criteria. The crude in-hospital mortality rates for Duke-confirmed and Duke-rejected IE were 24.4% and 8.2%, respectively. The adjusted in-hospital mortality for ICD-identified IE was lower than that for Duke-confirmed IE by a difference of 5.1%. The best PPV was achieved (0.90, 95% CI 0.86–0.93) when major components of the Duke criteria (positive blood culture and vegetation) were integrated with ICD codes. CONCLUSION: Integrating EMR data can considerably improve the accuracy of ICD-only approaches in phenotyping IE, which can improve the validity of EMR-based studies and their applications, including real-time surveillance and clinical decision support. |
format | Online Article Text |
id | pubmed-8823496 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | China Medical University |
record_format | MEDLINE/PubMed |
spelling | pubmed-8823496 2022-02-25 Electronic medical record-based deep data cleaning and phenotyping improve the diagnostic validity and mortality assessment of infective endocarditis: medical big data initiative of CMUH Chiang, Hsiu-Yin Liang, Li-Ying Lin, Che-Chen Chen, Yi-Jin Wu, Min-Yen Chen, Sheng-Hsuan Wu, Pin-Hua Kuo, Chin-Chi Chi, Chih-Yu Biomedicine (Taipei) Original Article BACKGROUND: International Classification of Diseases (ICD) code–based claims databases are often used to study infective endocarditis (IE). However, the quality of ICD coding can influence the reliability of IE research. The impact of complementing the ICD-only approach with data extracted from electronic medical records (EMRs) has yet to be explored. METHODS: We selected the information of adult patients with discharge ICD codes for IE (ICD-9: 421, 112.81, 036.42, 098.84, 115.04, 115.14, 115.94, 424.9; ICD-10: I33, I38, I39) during 2005–2016 in China Medical University Hospital. Data extraction was conducted on the basis of the modified Duke criteria to establish a reference group comprising patients with definite or possible IE. Clinical characteristics and in-hospital mortality were compared between ICD-identified and Duke-confirmed cases. The positive predictive value (PPV) was used to quantify the IE identification performance of various phenotyping algorithms. RESULTS: A total of 593 patients with discharge ICD codes for IE were identified, of whom only 56.7% met the modified Duke criteria. The crude in-hospital mortality rates for Duke-confirmed and Duke-rejected IE were 24.4% and 8.2%, respectively. The adjusted in-hospital mortality for ICD-identified IE was lower than that for Duke-confirmed IE by a difference of 5.1%. The best PPV was achieved (0.90, 95% CI 0.86–0.93) when major components of the Duke criteria (positive blood culture and vegetation) were integrated with ICD codes. CONCLUSION: Integrating EMR data can considerably improve the accuracy of ICD-only approaches in phenotyping IE, which can improve the validity of EMR-based studies and their applications, including real-time surveillance and clinical decision support. China Medical University 2021-09-01 /pmc/articles/PMC8823496/ /pubmed/35223412 http://dx.doi.org/10.37796/2211-8039.1267 Text en © the Author(s) https://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
title | Electronic medical record-based deep data cleaning and phenotyping improve the diagnostic validity and mortality assessment of infective endocarditis: medical big data initiative of CMUH |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8823496/ https://www.ncbi.nlm.nih.gov/pubmed/35223412 http://dx.doi.org/10.37796/2211-8039.1267 |
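
The abstract describes selecting patients by discharge ICD code and scoring that selection by its positive predictive value against the Duke-based reference group. The following is a minimal Python sketch of those two steps, not the authors' implementation. The code lists are taken from the METHODS text; the matching rule (a listed code also matches its sub-codes), the function names, and the use of a Wilson score interval are assumptions made for illustration. The count of 336 confirmed cases is back-calculated from the reported 56.7% of 593 and is likewise an assumption.

```python
import math

# ICD codes for IE as listed in the abstract's METHODS section.
IE_ICD9 = ("421", "112.81", "036.42", "098.84", "115.04", "115.14", "115.94", "424.9")
IE_ICD10 = ("I33", "I38", "I39")


def is_ie_code(code: str) -> bool:
    """Return True if a discharge code falls under one of the listed IE codes.

    Assumption: a listed code matches itself and any of its sub-codes
    (e.g. "421" matches "421.0"); the paper does not state its matching rule.
    """
    c = code.strip().upper()
    for p in IE_ICD9 + IE_ICD10:
        if c == p or c.startswith(p + ".") or ("." in p and c.startswith(p)):
            return True
    return False


def ppv_wilson(true_pos: int, flagged: int, z: float = 1.959964):
    """PPV = confirmed cases / flagged cases, with a Wilson score interval.

    Assumption: the Wilson interval is used here for illustration only; the
    source does not say which interval method the authors applied.
    """
    p = true_pos / flagged
    denom = 1.0 + z * z / flagged
    center = (p + z * z / (2 * flagged)) / denom
    half = z * math.sqrt(p * (1 - p) / flagged + z * z / (4 * flagged**2)) / denom
    return p, center - half, center + half


# 336 is an assumed count consistent with "56.7% of 593", not a reported figure.
ppv, lower, upper = ppv_wilson(336, 593)
print(f"ICD-only PPV: {ppv:.3f} (95% CI {lower:.3f} to {upper:.3f})")
```

An algorithm that adds Duke major criteria to the ICD filter would be scored the same way, by passing its own confirmed and flagged counts to `ppv_wilson`.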