Cargando…

Cleaning of anthropometric data from PCORnet electronic health records using automated algorithms

OBJECTIVE: To demonstrate the utility of growthcleanr, an anthropometric data cleaning method designed for electronic health records (EHR). MATERIALS AND METHODS: We used all available pediatric and adult height and weight data from an ongoing observational study that includes EHR data from 15 healt...

Descripción completa

Detalles Bibliográficos
Autores principales: Lin, Pi-I D, Rifas-Shiman, Sheryl L, Aris, Izzuddin M, Daley, Matthew F, Janicke, David M, Heerman, William J, Chudnov, Daniel L, Freedman, David S, Block, Jason P
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9629892/
https://www.ncbi.nlm.nih.gov/pubmed/36339053
http://dx.doi.org/10.1093/jamiaopen/ooac089
_version_ 1784823488881623040
author Lin, Pi-I D
Rifas-Shiman, Sheryl L
Aris, Izzuddin M
Daley, Matthew F
Janicke, David M
Heerman, William J
Chudnov, Daniel L
Freedman, David S
Block, Jason P
author_facet Lin, Pi-I D
Rifas-Shiman, Sheryl L
Aris, Izzuddin M
Daley, Matthew F
Janicke, David M
Heerman, William J
Chudnov, Daniel L
Freedman, David S
Block, Jason P
author_sort Lin, Pi-I D
collection PubMed
description OBJECTIVE: To demonstrate the utility of growthcleanr, an anthropometric data cleaning method designed for electronic health records (EHR). MATERIALS AND METHODS: We used all available pediatric and adult height and weight data from an ongoing observational study that includes EHR data from 15 healthcare systems and applied growthcleanr to identify outliers and errors and compared its performance in pediatric data with 2 other pediatric data cleaning methods: (1) conditional percentile (cp) and (2) PaEdiatric ANthropometric measurement Outlier Flagging pipeline (peanof). RESULTS: 687 226 children (<20 years) and 3 267 293 adults contributed 71 246 369 weight and 51 525 487 height measurements. growthcleanr flagged 18% of pediatric and 12% of adult measurements for exclusion, mostly as carried-forward measures for pediatric data and duplicates for adult and pediatric data. After removing the flagged measurements, 0.5% and 0.6% of the pediatric heights and weights and 0.3% and 1.4% of the adult heights and weights, respectively, were biologically implausible according to the CDC and other established cut points. Compared with other pediatric cleaning methods, growthcleanr flagged the most measurements for exclusion; however, it did not flag some more extreme measurements. The prevalence of severe pediatric obesity was 9.0%, 9.2%, and 8.0% after cleaning by growthcleanr, cp, and peanof, respectively. CONCLUSION: growthcleanr is useful for cleaning pediatric and adult height and weight data. It is the only method with the ability to clean adult data and identify carried-forward and duplicates, which are prevalent in EHR. Findings of this study can be used to improve the growthcleanr algorithm.
format Online
Article
Text
id pubmed-9629892
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-96298922022-11-04 Cleaning of anthropometric data from PCORnet electronic health records using automated algorithms Lin, Pi-I D Rifas-Shiman, Sheryl L Aris, Izzuddin M Daley, Matthew F Janicke, David M Heerman, William J Chudnov, Daniel L Freedman, David S Block, Jason P JAMIA Open Research and Applications OBJECTIVE: To demonstrate the utility of growthcleanr, an anthropometric data cleaning method designed for electronic health records (EHR). MATERIALS AND METHODS: We used all available pediatric and adult height and weight data from an ongoing observational study that includes EHR data from 15 healthcare systems and applied growthcleanr to identify outliers and errors and compared its performance in pediatric data with 2 other pediatric data cleaning methods: (1) conditional percentile (cp) and (2) PaEdiatric ANthropometric measurement Outlier Flagging pipeline (peanof). RESULTS: 687 226 children (<20 years) and 3 267 293 adults contributed 71 246 369 weight and 51 525 487 height measurements. growthcleanr flagged 18% of pediatric and 12% of adult measurements for exclusion, mostly as carried-forward measures for pediatric data and duplicates for adult and pediatric data. After removing the flagged measurements, 0.5% and 0.6% of the pediatric heights and weights and 0.3% and 1.4% of the adult heights and weights, respectively, were biologically implausible according to the CDC and other established cut points. Compared with other pediatric cleaning methods, growthcleanr flagged the most measurements for exclusion; however, it did not flag some more extreme measurements. The prevalence of severe pediatric obesity was 9.0%, 9.2%, and 8.0% after cleaning by growthcleanr, cp, and peanof, respectively. CONCLUSION: growthcleanr is useful for cleaning pediatric and adult height and weight data. It is the only method with the ability to clean adult data and identify carried-forward and duplicates, which are prevalent in EHR. Findings of this study can be used to improve the growthcleanr algorithm. Oxford University Press 2022-11-02 /pmc/articles/PMC9629892/ /pubmed/36339053 http://dx.doi.org/10.1093/jamiaopen/ooac089 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of the American Medical Informatics Association. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Research and Applications
Lin, Pi-I D
Rifas-Shiman, Sheryl L
Aris, Izzuddin M
Daley, Matthew F
Janicke, David M
Heerman, William J
Chudnov, Daniel L
Freedman, David S
Block, Jason P
Cleaning of anthropometric data from PCORnet electronic health records using automated algorithms
title Cleaning of anthropometric data from PCORnet electronic health records using automated algorithms
title_full Cleaning of anthropometric data from PCORnet electronic health records using automated algorithms
title_fullStr Cleaning of anthropometric data from PCORnet electronic health records using automated algorithms
title_full_unstemmed Cleaning of anthropometric data from PCORnet electronic health records using automated algorithms
title_short Cleaning of anthropometric data from PCORnet electronic health records using automated algorithms
title_sort cleaning of anthropometric data from pcornet electronic health records using automated algorithms
topic Research and Applications
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9629892/
https://www.ncbi.nlm.nih.gov/pubmed/36339053
http://dx.doi.org/10.1093/jamiaopen/ooac089
work_keys_str_mv AT linpiid cleaningofanthropometricdatafrompcornetelectronichealthrecordsusingautomatedalgorithms
AT rifasshimansheryll cleaningofanthropometricdatafrompcornetelectronichealthrecordsusingautomatedalgorithms
AT arisizzuddinm cleaningofanthropometricdatafrompcornetelectronichealthrecordsusingautomatedalgorithms
AT daleymatthewf cleaningofanthropometricdatafrompcornetelectronichealthrecordsusingautomatedalgorithms
AT janickedavidm cleaningofanthropometricdatafrompcornetelectronichealthrecordsusingautomatedalgorithms
AT heermanwilliamj cleaningofanthropometricdatafrompcornetelectronichealthrecordsusingautomatedalgorithms
AT chudnovdaniell cleaningofanthropometricdatafrompcornetelectronichealthrecordsusingautomatedalgorithms
AT freedmandavids cleaningofanthropometricdatafrompcornetelectronichealthrecordsusingautomatedalgorithms
AT blockjasonp cleaningofanthropometricdatafrompcornetelectronichealthrecordsusingautomatedalgorithms