Cargando…
Cleaning of anthropometric data from PCORnet electronic health records using automated algorithms
OBJECTIVE: To demonstrate the utility of growthcleanr, an anthropometric data cleaning method designed for electronic health records (EHR). MATERIALS AND METHODS: We used all available pediatric and adult height and weight data from an ongoing observational study that includes EHR data from 15 healt...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9629892/ https://www.ncbi.nlm.nih.gov/pubmed/36339053 http://dx.doi.org/10.1093/jamiaopen/ooac089 |
_version_ | 1784823488881623040 |
---|---|
author | Lin, Pi-I D Rifas-Shiman, Sheryl L Aris, Izzuddin M Daley, Matthew F Janicke, David M Heerman, William J Chudnov, Daniel L Freedman, David S Block, Jason P |
author_facet | Lin, Pi-I D Rifas-Shiman, Sheryl L Aris, Izzuddin M Daley, Matthew F Janicke, David M Heerman, William J Chudnov, Daniel L Freedman, David S Block, Jason P |
author_sort | Lin, Pi-I D |
collection | PubMed |
description | OBJECTIVE: To demonstrate the utility of growthcleanr, an anthropometric data cleaning method designed for electronic health records (EHR). MATERIALS AND METHODS: We used all available pediatric and adult height and weight data from an ongoing observational study that includes EHR data from 15 healthcare systems and applied growthcleanr to identify outliers and errors and compared its performance in pediatric data with 2 other pediatric data cleaning methods: (1) conditional percentile (cp) and (2) PaEdiatric ANthropometric measurement Outlier Flagging pipeline (peanof). RESULTS: 687 226 children (<20 years) and 3 267 293 adults contributed 71 246 369 weight and 51 525 487 height measurements. growthcleanr flagged 18% of pediatric and 12% of adult measurements for exclusion, mostly as carried-forward measures for pediatric data and duplicates for adult and pediatric data. After removing the flagged measurements, 0.5% and 0.6% of the pediatric heights and weights and 0.3% and 1.4% of the adult heights and weights, respectively, were biologically implausible according to the CDC and other established cut points. Compared with other pediatric cleaning methods, growthcleanr flagged the most measurements for exclusion; however, it did not flag some more extreme measurements. The prevalence of severe pediatric obesity was 9.0%, 9.2%, and 8.0% after cleaning by growthcleanr, cp, and peanof, respectively. CONCLUSION: growthcleanr is useful for cleaning pediatric and adult height and weight data. It is the only method with the ability to clean adult data and identify carried-forward and duplicates, which are prevalent in EHR. Findings of this study can be used to improve the growthcleanr algorithm. |
format | Online Article Text |
id | pubmed-9629892 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-96298922022-11-04 Cleaning of anthropometric data from PCORnet electronic health records using automated algorithms Lin, Pi-I D Rifas-Shiman, Sheryl L Aris, Izzuddin M Daley, Matthew F Janicke, David M Heerman, William J Chudnov, Daniel L Freedman, David S Block, Jason P JAMIA Open Research and Applications OBJECTIVE: To demonstrate the utility of growthcleanr, an anthropometric data cleaning method designed for electronic health records (EHR). MATERIALS AND METHODS: We used all available pediatric and adult height and weight data from an ongoing observational study that includes EHR data from 15 healthcare systems and applied growthcleanr to identify outliers and errors and compared its performance in pediatric data with 2 other pediatric data cleaning methods: (1) conditional percentile (cp) and (2) PaEdiatric ANthropometric measurement Outlier Flagging pipeline (peanof). RESULTS: 687 226 children (<20 years) and 3 267 293 adults contributed 71 246 369 weight and 51 525 487 height measurements. growthcleanr flagged 18% of pediatric and 12% of adult measurements for exclusion, mostly as carried-forward measures for pediatric data and duplicates for adult and pediatric data. After removing the flagged measurements, 0.5% and 0.6% of the pediatric heights and weights and 0.3% and 1.4% of the adult heights and weights, respectively, were biologically implausible according to the CDC and other established cut points. Compared with other pediatric cleaning methods, growthcleanr flagged the most measurements for exclusion; however, it did not flag some more extreme measurements. The prevalence of severe pediatric obesity was 9.0%, 9.2%, and 8.0% after cleaning by growthcleanr, cp, and peanof, respectively. CONCLUSION: growthcleanr is useful for cleaning pediatric and adult height and weight data. It is the only method with the ability to clean adult data and identify carried-forward and duplicates, which are prevalent in EHR. Findings of this study can be used to improve the growthcleanr algorithm. Oxford University Press 2022-11-02 /pmc/articles/PMC9629892/ /pubmed/36339053 http://dx.doi.org/10.1093/jamiaopen/ooac089 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of the American Medical Informatics Association. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Research and Applications Lin, Pi-I D Rifas-Shiman, Sheryl L Aris, Izzuddin M Daley, Matthew F Janicke, David M Heerman, William J Chudnov, Daniel L Freedman, David S Block, Jason P Cleaning of anthropometric data from PCORnet electronic health records using automated algorithms |
title | Cleaning of anthropometric data from PCORnet electronic health records using automated algorithms |
title_full | Cleaning of anthropometric data from PCORnet electronic health records using automated algorithms |
title_fullStr | Cleaning of anthropometric data from PCORnet electronic health records using automated algorithms |
title_full_unstemmed | Cleaning of anthropometric data from PCORnet electronic health records using automated algorithms |
title_short | Cleaning of anthropometric data from PCORnet electronic health records using automated algorithms |
title_sort | cleaning of anthropometric data from pcornet electronic health records using automated algorithms |
topic | Research and Applications |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9629892/ https://www.ncbi.nlm.nih.gov/pubmed/36339053 http://dx.doi.org/10.1093/jamiaopen/ooac089 |
work_keys_str_mv | AT linpiid cleaningofanthropometricdatafrompcornetelectronichealthrecordsusingautomatedalgorithms AT rifasshimansheryll cleaningofanthropometricdatafrompcornetelectronichealthrecordsusingautomatedalgorithms AT arisizzuddinm cleaningofanthropometricdatafrompcornetelectronichealthrecordsusingautomatedalgorithms AT daleymatthewf cleaningofanthropometricdatafrompcornetelectronichealthrecordsusingautomatedalgorithms AT janickedavidm cleaningofanthropometricdatafrompcornetelectronichealthrecordsusingautomatedalgorithms AT heermanwilliamj cleaningofanthropometricdatafrompcornetelectronichealthrecordsusingautomatedalgorithms AT chudnovdaniell cleaningofanthropometricdatafrompcornetelectronichealthrecordsusingautomatedalgorithms AT freedmandavids cleaningofanthropometricdatafrompcornetelectronichealthrecordsusingautomatedalgorithms AT blockjasonp cleaningofanthropometricdatafrompcornetelectronichealthrecordsusingautomatedalgorithms |