Early detection of autism spectrum disorder in young children with machine learning using medical claims data
OBJECTIVES: Early diagnosis and intervention are keys for improving long-term outcomes of children with autism spectrum disorder (ASD). However, existing screening tools have shown insufficient accuracy. Our objective is to predict the risk of ASD in young children between 18 months and 30 months ba...
Main authors: | Chen, Yu-Hsin; Chen, Qiushi; Kong, Lan; Liu, Guodong |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | BMJ Publishing Group, 2022 |
Subjects: | Original Research |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9462117/ http://dx.doi.org/10.1136/bmjhci-2022-100544 |
_version_ | 1784787107942760448 |
---|---|
author | Chen, Yu-Hsin Chen, Qiushi Kong, Lan Liu, Guodong |
author_facet | Chen, Yu-Hsin Chen, Qiushi Kong, Lan Liu, Guodong |
author_sort | Chen, Yu-Hsin |
collection | PubMed |
description | OBJECTIVES: Early diagnosis and intervention are keys for improving long-term outcomes of children with autism spectrum disorder (ASD). However, existing screening tools have shown insufficient accuracy. Our objective is to predict the risk of ASD in young children between 18 months and 30 months based on their medical histories using real-world health claims data. METHODS: Using the MarketScan Health Claims Database 2005–2016, we identified 12 743 children with ASD and a random sample of 25 833 children without ASD as our study cohort. We developed logistic regression (LR) with least absolute shrinkage and selection operator and random forest (RF) models for predicting ASD diagnosis at ages of 18–30 months, using demographics, medical diagnoses and healthcare service procedures extracted from individual’s medical claims during early years postbirth as predictor variables. RESULTS: For predicting ASD diagnosis at age of 24 months, the LR and RF models achieved the area under the receiver operating characteristic curve (AUROC) of 0.758 and 0.775, respectively. Prediction accuracy further increased with age. With predictor variables separated by outpatient and inpatient visits, the RF model for prediction at age of 24 months achieved an AUROC of 0.834, with 96.4% specificity and 20.5% positive predictive value at 40% sensitivity, representing a promising improvement over the existing screening tool in practice. CONCLUSIONS: Our study demonstrates the feasibility of using machine learning models and health claims data to identify children with ASD at a very young age. It is deemed a promising approach for monitoring ASD risk in the general children population and early detection of high-risk children for targeted screening. |
format | Online Article Text |
id | pubmed-9462117 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BMJ Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-9462117 2022-09-14 Early detection of autism spectrum disorder in young children with machine learning using medical claims data Chen, Yu-Hsin Chen, Qiushi Kong, Lan Liu, Guodong BMJ Health Care Inform Original Research OBJECTIVES: Early diagnosis and intervention are keys for improving long-term outcomes of children with autism spectrum disorder (ASD). However, existing screening tools have shown insufficient accuracy. Our objective is to predict the risk of ASD in young children between 18 months and 30 months based on their medical histories using real-world health claims data. METHODS: Using the MarketScan Health Claims Database 2005–2016, we identified 12 743 children with ASD and a random sample of 25 833 children without ASD as our study cohort. We developed logistic regression (LR) with least absolute shrinkage and selection operator and random forest (RF) models for predicting ASD diagnosis at ages of 18–30 months, using demographics, medical diagnoses and healthcare service procedures extracted from individual’s medical claims during early years postbirth as predictor variables. RESULTS: For predicting ASD diagnosis at age of 24 months, the LR and RF models achieved the area under the receiver operating characteristic curve (AUROC) of 0.758 and 0.775, respectively. Prediction accuracy further increased with age. With predictor variables separated by outpatient and inpatient visits, the RF model for prediction at age of 24 months achieved an AUROC of 0.834, with 96.4% specificity and 20.5% positive predictive value at 40% sensitivity, representing a promising improvement over the existing screening tool in practice. CONCLUSIONS: Our study demonstrates the feasibility of using machine learning models and health claims data to identify children with ASD at a very young age. It is deemed a promising approach for monitoring ASD risk in the general children population and early detection of high-risk children for targeted screening. 
BMJ Publishing Group 2022-09-08 /pmc/articles/PMC9462117/ http://dx.doi.org/10.1136/bmjhci-2022-100544 Text en © Author(s) (or their employer(s)) 2022. Re-use permitted under CC BY-NC. No commercial re-use. See rights and permissions. Published by BMJ. https://creativecommons.org/licenses/by-nc/4.0/ This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/). |
spellingShingle | Original Research Chen, Yu-Hsin Chen, Qiushi Kong, Lan Liu, Guodong Early detection of autism spectrum disorder in young children with machine learning using medical claims data |
title | Early detection of autism spectrum disorder in young children with machine learning using medical claims data |
title_full | Early detection of autism spectrum disorder in young children with machine learning using medical claims data |
title_fullStr | Early detection of autism spectrum disorder in young children with machine learning using medical claims data |
title_full_unstemmed | Early detection of autism spectrum disorder in young children with machine learning using medical claims data |
title_short | Early detection of autism spectrum disorder in young children with machine learning using medical claims data |
title_sort | early detection of autism spectrum disorder in young children with machine learning using medical claims data |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9462117/ http://dx.doi.org/10.1136/bmjhci-2022-100544 |
work_keys_str_mv | AT chenyuhsin earlydetectionofautismspectrumdisorderinyoungchildrenwithmachinelearningusingmedicalclaimsdata AT chenqiushi earlydetectionofautismspectrumdisorderinyoungchildrenwithmachinelearningusingmedicalclaimsdata AT konglan earlydetectionofautismspectrumdisorderinyoungchildrenwithmachinelearningusingmedicalclaimsdata AT liuguodong earlydetectionofautismspectrumdisorderinyoungchildrenwithmachinelearningusingmedicalclaimsdata |
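
The description field reports 96.4% specificity and 20.5% positive predictive value (PPV) at 40% sensitivity. PPV depends on how common ASD is in the screened population, and the abstract does not state the prevalence that was used. The following is a minimal sketch, not the authors' code, of the standard Bayes relationship between these quantities; the function names `ppv` and `implied_prevalence` are hypothetical and chosen only for illustration.

```python
def ppv(sensitivity: float, specificity: float, prevalence: float) -> float:
    """Positive predictive value from Bayes' rule.

    PPV = sens * prev / (sens * prev + (1 - spec) * (1 - prev))
    """
    true_pos = sensitivity * prevalence
    false_pos = (1.0 - specificity) * (1.0 - prevalence)
    return true_pos / (true_pos + false_pos)


def implied_prevalence(sensitivity: float, specificity: float, ppv_value: float) -> float:
    """Invert the PPV formula to get the prevalence it implies.

    Rearranging Bayes' rule gives the prevalence odds:
    prev / (1 - prev) = PPV * (1 - spec) / (sens * (1 - PPV))
    """
    odds = ppv_value * (1.0 - specificity) / (sensitivity * (1.0 - ppv_value))
    return odds / (1.0 + odds)


if __name__ == "__main__":
    # Figures quoted in the abstract for the RF model at age 24 months.
    sens, spec, reported_ppv = 0.40, 0.964, 0.205

    # Assumption: the prevalence is NOT given in the abstract; this is only
    # the value that makes the three reported numbers mutually consistent.
    prev = implied_prevalence(sens, spec, reported_ppv)
    print(f"implied prevalence: {prev:.4f}")
    print(f"round-trip PPV:     {ppv(sens, spec, prev):.3f}")
```

Under this formula, the three reported figures are consistent with a prevalence of roughly 2.3%. That is an inference from the arithmetic alone, not a value stated in the record.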