Cargando…

Predicting childhood and adolescent attention-deficit/hyperactivity disorder onset: a nationwide deep learning approach

Attention-deficit/hyperactivity disorder (ADHD) is a heterogeneous disorder with a high degree of psychiatric and physical comorbidity, which complicates its diagnosis in childhood and adolescence. We analyzed registry data from 238,696 persons born and living in Sweden between 1995 and 1999. Severa...

Descripción completa

Detalles Bibliográficos
Autores principales: Garcia-Argibay, Miguel, Zhang-James, Yanli, Cortese, Samuele, Lichtenstein, Paul, Larsson, Henrik, Faraone, Stephen V.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10005952/
https://www.ncbi.nlm.nih.gov/pubmed/36536075
http://dx.doi.org/10.1038/s41380-022-01918-8
_version_ 1784905203239092224
author Garcia-Argibay, Miguel
Zhang-James, Yanli
Cortese, Samuele
Lichtenstein, Paul
Larsson, Henrik
Faraone, Stephen V.
author_facet Garcia-Argibay, Miguel
Zhang-James, Yanli
Cortese, Samuele
Lichtenstein, Paul
Larsson, Henrik
Faraone, Stephen V.
author_sort Garcia-Argibay, Miguel
collection PubMed
description Attention-deficit/hyperactivity disorder (ADHD) is a heterogeneous disorder with a high degree of psychiatric and physical comorbidity, which complicates its diagnosis in childhood and adolescence. We analyzed registry data from 238,696 persons born and living in Sweden between 1995 and 1999. Several machine learning techniques were used to assess the ability of registry data to inform the diagnosis of ADHD in childhood and adolescence: logistic regression, random Forest, gradient boosting, XGBoost, penalized logistic regression, deep neural network (DNN), and ensemble models. The best fitting model was the DNN, achieving an area under the receiver operating characteristic curve of 0.75, 95% CI (0.74–0.76) and balanced accuracy of 0.69. At the 0.45 probability threshold, sensitivity was 71.66% and specificity was 65.0%. There was an overall agreement in the feature importance among all models (τ > .5). The top 5 features contributing to classification were having a parent with criminal convictions, male sex, having a relative with ADHD, number of academic subjects failed, and speech/learning disabilities. A DNN model predicting childhood and adolescent ADHD trained exclusively on Swedish register data achieved good discrimination. If replicated and validated in an external sample, and proven to be cost-effective, this model could be used to alert clinicians to individuals who ought to be screened for ADHD and to aid clinicians’ decision-making with the goal of decreasing misdiagnoses. Further research is needed to validate results in different populations and to incorporate new predictors.
format Online
Article
Text
id pubmed-10005952
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-100059522023-03-12 Predicting childhood and adolescent attention-deficit/hyperactivity disorder onset: a nationwide deep learning approach Garcia-Argibay, Miguel Zhang-James, Yanli Cortese, Samuele Lichtenstein, Paul Larsson, Henrik Faraone, Stephen V. Mol Psychiatry Article Attention-deficit/hyperactivity disorder (ADHD) is a heterogeneous disorder with a high degree of psychiatric and physical comorbidity, which complicates its diagnosis in childhood and adolescence. We analyzed registry data from 238,696 persons born and living in Sweden between 1995 and 1999. Several machine learning techniques were used to assess the ability of registry data to inform the diagnosis of ADHD in childhood and adolescence: logistic regression, random Forest, gradient boosting, XGBoost, penalized logistic regression, deep neural network (DNN), and ensemble models. The best fitting model was the DNN, achieving an area under the receiver operating characteristic curve of 0.75, 95% CI (0.74–0.76) and balanced accuracy of 0.69. At the 0.45 probability threshold, sensitivity was 71.66% and specificity was 65.0%. There was an overall agreement in the feature importance among all models (τ > .5). The top 5 features contributing to classification were having a parent with criminal convictions, male sex, having a relative with ADHD, number of academic subjects failed, and speech/learning disabilities. A DNN model predicting childhood and adolescent ADHD trained exclusively on Swedish register data achieved good discrimination. If replicated and validated in an external sample, and proven to be cost-effective, this model could be used to alert clinicians to individuals who ought to be screened for ADHD and to aid clinicians’ decision-making with the goal of decreasing misdiagnoses. Further research is needed to validate results in different populations and to incorporate new predictors. Nature Publishing Group UK 2022-12-19 2023 /pmc/articles/PMC10005952/ /pubmed/36536075 http://dx.doi.org/10.1038/s41380-022-01918-8 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Garcia-Argibay, Miguel
Zhang-James, Yanli
Cortese, Samuele
Lichtenstein, Paul
Larsson, Henrik
Faraone, Stephen V.
Predicting childhood and adolescent attention-deficit/hyperactivity disorder onset: a nationwide deep learning approach
title Predicting childhood and adolescent attention-deficit/hyperactivity disorder onset: a nationwide deep learning approach
title_full Predicting childhood and adolescent attention-deficit/hyperactivity disorder onset: a nationwide deep learning approach
title_fullStr Predicting childhood and adolescent attention-deficit/hyperactivity disorder onset: a nationwide deep learning approach
title_full_unstemmed Predicting childhood and adolescent attention-deficit/hyperactivity disorder onset: a nationwide deep learning approach
title_short Predicting childhood and adolescent attention-deficit/hyperactivity disorder onset: a nationwide deep learning approach
title_sort predicting childhood and adolescent attention-deficit/hyperactivity disorder onset: a nationwide deep learning approach
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10005952/
https://www.ncbi.nlm.nih.gov/pubmed/36536075
http://dx.doi.org/10.1038/s41380-022-01918-8
work_keys_str_mv AT garciaargibaymiguel predictingchildhoodandadolescentattentiondeficithyperactivitydisorderonsetanationwidedeeplearningapproach
AT zhangjamesyanli predictingchildhoodandadolescentattentiondeficithyperactivitydisorderonsetanationwidedeeplearningapproach
AT cortesesamuele predictingchildhoodandadolescentattentiondeficithyperactivitydisorderonsetanationwidedeeplearningapproach
AT lichtensteinpaul predictingchildhoodandadolescentattentiondeficithyperactivitydisorderonsetanationwidedeeplearningapproach
AT larssonhenrik predictingchildhoodandadolescentattentiondeficithyperactivitydisorderonsetanationwidedeeplearningapproach
AT faraonestephenv predictingchildhoodandadolescentattentiondeficithyperactivitydisorderonsetanationwidedeeplearningapproach