
A predictive ensemble classifier for the gene expression diagnosis of ASD at ages 1 to 4 years

Autism Spectrum Disorder (ASD) diagnosis remains behavior-based and the median age of diagnosis is ~52 months, nearly 5 years after its first-trimester origin. Accurate and clinically-translatable early-age diagnostics do not exist due to ASD genetic and clinical heterogeneity. Here we collected cli...


Bibliographic Details
Main Authors: Bao, Bokan; Zahiri, Javad; Gazestani, Vahid H.; Lopez, Linda; Xiao, Yaqiong; Kim, Raphael; Wen, Teresa H.; Chiang, Austin W. T.; Nalabolu, Srinivasa; Pierce, Karen; Robasky, Kimberly; Wang, Tianyun; Hoekzema, Kendra; Eichler, Evan E.; Lewis, Nathan E.; Courchesne, Eric
Format: Online Article Text
Language: English
Published: Nature Publishing Group UK 2022
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9908553/
https://www.ncbi.nlm.nih.gov/pubmed/36266569
http://dx.doi.org/10.1038/s41380-022-01826-x
author Bao, Bokan
Zahiri, Javad
Gazestani, Vahid H.
Lopez, Linda
Xiao, Yaqiong
Kim, Raphael
Wen, Teresa H.
Chiang, Austin W. T.
Nalabolu, Srinivasa
Pierce, Karen
Robasky, Kimberly
Wang, Tianyun
Hoekzema, Kendra
Eichler, Evan E.
Lewis, Nathan E.
Courchesne, Eric
author_sort Bao, Bokan
collection PubMed
description Autism Spectrum Disorder (ASD) diagnosis remains behavior-based and the median age of diagnosis is ~52 months, nearly 5 years after its first-trimester origin. Accurate and clinically-translatable early-age diagnostics do not exist due to ASD genetic and clinical heterogeneity. Here we collected clinical, diagnostic, and leukocyte RNA data from 240 ASD and typically developing (TD) toddlers (175 toddlers for training and 65 for test). To identify gene expression ASD diagnostic classifiers, we developed 42,840 models composed of 3570 gene expression feature selection sets and 12 classification methods. We found that 742 models had AUC-ROC ≥ 0.8 on both Training and Test sets. Weighted Bayesian model averaging of these 742 models yielded an ensemble classifier model with accurate performance in Training and Test gene expression datasets with ASD diagnostic classification AUC-ROC scores of 85–89% and AUC-PR scores of 84–92%. ASD toddlers with ensemble scores above and below the overall ASD ensemble mean of 0.723 (on a scale of 0 to 1) had similar diagnostic and psychometric scores, but those below this ASD ensemble mean had more prenatal risk events than TD toddlers. Ensemble model feature genes were involved in cell cycle, inflammation/immune response, transcriptional gene regulation, cytokine response, and PI3K-AKT, RAS and Wnt signaling pathways. We additionally collected targeted DNA sequencing smMIPs data on a subset of ASD risk genes from 217 of the 240 ASD and TD toddlers. This DNA sequencing found about the same percentage of SFARI Level 1 and 2 ASD risk gene mutations in TD (12 of 105) as in ASD (13 of 112) toddlers, and classification based only on the presence of mutation in these risk genes performed at a chance level of 49%. By contrast, the leukocyte ensemble gene expression classifier correctly diagnostically classified 88% of TD and ASD toddlers with ASD risk gene mutations. 
Our ensemble ASD gene expression classifier is diagnostically predictive and replicable across different toddler ages, races, and ethnicities; out-performs a risk gene mutation classifier; and has potential for clinical translation.
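The counts and the model-selection step in the description can be checked with a short sketch. The arithmetic uses only figures quoted in the abstract. The ensemble part is an illustration under stated assumptions: the abstract names weighted Bayesian model averaging but does not give the weighting scheme, so the sketch substitutes a plain weighted average with each retained model weighted by its training AUC-ROC. The function names and the toy scores are hypothetical and are not taken from the article.

```python
from itertools import product

# Figures quoted in the abstract.
N_FEATURE_SETS = 3570
N_CLASSIFIERS = 12
N_MODELS = N_FEATURE_SETS * N_CLASSIFIERS  # 42,840 candidate models

# Risk-gene mutation carriers among the 217 sequenced toddlers.
td_rate = 12 / 105   # typically developing
asd_rate = 13 / 112  # ASD


def auc_roc(labels, scores):
    """Rank-based AUC-ROC: chance that a random positive outscores
    a random negative, counting ties as one half."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p, n in product(pos, neg))
    return wins / (len(pos) * len(neg))


def select_models(train_y, train_scores, test_y, test_scores, threshold=0.8):
    """Keep models whose AUC-ROC is >= threshold on both sets.
    Returns (kept indices, weights). Weight = training AUC-ROC,
    which is an assumption standing in for the Bayesian weights."""
    kept, weights = [], []
    for i, (tr, te) in enumerate(zip(train_scores, test_scores)):
        auc_tr = auc_roc(train_y, tr)
        if auc_tr >= threshold and auc_roc(test_y, te) >= threshold:
            kept.append(i)
            weights.append(auc_tr)
    return kept, weights


def weighted_average(model_scores, weights):
    """Per-sample weighted mean of the scores from several models."""
    total = sum(weights)
    n_samples = len(model_scores[0])
    return [sum(w * m[i] for w, m in zip(weights, model_scores)) / total
            for i in range(n_samples)]


# Toy data (hypothetical): 3 ASD (1) and 3 TD (0) samples, 3 models.
toy_y = [1, 1, 1, 0, 0, 0]
toy_scores = [
    [0.9, 0.8, 0.7, 0.3, 0.2, 0.1],  # separates the classes perfectly
    [0.9, 0.4, 0.6, 0.5, 0.2, 0.1],  # one misordered pair
    [0.2, 0.9, 0.1, 0.8, 0.3, 0.7],  # worse than chance
]
# The same toy scores stand in for both training and test sets.
kept, weights = select_models(toy_y, toy_scores, toy_y, toy_scores)
ensemble = weighted_average([toy_scores[i] for i in kept], weights)

if __name__ == "__main__":
    print(N_MODELS, round(td_rate, 3), round(asd_rate, 3))
    print(kept, round(auc_roc(toy_y, ensemble), 3))
```

The two mutation rates come out at roughly 11.4% and 11.6%, which matches the statement that risk gene mutations are about equally frequent in TD and ASD toddlers. A real analysis would more likely use an established routine such as scikit-learn's `roc_auc_score` than the hand-written pair count used here.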
format Online
Article
Text
id pubmed-9908553
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-9908553 2023-02-10 A predictive ensemble classifier for the gene expression diagnosis of ASD at ages 1 to 4 years Mol Psychiatry Article Nature Publishing Group UK 2022-10-20 2023 /pmc/articles/PMC9908553/ /pubmed/36266569 http://dx.doi.org/10.1038/s41380-022-01826-x Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/ Open Access. This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
title A predictive ensemble classifier for the gene expression diagnosis of ASD at ages 1 to 4 years
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9908553/
https://www.ncbi.nlm.nih.gov/pubmed/36266569
http://dx.doi.org/10.1038/s41380-022-01826-x