A predictive ensemble classifier for the gene expression diagnosis of ASD at ages 1 to 4 years
Autism Spectrum Disorder (ASD) diagnosis remains behavior-based and the median age of diagnosis is ~52 months, nearly 5 years after its first-trimester origin. Accurate and clinically-translatable early-age diagnostics do not exist due to ASD genetic and clinical heterogeneity. Here we collected cli...
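Main Authors: | Bao, Bokan; Zahiri, Javad; Gazestani, Vahid H.; Lopez, Linda; Xiao, Yaqiong; Kim, Raphael; Wen, Teresa H.; Chiang, Austin W. T.; Nalabolu, Srinivasa; Pierce, Karen; Robasky, Kimberly; Wang, Tianyun; Hoekzema, Kendra; Eichler, Evan E.; Lewis, Nathan E.; Courchesne, Eric |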
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Nature Publishing Group UK 2022 |
Subjects: | |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9908553/ https://www.ncbi.nlm.nih.gov/pubmed/36266569 http://dx.doi.org/10.1038/s41380-022-01826-x |
_version_ | 1784884387143221248 |
---|---|
author | Bao, Bokan; Zahiri, Javad; Gazestani, Vahid H.; Lopez, Linda; Xiao, Yaqiong; Kim, Raphael; Wen, Teresa H.; Chiang, Austin W. T.; Nalabolu, Srinivasa; Pierce, Karen; Robasky, Kimberly; Wang, Tianyun; Hoekzema, Kendra; Eichler, Evan E.; Lewis, Nathan E.; Courchesne, Eric |
author_facet | Bao, Bokan; Zahiri, Javad; Gazestani, Vahid H.; Lopez, Linda; Xiao, Yaqiong; Kim, Raphael; Wen, Teresa H.; Chiang, Austin W. T.; Nalabolu, Srinivasa; Pierce, Karen; Robasky, Kimberly; Wang, Tianyun; Hoekzema, Kendra; Eichler, Evan E.; Lewis, Nathan E.; Courchesne, Eric |
author_sort | Bao, Bokan |
collection | PubMed |
description | Autism Spectrum Disorder (ASD) diagnosis remains behavior-based and the median age of diagnosis is ~52 months, nearly 5 years after its first-trimester origin. Accurate and clinically-translatable early-age diagnostics do not exist due to ASD genetic and clinical heterogeneity. Here we collected clinical, diagnostic, and leukocyte RNA data from 240 ASD and typically developing (TD) toddlers (175 toddlers for training and 65 for test). To identify gene expression ASD diagnostic classifiers, we developed 42,840 models composed of 3570 gene expression feature selection sets and 12 classification methods. We found that 742 models had AUC-ROC ≥ 0.8 on both Training and Test sets. Weighted Bayesian model averaging of these 742 models yielded an ensemble classifier model with accurate performance in Training and Test gene expression datasets with ASD diagnostic classification AUC-ROC scores of 85–89% and AUC-PR scores of 84–92%. ASD toddlers with ensemble scores above and below the overall ASD ensemble mean of 0.723 (on a scale of 0 to 1) had similar diagnostic and psychometric scores, but those below this ASD ensemble mean had more prenatal risk events than TD toddlers. Ensemble model feature genes were involved in cell cycle, inflammation/immune response, transcriptional gene regulation, cytokine response, and PI3K-AKT, RAS and Wnt signaling pathways. We additionally collected targeted DNA sequencing smMIPs data on a subset of ASD risk genes from 217 of the 240 ASD and TD toddlers. This DNA sequencing found about the same percentage of SFARI Level 1 and 2 ASD risk gene mutations in TD (12 of 105) as in ASD (13 of 112) toddlers, and classification based only on the presence of mutation in these risk genes performed at a chance level of 49%. By contrast, the leukocyte ensemble gene expression classifier correctly diagnostically classified 88% of TD and ASD toddlers with ASD risk gene mutations. Our ensemble ASD gene expression classifier is diagnostically predictive and replicable across different toddler ages, races, and ethnicities; out-performs a risk gene mutation classifier; and has potential for clinical translation. |
format | Online Article Text |
id | pubmed-9908553 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-9908553 2023-02-10 Mol Psychiatry Article Nature Publishing Group UK 2022-10-20 2023 /pmc/articles/PMC9908553/ /pubmed/36266569 http://dx.doi.org/10.1038/s41380-022-01826-x Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/ Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
title | A predictive ensemble classifier for the gene expression diagnosis of ASD at ages 1 to 4 years |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9908553/ https://www.ncbi.nlm.nih.gov/pubmed/36266569 http://dx.doi.org/10.1038/s41380-022-01826-x |
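The figures in the description field above can be checked with a short Python sketch. It covers the size of the model grid, the AUC-ROC filter applied to both Training and Test sets, a weighted average over the surviving models, and one reading of the mutation-only baseline that reproduces roughly 49% from the reported counts. This is illustrative only: the function names, the toy model records, and the weighting rule (weight = mean of training and test AUC-ROC) are assumptions. The record does not specify the authors' weights, so the averaging here is a plain weighted mean and not their weighted Bayesian model averaging.

```python
AUC_THRESHOLD = 0.8  # from the abstract: AUC-ROC >= 0.8 on both sets


def count_candidate_models(n_feature_sets, n_classifiers):
    """Every feature selection set is paired with every classification method."""
    return n_feature_sets * n_classifiers


def select_models(models, threshold=AUC_THRESHOLD):
    """Keep models whose AUC-ROC meets the threshold on BOTH training and test."""
    return [
        m for m in models
        if m["auc_train"] >= threshold and m["auc_test"] >= threshold
    ]


def ensemble_score(selected, scores_by_model):
    """Weighted mean of per-model scores for one sample.

    ASSUMPTION: each weight is the mean of training and test AUC-ROC.
    This stands in for the authors' Bayesian weights, which are not
    given in this record.
    """
    weights = {m["name"]: (m["auc_train"] + m["auc_test"]) / 2 for m in selected}
    total = sum(weights.values())
    return sum(weights[name] * scores_by_model[name] for name in weights) / total


def mutation_rule_accuracy(asd_mut, asd_total, td_mut, td_total):
    """Accuracy of the rule 'risk-gene mutation present -> ASD, absent -> TD'.

    ASSUMPTION: this is one plausible reading of the mutation-only classifier.
    """
    correct = asd_mut + (td_total - td_mut)
    return correct / (asd_total + td_total)


if __name__ == "__main__":
    # Hypothetical toy records; only the thresholds and counts come from the abstract.
    toy_models = [
        {"name": "m1", "auc_train": 0.90, "auc_test": 0.85},
        {"name": "m2", "auc_train": 0.82, "auc_test": 0.79},  # fails on test
        {"name": "m3", "auc_train": 0.81, "auc_test": 0.80},
    ]
    kept = select_models(toy_models)
    print(count_candidate_models(3570, 12))           # model grid size
    print([m["name"] for m in kept])                  # models passing both filters
    print(ensemble_score(kept, {"m1": 0.9, "m3": 0.6}))
    print(round(mutation_rule_accuracy(13, 112, 12, 105), 2))
```

Under these assumptions the grid size comes out to the 42,840 models reported, and the mutation rule scores 106 of 217 toddlers correctly, which is consistent with the reported chance-level performance of 49%.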