Cargando…
Using Machine Learning to Explore Shared Genetic Pathways and Possible Endophenotypes in Autism Spectrum Disorder
Autism spectrum disorder (ASD) is a heterogeneous condition, characterized by complex genetic architectures and intertwined genetic/environmental interactions. Novel analysis approaches to disentangle its pathophysiology by computing large amounts of data are needed. We present an advanced machine l...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9956345/ https://www.ncbi.nlm.nih.gov/pubmed/36833240 http://dx.doi.org/10.3390/genes14020313 |
_version_ | 1784894569755705344 |
---|---|
author | Di Giovanni, Daniele Enea, Roberto Di Micco, Valentina Benvenuto, Arianna Curatolo, Paolo Emberti Gialloreti, Leonardo |
author_facet | Di Giovanni, Daniele Enea, Roberto Di Micco, Valentina Benvenuto, Arianna Curatolo, Paolo Emberti Gialloreti, Leonardo |
author_sort | Di Giovanni, Daniele |
collection | PubMed |
description | Autism spectrum disorder (ASD) is a heterogeneous condition, characterized by complex genetic architectures and intertwined genetic/environmental interactions. Novel analysis approaches to disentangle its pathophysiology by computing large amounts of data are needed. We present an advanced machine learning technique, based on a clustering analysis on genotypical/phenotypical embedding spaces, to identify biological processes that might act as pathophysiological substrates for ASD. This technique was applied to the VariCarta database, which contained 187,794 variant events retrieved from 15,189 individuals with ASD. Nine clusters of ASD-related genes were identified. The 3 largest clusters included 68.6% of all individuals, consisting of 1455 (38.0%), 841 (21.9%), and 336 (8.7%) persons, respectively. Enrichment analysis was applied to isolate clinically relevant ASD-associated biological processes. Two of the identified clusters were characterized by individuals with an increased presence of variants linked to biological processes and cellular components, such as axon growth and guidance, synaptic membrane components, or transmission. The study also suggested other clusters with possible genotype–phenotype associations. Innovative methodologies, including machine learning, can improve our understanding of the underlying biological processes and gene variant networks that undergo the etiology and pathogenic mechanisms of ASD. Future work to ascertain the reproducibility of the presented methodology is warranted. |
format | Online Article Text |
id | pubmed-9956345 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-99563452023-02-25 Using Machine Learning to Explore Shared Genetic Pathways and Possible Endophenotypes in Autism Spectrum Disorder Di Giovanni, Daniele Enea, Roberto Di Micco, Valentina Benvenuto, Arianna Curatolo, Paolo Emberti Gialloreti, Leonardo Genes (Basel) Article Autism spectrum disorder (ASD) is a heterogeneous condition, characterized by complex genetic architectures and intertwined genetic/environmental interactions. Novel analysis approaches to disentangle its pathophysiology by computing large amounts of data are needed. We present an advanced machine learning technique, based on a clustering analysis on genotypical/phenotypical embedding spaces, to identify biological processes that might act as pathophysiological substrates for ASD. This technique was applied to the VariCarta database, which contained 187,794 variant events retrieved from 15,189 individuals with ASD. Nine clusters of ASD-related genes were identified. The 3 largest clusters included 68.6% of all individuals, consisting of 1455 (38.0%), 841 (21.9%), and 336 (8.7%) persons, respectively. Enrichment analysis was applied to isolate clinically relevant ASD-associated biological processes. Two of the identified clusters were characterized by individuals with an increased presence of variants linked to biological processes and cellular components, such as axon growth and guidance, synaptic membrane components, or transmission. The study also suggested other clusters with possible genotype–phenotype associations. Innovative methodologies, including machine learning, can improve our understanding of the underlying biological processes and gene variant networks that undergo the etiology and pathogenic mechanisms of ASD. Future work to ascertain the reproducibility of the presented methodology is warranted. MDPI 2023-01-25 /pmc/articles/PMC9956345/ /pubmed/36833240 http://dx.doi.org/10.3390/genes14020313 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Di Giovanni, Daniele Enea, Roberto Di Micco, Valentina Benvenuto, Arianna Curatolo, Paolo Emberti Gialloreti, Leonardo Using Machine Learning to Explore Shared Genetic Pathways and Possible Endophenotypes in Autism Spectrum Disorder |
title | Using Machine Learning to Explore Shared Genetic Pathways and Possible Endophenotypes in Autism Spectrum Disorder |
title_full | Using Machine Learning to Explore Shared Genetic Pathways and Possible Endophenotypes in Autism Spectrum Disorder |
title_fullStr | Using Machine Learning to Explore Shared Genetic Pathways and Possible Endophenotypes in Autism Spectrum Disorder |
title_full_unstemmed | Using Machine Learning to Explore Shared Genetic Pathways and Possible Endophenotypes in Autism Spectrum Disorder |
title_short | Using Machine Learning to Explore Shared Genetic Pathways and Possible Endophenotypes in Autism Spectrum Disorder |
title_sort | using machine learning to explore shared genetic pathways and possible endophenotypes in autism spectrum disorder |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9956345/ https://www.ncbi.nlm.nih.gov/pubmed/36833240 http://dx.doi.org/10.3390/genes14020313 |
work_keys_str_mv | AT digiovannidaniele usingmachinelearningtoexploresharedgeneticpathwaysandpossibleendophenotypesinautismspectrumdisorder AT enearoberto usingmachinelearningtoexploresharedgeneticpathwaysandpossibleendophenotypesinautismspectrumdisorder AT dimiccovalentina usingmachinelearningtoexploresharedgeneticpathwaysandpossibleendophenotypesinautismspectrumdisorder AT benvenutoarianna usingmachinelearningtoexploresharedgeneticpathwaysandpossibleendophenotypesinautismspectrumdisorder AT curatolopaolo usingmachinelearningtoexploresharedgeneticpathwaysandpossibleendophenotypesinautismspectrumdisorder AT embertigialloretileonardo usingmachinelearningtoexploresharedgeneticpathwaysandpossibleendophenotypesinautismspectrumdisorder |