Cargando…

Using Machine Learning to Explore Shared Genetic Pathways and Possible Endophenotypes in Autism Spectrum Disorder

Autism spectrum disorder (ASD) is a heterogeneous condition, characterized by complex genetic architectures and intertwined genetic/environmental interactions. Novel analysis approaches to disentangle its pathophysiology by computing large amounts of data are needed. We present an advanced machine l...

Descripción completa

Detalles Bibliográficos
Autores principales: Di Giovanni, Daniele, Enea, Roberto, Di Micco, Valentina, Benvenuto, Arianna, Curatolo, Paolo, Emberti Gialloreti, Leonardo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9956345/
https://www.ncbi.nlm.nih.gov/pubmed/36833240
http://dx.doi.org/10.3390/genes14020313
_version_ 1784894569755705344
author Di Giovanni, Daniele
Enea, Roberto
Di Micco, Valentina
Benvenuto, Arianna
Curatolo, Paolo
Emberti Gialloreti, Leonardo
author_facet Di Giovanni, Daniele
Enea, Roberto
Di Micco, Valentina
Benvenuto, Arianna
Curatolo, Paolo
Emberti Gialloreti, Leonardo
author_sort Di Giovanni, Daniele
collection PubMed
description Autism spectrum disorder (ASD) is a heterogeneous condition, characterized by complex genetic architectures and intertwined genetic/environmental interactions. Novel analysis approaches to disentangle its pathophysiology by computing large amounts of data are needed. We present an advanced machine learning technique, based on a clustering analysis on genotypical/phenotypical embedding spaces, to identify biological processes that might act as pathophysiological substrates for ASD. This technique was applied to the VariCarta database, which contained 187,794 variant events retrieved from 15,189 individuals with ASD. Nine clusters of ASD-related genes were identified. The 3 largest clusters included 68.6% of all individuals, consisting of 1455 (38.0%), 841 (21.9%), and 336 (8.7%) persons, respectively. Enrichment analysis was applied to isolate clinically relevant ASD-associated biological processes. Two of the identified clusters were characterized by individuals with an increased presence of variants linked to biological processes and cellular components, such as axon growth and guidance, synaptic membrane components, or transmission. The study also suggested other clusters with possible genotype–phenotype associations. Innovative methodologies, including machine learning, can improve our understanding of the underlying biological processes and gene variant networks that undergo the etiology and pathogenic mechanisms of ASD. Future work to ascertain the reproducibility of the presented methodology is warranted.
format Online
Article
Text
id pubmed-9956345
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-99563452023-02-25 Using Machine Learning to Explore Shared Genetic Pathways and Possible Endophenotypes in Autism Spectrum Disorder Di Giovanni, Daniele Enea, Roberto Di Micco, Valentina Benvenuto, Arianna Curatolo, Paolo Emberti Gialloreti, Leonardo Genes (Basel) Article Autism spectrum disorder (ASD) is a heterogeneous condition, characterized by complex genetic architectures and intertwined genetic/environmental interactions. Novel analysis approaches to disentangle its pathophysiology by computing large amounts of data are needed. We present an advanced machine learning technique, based on a clustering analysis on genotypical/phenotypical embedding spaces, to identify biological processes that might act as pathophysiological substrates for ASD. This technique was applied to the VariCarta database, which contained 187,794 variant events retrieved from 15,189 individuals with ASD. Nine clusters of ASD-related genes were identified. The 3 largest clusters included 68.6% of all individuals, consisting of 1455 (38.0%), 841 (21.9%), and 336 (8.7%) persons, respectively. Enrichment analysis was applied to isolate clinically relevant ASD-associated biological processes. Two of the identified clusters were characterized by individuals with an increased presence of variants linked to biological processes and cellular components, such as axon growth and guidance, synaptic membrane components, or transmission. The study also suggested other clusters with possible genotype–phenotype associations. Innovative methodologies, including machine learning, can improve our understanding of the underlying biological processes and gene variant networks that undergo the etiology and pathogenic mechanisms of ASD. Future work to ascertain the reproducibility of the presented methodology is warranted. MDPI 2023-01-25 /pmc/articles/PMC9956345/ /pubmed/36833240 http://dx.doi.org/10.3390/genes14020313 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Di Giovanni, Daniele
Enea, Roberto
Di Micco, Valentina
Benvenuto, Arianna
Curatolo, Paolo
Emberti Gialloreti, Leonardo
Using Machine Learning to Explore Shared Genetic Pathways and Possible Endophenotypes in Autism Spectrum Disorder
title Using Machine Learning to Explore Shared Genetic Pathways and Possible Endophenotypes in Autism Spectrum Disorder
title_full Using Machine Learning to Explore Shared Genetic Pathways and Possible Endophenotypes in Autism Spectrum Disorder
title_fullStr Using Machine Learning to Explore Shared Genetic Pathways and Possible Endophenotypes in Autism Spectrum Disorder
title_full_unstemmed Using Machine Learning to Explore Shared Genetic Pathways and Possible Endophenotypes in Autism Spectrum Disorder
title_short Using Machine Learning to Explore Shared Genetic Pathways and Possible Endophenotypes in Autism Spectrum Disorder
title_sort using machine learning to explore shared genetic pathways and possible endophenotypes in autism spectrum disorder
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9956345/
https://www.ncbi.nlm.nih.gov/pubmed/36833240
http://dx.doi.org/10.3390/genes14020313
work_keys_str_mv AT digiovannidaniele usingmachinelearningtoexploresharedgeneticpathwaysandpossibleendophenotypesinautismspectrumdisorder
AT enearoberto usingmachinelearningtoexploresharedgeneticpathwaysandpossibleendophenotypesinautismspectrumdisorder
AT dimiccovalentina usingmachinelearningtoexploresharedgeneticpathwaysandpossibleendophenotypesinautismspectrumdisorder
AT benvenutoarianna usingmachinelearningtoexploresharedgeneticpathwaysandpossibleendophenotypesinautismspectrumdisorder
AT curatolopaolo usingmachinelearningtoexploresharedgeneticpathwaysandpossibleendophenotypesinautismspectrumdisorder
AT embertigialloretileonardo usingmachinelearningtoexploresharedgeneticpathwaysandpossibleendophenotypesinautismspectrumdisorder