Cargando…

Interpretable Machine Learning Reveals Dissimilarities Between Subtypes of Autism Spectrum Disorder

Autism spectrum disorder (ASD) is a heterogeneous neuropsychiatric disorder with a complex genetic background. Analysis of altered molecular processes in ASD patients requires linear and nonlinear methods that provide interpretable solutions. Interpretable machine learning provides legible models th...

Descripción completa

Detalles Bibliográficos
Autores principales: Garbulowski, Mateusz, Smolinska, Karolina, Diamanti, Klev, Pan, Gang, Maqbool, Khurram, Feuk, Lars, Komorowski, Jan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7946989/
https://www.ncbi.nlm.nih.gov/pubmed/33719335
http://dx.doi.org/10.3389/fgene.2021.618277
_version_ 1783663149568753664
author Garbulowski, Mateusz
Smolinska, Karolina
Diamanti, Klev
Pan, Gang
Maqbool, Khurram
Feuk, Lars
Komorowski, Jan
author_facet Garbulowski, Mateusz
Smolinska, Karolina
Diamanti, Klev
Pan, Gang
Maqbool, Khurram
Feuk, Lars
Komorowski, Jan
author_sort Garbulowski, Mateusz
collection PubMed
description Autism spectrum disorder (ASD) is a heterogeneous neuropsychiatric disorder with a complex genetic background. Analysis of altered molecular processes in ASD patients requires linear and nonlinear methods that provide interpretable solutions. Interpretable machine learning provides legible models that allow explaining biological mechanisms and support analysis of clinical subgroups. In this work, we investigated several case-control studies of gene expression measurements of ASD individuals. We constructed a rule-based learning model from three independent datasets that we further visualized as a nonlinear gene-gene co-predictive network. To find dissimilarities between ASD subtypes, we scrutinized a topological structure of the network and estimated a centrality distance. Our analysis revealed that autism is the most severe subtype of ASD, while pervasive developmental disorder-not otherwise specified and Asperger syndrome are closely related and milder ASD subtypes. Furthermore, we analyzed the most important ASD-related features that were described in terms of gene co-predictors. Among others, we found a strong co-predictive mechanism between EMC4 and TMEM30A, which may suggest a co-regulation between these genes. The present study demonstrates the potential of applying interpretable machine learning in bioinformatics analyses. Although the proposed methodology was designed for transcriptomics data, it can be applied to other omics disciplines.
format Online
Article
Text
id pubmed-7946989
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-79469892021-03-12 Interpretable Machine Learning Reveals Dissimilarities Between Subtypes of Autism Spectrum Disorder Garbulowski, Mateusz Smolinska, Karolina Diamanti, Klev Pan, Gang Maqbool, Khurram Feuk, Lars Komorowski, Jan Front Genet Genetics Autism spectrum disorder (ASD) is a heterogeneous neuropsychiatric disorder with a complex genetic background. Analysis of altered molecular processes in ASD patients requires linear and nonlinear methods that provide interpretable solutions. Interpretable machine learning provides legible models that allow explaining biological mechanisms and support analysis of clinical subgroups. In this work, we investigated several case-control studies of gene expression measurements of ASD individuals. We constructed a rule-based learning model from three independent datasets that we further visualized as a nonlinear gene-gene co-predictive network. To find dissimilarities between ASD subtypes, we scrutinized a topological structure of the network and estimated a centrality distance. Our analysis revealed that autism is the most severe subtype of ASD, while pervasive developmental disorder-not otherwise specified and Asperger syndrome are closely related and milder ASD subtypes. Furthermore, we analyzed the most important ASD-related features that were described in terms of gene co-predictors. Among others, we found a strong co-predictive mechanism between EMC4 and TMEM30A, which may suggest a co-regulation between these genes. The present study demonstrates the potential of applying interpretable machine learning in bioinformatics analyses. Although the proposed methodology was designed for transcriptomics data, it can be applied to other omics disciplines. Frontiers Media S.A. 2021-02-25 /pmc/articles/PMC7946989/ /pubmed/33719335 http://dx.doi.org/10.3389/fgene.2021.618277 Text en Copyright © 2021 Garbulowski, Smolinska, Diamanti, Pan, Maqbool, Feuk and Komorowski. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Garbulowski, Mateusz
Smolinska, Karolina
Diamanti, Klev
Pan, Gang
Maqbool, Khurram
Feuk, Lars
Komorowski, Jan
Interpretable Machine Learning Reveals Dissimilarities Between Subtypes of Autism Spectrum Disorder
title Interpretable Machine Learning Reveals Dissimilarities Between Subtypes of Autism Spectrum Disorder
title_full Interpretable Machine Learning Reveals Dissimilarities Between Subtypes of Autism Spectrum Disorder
title_fullStr Interpretable Machine Learning Reveals Dissimilarities Between Subtypes of Autism Spectrum Disorder
title_full_unstemmed Interpretable Machine Learning Reveals Dissimilarities Between Subtypes of Autism Spectrum Disorder
title_short Interpretable Machine Learning Reveals Dissimilarities Between Subtypes of Autism Spectrum Disorder
title_sort interpretable machine learning reveals dissimilarities between subtypes of autism spectrum disorder
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7946989/
https://www.ncbi.nlm.nih.gov/pubmed/33719335
http://dx.doi.org/10.3389/fgene.2021.618277
work_keys_str_mv AT garbulowskimateusz interpretablemachinelearningrevealsdissimilaritiesbetweensubtypesofautismspectrumdisorder
AT smolinskakarolina interpretablemachinelearningrevealsdissimilaritiesbetweensubtypesofautismspectrumdisorder
AT diamantiklev interpretablemachinelearningrevealsdissimilaritiesbetweensubtypesofautismspectrumdisorder
AT pangang interpretablemachinelearningrevealsdissimilaritiesbetweensubtypesofautismspectrumdisorder
AT maqboolkhurram interpretablemachinelearningrevealsdissimilaritiesbetweensubtypesofautismspectrumdisorder
AT feuklars interpretablemachinelearningrevealsdissimilaritiesbetweensubtypesofautismspectrumdisorder
AT komorowskijan interpretablemachinelearningrevealsdissimilaritiesbetweensubtypesofautismspectrumdisorder