Cargando…
Interpretable Machine Learning Reveals Dissimilarities Between Subtypes of Autism Spectrum Disorder
Autism spectrum disorder (ASD) is a heterogeneous neuropsychiatric disorder with a complex genetic background. Analysis of altered molecular processes in ASD patients requires linear and nonlinear methods that provide interpretable solutions. Interpretable machine learning provides legible models th...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7946989/ https://www.ncbi.nlm.nih.gov/pubmed/33719335 http://dx.doi.org/10.3389/fgene.2021.618277 |
_version_ | 1783663149568753664 |
---|---|
author | Garbulowski, Mateusz Smolinska, Karolina Diamanti, Klev Pan, Gang Maqbool, Khurram Feuk, Lars Komorowski, Jan |
author_facet | Garbulowski, Mateusz Smolinska, Karolina Diamanti, Klev Pan, Gang Maqbool, Khurram Feuk, Lars Komorowski, Jan |
author_sort | Garbulowski, Mateusz |
collection | PubMed |
description | Autism spectrum disorder (ASD) is a heterogeneous neuropsychiatric disorder with a complex genetic background. Analysis of altered molecular processes in ASD patients requires linear and nonlinear methods that provide interpretable solutions. Interpretable machine learning provides legible models that allow explaining biological mechanisms and support analysis of clinical subgroups. In this work, we investigated several case-control studies of gene expression measurements of ASD individuals. We constructed a rule-based learning model from three independent datasets that we further visualized as a nonlinear gene-gene co-predictive network. To find dissimilarities between ASD subtypes, we scrutinized a topological structure of the network and estimated a centrality distance. Our analysis revealed that autism is the most severe subtype of ASD, while pervasive developmental disorder-not otherwise specified and Asperger syndrome are closely related and milder ASD subtypes. Furthermore, we analyzed the most important ASD-related features that were described in terms of gene co-predictors. Among others, we found a strong co-predictive mechanism between EMC4 and TMEM30A, which may suggest a co-regulation between these genes. The present study demonstrates the potential of applying interpretable machine learning in bioinformatics analyses. Although the proposed methodology was designed for transcriptomics data, it can be applied to other omics disciplines. |
format | Online Article Text |
id | pubmed-7946989 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-79469892021-03-12 Interpretable Machine Learning Reveals Dissimilarities Between Subtypes of Autism Spectrum Disorder Garbulowski, Mateusz Smolinska, Karolina Diamanti, Klev Pan, Gang Maqbool, Khurram Feuk, Lars Komorowski, Jan Front Genet Genetics Autism spectrum disorder (ASD) is a heterogeneous neuropsychiatric disorder with a complex genetic background. Analysis of altered molecular processes in ASD patients requires linear and nonlinear methods that provide interpretable solutions. Interpretable machine learning provides legible models that allow explaining biological mechanisms and support analysis of clinical subgroups. In this work, we investigated several case-control studies of gene expression measurements of ASD individuals. We constructed a rule-based learning model from three independent datasets that we further visualized as a nonlinear gene-gene co-predictive network. To find dissimilarities between ASD subtypes, we scrutinized a topological structure of the network and estimated a centrality distance. Our analysis revealed that autism is the most severe subtype of ASD, while pervasive developmental disorder-not otherwise specified and Asperger syndrome are closely related and milder ASD subtypes. Furthermore, we analyzed the most important ASD-related features that were described in terms of gene co-predictors. Among others, we found a strong co-predictive mechanism between EMC4 and TMEM30A, which may suggest a co-regulation between these genes. The present study demonstrates the potential of applying interpretable machine learning in bioinformatics analyses. Although the proposed methodology was designed for transcriptomics data, it can be applied to other omics disciplines. Frontiers Media S.A. 2021-02-25 /pmc/articles/PMC7946989/ /pubmed/33719335 http://dx.doi.org/10.3389/fgene.2021.618277 Text en Copyright © 2021 Garbulowski, Smolinska, Diamanti, Pan, Maqbool, Feuk and Komorowski. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Garbulowski, Mateusz Smolinska, Karolina Diamanti, Klev Pan, Gang Maqbool, Khurram Feuk, Lars Komorowski, Jan Interpretable Machine Learning Reveals Dissimilarities Between Subtypes of Autism Spectrum Disorder |
title | Interpretable Machine Learning Reveals Dissimilarities Between Subtypes of Autism Spectrum Disorder |
title_full | Interpretable Machine Learning Reveals Dissimilarities Between Subtypes of Autism Spectrum Disorder |
title_fullStr | Interpretable Machine Learning Reveals Dissimilarities Between Subtypes of Autism Spectrum Disorder |
title_full_unstemmed | Interpretable Machine Learning Reveals Dissimilarities Between Subtypes of Autism Spectrum Disorder |
title_short | Interpretable Machine Learning Reveals Dissimilarities Between Subtypes of Autism Spectrum Disorder |
title_sort | interpretable machine learning reveals dissimilarities between subtypes of autism spectrum disorder |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7946989/ https://www.ncbi.nlm.nih.gov/pubmed/33719335 http://dx.doi.org/10.3389/fgene.2021.618277 |
work_keys_str_mv | AT garbulowskimateusz interpretablemachinelearningrevealsdissimilaritiesbetweensubtypesofautismspectrumdisorder AT smolinskakarolina interpretablemachinelearningrevealsdissimilaritiesbetweensubtypesofautismspectrumdisorder AT diamantiklev interpretablemachinelearningrevealsdissimilaritiesbetweensubtypesofautismspectrumdisorder AT pangang interpretablemachinelearningrevealsdissimilaritiesbetweensubtypesofautismspectrumdisorder AT maqboolkhurram interpretablemachinelearningrevealsdissimilaritiesbetweensubtypesofautismspectrumdisorder AT feuklars interpretablemachinelearningrevealsdissimilaritiesbetweensubtypesofautismspectrumdisorder AT komorowskijan interpretablemachinelearningrevealsdissimilaritiesbetweensubtypesofautismspectrumdisorder |