Cargando…

DGH-GO: dissecting the genetic heterogeneity of complex diseases using gene ontology

BACKGROUND: Complex diseases such as neurodevelopmental disorders (NDDs) exhibit multiple etiologies. The multi-etiological nature of complex-diseases emerges from distinct but functionally similar group of genes. Different diseases sharing genes of such groups show related clinical outcomes that fu...

Descripción completa

Detalles Bibliográficos
Autores principales: Asif, Muhammad, Martiniano, Hugo F. M. C., Lamurias, Andre, Kausar, Samina, Couto, Francisco M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10134522/
https://www.ncbi.nlm.nih.gov/pubmed/37101154
http://dx.doi.org/10.1186/s12859-023-05290-4
_version_ 1785031781394677760
author Asif, Muhammad
Martiniano, Hugo F. M. C.
Lamurias, Andre
Kausar, Samina
Couto, Francisco M.
author_facet Asif, Muhammad
Martiniano, Hugo F. M. C.
Lamurias, Andre
Kausar, Samina
Couto, Francisco M.
author_sort Asif, Muhammad
collection PubMed
description BACKGROUND: Complex diseases such as neurodevelopmental disorders (NDDs) exhibit multiple etiologies. The multi-etiological nature of complex-diseases emerges from distinct but functionally similar group of genes. Different diseases sharing genes of such groups show related clinical outcomes that further restrict our understanding of disease mechanisms, thus, limiting the applications of personalized medicine approaches to complex genetic disorders. RESULTS: Here, we present an interactive and user-friendly application, called DGH-GO. DGH-GO allows biologists to dissect the genetic heterogeneity of complex diseases by stratifying the putative disease-causing genes into clusters that may contribute to distinct disease outcome development. It can also be used to study the shared etiology of complex-diseases. DGH-GO creates a semantic similarity matrix for the input genes by using Gene Ontology (GO). The resultant matrix can be visualized in 2D plots using different dimension reduction methods (T-SNE, Principal component analysis, umap and Principal coordinate analysis). In the next step, clusters of functionally similar genes are identified from genes functional similarities assessed through GO. This is achieved by employing four different clustering methods (K-means, Hierarchical, Fuzzy and PAM). The user may change the clustering parameters and explore their effect on stratification immediately. DGH-GO was applied to genes disrupted by rare genetic variants in Autism Spectrum Disorder (ASD) patients. The analysis confirmed the multi-etiological nature of ASD by identifying four clusters of genes that were enriched for distinct biological mechanisms and clinical outcome. In the second case study, the analysis of genes shared by different NDDs showed that genes causing multiple disorders tend to aggregate in similar clusters, indicating a possible shared etiology. CONCLUSION: DGH-GO is a user-friendly application that allows biologists to study the multi-etiological nature of complex diseases by dissecting their genetic heterogeneity. In summary, functional similarities, dimension reduction and clustering methods, coupled with interactive visualization and control over analysis allows biologists to explore and analyze their datasets without requiring expert knowledge on these methods. The source code of proposed application is available at https://github.com/Muh-Asif/DGH-GO SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-023-05290-4.
format Online
Article
Text
id pubmed-10134522
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-101345222023-04-28 DGH-GO: dissecting the genetic heterogeneity of complex diseases using gene ontology Asif, Muhammad Martiniano, Hugo F. M. C. Lamurias, Andre Kausar, Samina Couto, Francisco M. BMC Bioinformatics Software BACKGROUND: Complex diseases such as neurodevelopmental disorders (NDDs) exhibit multiple etiologies. The multi-etiological nature of complex-diseases emerges from distinct but functionally similar group of genes. Different diseases sharing genes of such groups show related clinical outcomes that further restrict our understanding of disease mechanisms, thus, limiting the applications of personalized medicine approaches to complex genetic disorders. RESULTS: Here, we present an interactive and user-friendly application, called DGH-GO. DGH-GO allows biologists to dissect the genetic heterogeneity of complex diseases by stratifying the putative disease-causing genes into clusters that may contribute to distinct disease outcome development. It can also be used to study the shared etiology of complex-diseases. DGH-GO creates a semantic similarity matrix for the input genes by using Gene Ontology (GO). The resultant matrix can be visualized in 2D plots using different dimension reduction methods (T-SNE, Principal component analysis, umap and Principal coordinate analysis). In the next step, clusters of functionally similar genes are identified from genes functional similarities assessed through GO. This is achieved by employing four different clustering methods (K-means, Hierarchical, Fuzzy and PAM). The user may change the clustering parameters and explore their effect on stratification immediately. DGH-GO was applied to genes disrupted by rare genetic variants in Autism Spectrum Disorder (ASD) patients. The analysis confirmed the multi-etiological nature of ASD by identifying four clusters of genes that were enriched for distinct biological mechanisms and clinical outcome. In the second case study, the analysis of genes shared by different NDDs showed that genes causing multiple disorders tend to aggregate in similar clusters, indicating a possible shared etiology. CONCLUSION: DGH-GO is a user-friendly application that allows biologists to study the multi-etiological nature of complex diseases by dissecting their genetic heterogeneity. In summary, functional similarities, dimension reduction and clustering methods, coupled with interactive visualization and control over analysis allows biologists to explore and analyze their datasets without requiring expert knowledge on these methods. The source code of proposed application is available at https://github.com/Muh-Asif/DGH-GO SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-023-05290-4. BioMed Central 2023-04-26 /pmc/articles/PMC10134522/ /pubmed/37101154 http://dx.doi.org/10.1186/s12859-023-05290-4 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Software
Asif, Muhammad
Martiniano, Hugo F. M. C.
Lamurias, Andre
Kausar, Samina
Couto, Francisco M.
DGH-GO: dissecting the genetic heterogeneity of complex diseases using gene ontology
title DGH-GO: dissecting the genetic heterogeneity of complex diseases using gene ontology
title_full DGH-GO: dissecting the genetic heterogeneity of complex diseases using gene ontology
title_fullStr DGH-GO: dissecting the genetic heterogeneity of complex diseases using gene ontology
title_full_unstemmed DGH-GO: dissecting the genetic heterogeneity of complex diseases using gene ontology
title_short DGH-GO: dissecting the genetic heterogeneity of complex diseases using gene ontology
title_sort dgh-go: dissecting the genetic heterogeneity of complex diseases using gene ontology
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10134522/
https://www.ncbi.nlm.nih.gov/pubmed/37101154
http://dx.doi.org/10.1186/s12859-023-05290-4
work_keys_str_mv AT asifmuhammad dghgodissectingthegeneticheterogeneityofcomplexdiseasesusinggeneontology
AT martinianohugofmc dghgodissectingthegeneticheterogeneityofcomplexdiseasesusinggeneontology
AT lamuriasandre dghgodissectingthegeneticheterogeneityofcomplexdiseasesusinggeneontology
AT kausarsamina dghgodissectingthegeneticheterogeneityofcomplexdiseasesusinggeneontology
AT coutofranciscom dghgodissectingthegeneticheterogeneityofcomplexdiseasesusinggeneontology