Cargando…

Relating Diseases by Integrating Gene Associations and Information Flow through Protein Interaction Network

Identifying similar diseases could potentially provide deeper understanding of their underlying causes, and may even hint at possible treatments. For this purpose, it is necessary to have a similarity measure that reflects the underpinning molecular interactions and biological pathways. We have thus...

Descripción completa

Detalles Bibliográficos
Autores principales: Hamaneh, Mehdi Bagheri, Yu, Yi-Kuo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4216010/
https://www.ncbi.nlm.nih.gov/pubmed/25360770
http://dx.doi.org/10.1371/journal.pone.0110936
_version_ 1782342192563486720
author Hamaneh, Mehdi Bagheri
Yu, Yi-Kuo
author_facet Hamaneh, Mehdi Bagheri
Yu, Yi-Kuo
author_sort Hamaneh, Mehdi Bagheri
collection PubMed
description Identifying similar diseases could potentially provide deeper understanding of their underlying causes, and may even hint at possible treatments. For this purpose, it is necessary to have a similarity measure that reflects the underpinning molecular interactions and biological pathways. We have thus devised a network-based measure that can partially fulfill this goal. Our method assigns weights to all proteins (and consequently their encoding genes) by using information flow from a disease to the protein interaction network and back. Similarity between two diseases is then defined as the cosine of the angle between their corresponding weight vectors. The proposed method also provides a way to suggest disease-pathway associations by using the weights assigned to the genes to perform enrichment analysis for each disease. By calculating pairwise similarities between 2534 diseases, we show that our disease similarity measure is strongly correlated with the probability of finding the diseases in the same disease family and, more importantly, sharing biological pathways. We have also compared our results to those of MimMiner, a text-mining method that assigns pairwise similarity scores to diseases. We find the results of the two methods to be complementary. It is also shown that clustering diseases based on their similarities and performing enrichment analysis for the cluster centers significantly increases the term association rate, suggesting that the cluster centers are better representatives for biological pathways than the diseases themselves. This lends support to the view that our similarity measure is a good indicator of relatedness of biological processes involved in causing the diseases. Although not needed for understanding this paper, the raw results are available for download for further study at ftp://ftp.ncbi.nlm.nih.gov/pub/qmbpmn/DiseaseRelations/.
format Online
Article
Text
id pubmed-4216010
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-42160102014-11-05 Relating Diseases by Integrating Gene Associations and Information Flow through Protein Interaction Network Hamaneh, Mehdi Bagheri Yu, Yi-Kuo PLoS One Research Article Identifying similar diseases could potentially provide deeper understanding of their underlying causes, and may even hint at possible treatments. For this purpose, it is necessary to have a similarity measure that reflects the underpinning molecular interactions and biological pathways. We have thus devised a network-based measure that can partially fulfill this goal. Our method assigns weights to all proteins (and consequently their encoding genes) by using information flow from a disease to the protein interaction network and back. Similarity between two diseases is then defined as the cosine of the angle between their corresponding weight vectors. The proposed method also provides a way to suggest disease-pathway associations by using the weights assigned to the genes to perform enrichment analysis for each disease. By calculating pairwise similarities between 2534 diseases, we show that our disease similarity measure is strongly correlated with the probability of finding the diseases in the same disease family and, more importantly, sharing biological pathways. We have also compared our results to those of MimMiner, a text-mining method that assigns pairwise similarity scores to diseases. We find the results of the two methods to be complementary. It is also shown that clustering diseases based on their similarities and performing enrichment analysis for the cluster centers significantly increases the term association rate, suggesting that the cluster centers are better representatives for biological pathways than the diseases themselves. This lends support to the view that our similarity measure is a good indicator of relatedness of biological processes involved in causing the diseases. Although not needed for understanding this paper, the raw results are available for download for further study at ftp://ftp.ncbi.nlm.nih.gov/pub/qmbpmn/DiseaseRelations/. Public Library of Science 2014-10-31 /pmc/articles/PMC4216010/ /pubmed/25360770 http://dx.doi.org/10.1371/journal.pone.0110936 Text en https://creativecommons.org/publicdomain/zero/1.0/ This is an open-access article distributed under the terms of the Creative Commons Public Domain declaration, which stipulates that, once placed in the public domain, this work may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose.
spellingShingle Research Article
Hamaneh, Mehdi Bagheri
Yu, Yi-Kuo
Relating Diseases by Integrating Gene Associations and Information Flow through Protein Interaction Network
title Relating Diseases by Integrating Gene Associations and Information Flow through Protein Interaction Network
title_full Relating Diseases by Integrating Gene Associations and Information Flow through Protein Interaction Network
title_fullStr Relating Diseases by Integrating Gene Associations and Information Flow through Protein Interaction Network
title_full_unstemmed Relating Diseases by Integrating Gene Associations and Information Flow through Protein Interaction Network
title_short Relating Diseases by Integrating Gene Associations and Information Flow through Protein Interaction Network
title_sort relating diseases by integrating gene associations and information flow through protein interaction network
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4216010/
https://www.ncbi.nlm.nih.gov/pubmed/25360770
http://dx.doi.org/10.1371/journal.pone.0110936
work_keys_str_mv AT hamanehmehdibagheri relatingdiseasesbyintegratinggeneassociationsandinformationflowthroughproteininteractionnetwork
AT yuyikuo relatingdiseasesbyintegratinggeneassociationsandinformationflowthroughproteininteractionnetwork