Cargando…
Relating Diseases by Integrating Gene Associations and Information Flow through Protein Interaction Network
Identifying similar diseases could potentially provide deeper understanding of their underlying causes, and may even hint at possible treatments. For this purpose, it is necessary to have a similarity measure that reflects the underpinning molecular interactions and biological pathways. We have thus...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4216010/ https://www.ncbi.nlm.nih.gov/pubmed/25360770 http://dx.doi.org/10.1371/journal.pone.0110936 |
_version_ | 1782342192563486720 |
---|---|
author | Hamaneh, Mehdi Bagheri Yu, Yi-Kuo |
author_facet | Hamaneh, Mehdi Bagheri Yu, Yi-Kuo |
author_sort | Hamaneh, Mehdi Bagheri |
collection | PubMed |
description | Identifying similar diseases could potentially provide deeper understanding of their underlying causes, and may even hint at possible treatments. For this purpose, it is necessary to have a similarity measure that reflects the underpinning molecular interactions and biological pathways. We have thus devised a network-based measure that can partially fulfill this goal. Our method assigns weights to all proteins (and consequently their encoding genes) by using information flow from a disease to the protein interaction network and back. Similarity between two diseases is then defined as the cosine of the angle between their corresponding weight vectors. The proposed method also provides a way to suggest disease-pathway associations by using the weights assigned to the genes to perform enrichment analysis for each disease. By calculating pairwise similarities between 2534 diseases, we show that our disease similarity measure is strongly correlated with the probability of finding the diseases in the same disease family and, more importantly, sharing biological pathways. We have also compared our results to those of MimMiner, a text-mining method that assigns pairwise similarity scores to diseases. We find the results of the two methods to be complementary. It is also shown that clustering diseases based on their similarities and performing enrichment analysis for the cluster centers significantly increases the term association rate, suggesting that the cluster centers are better representatives for biological pathways than the diseases themselves. This lends support to the view that our similarity measure is a good indicator of relatedness of biological processes involved in causing the diseases. Although not needed for understanding this paper, the raw results are available for download for further study at ftp://ftp.ncbi.nlm.nih.gov/pub/qmbpmn/DiseaseRelations/. |
format | Online Article Text |
id | pubmed-4216010 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-42160102014-11-05 Relating Diseases by Integrating Gene Associations and Information Flow through Protein Interaction Network Hamaneh, Mehdi Bagheri Yu, Yi-Kuo PLoS One Research Article Identifying similar diseases could potentially provide deeper understanding of their underlying causes, and may even hint at possible treatments. For this purpose, it is necessary to have a similarity measure that reflects the underpinning molecular interactions and biological pathways. We have thus devised a network-based measure that can partially fulfill this goal. Our method assigns weights to all proteins (and consequently their encoding genes) by using information flow from a disease to the protein interaction network and back. Similarity between two diseases is then defined as the cosine of the angle between their corresponding weight vectors. The proposed method also provides a way to suggest disease-pathway associations by using the weights assigned to the genes to perform enrichment analysis for each disease. By calculating pairwise similarities between 2534 diseases, we show that our disease similarity measure is strongly correlated with the probability of finding the diseases in the same disease family and, more importantly, sharing biological pathways. We have also compared our results to those of MimMiner, a text-mining method that assigns pairwise similarity scores to diseases. We find the results of the two methods to be complementary. It is also shown that clustering diseases based on their similarities and performing enrichment analysis for the cluster centers significantly increases the term association rate, suggesting that the cluster centers are better representatives for biological pathways than the diseases themselves. This lends support to the view that our similarity measure is a good indicator of relatedness of biological processes involved in causing the diseases. Although not needed for understanding this paper, the raw results are available for download for further study at ftp://ftp.ncbi.nlm.nih.gov/pub/qmbpmn/DiseaseRelations/. Public Library of Science 2014-10-31 /pmc/articles/PMC4216010/ /pubmed/25360770 http://dx.doi.org/10.1371/journal.pone.0110936 Text en https://creativecommons.org/publicdomain/zero/1.0/ This is an open-access article distributed under the terms of the Creative Commons Public Domain declaration, which stipulates that, once placed in the public domain, this work may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. |
spellingShingle | Research Article Hamaneh, Mehdi Bagheri Yu, Yi-Kuo Relating Diseases by Integrating Gene Associations and Information Flow through Protein Interaction Network |
title | Relating Diseases by Integrating Gene Associations and Information Flow through Protein Interaction Network |
title_full | Relating Diseases by Integrating Gene Associations and Information Flow through Protein Interaction Network |
title_fullStr | Relating Diseases by Integrating Gene Associations and Information Flow through Protein Interaction Network |
title_full_unstemmed | Relating Diseases by Integrating Gene Associations and Information Flow through Protein Interaction Network |
title_short | Relating Diseases by Integrating Gene Associations and Information Flow through Protein Interaction Network |
title_sort | relating diseases by integrating gene associations and information flow through protein interaction network |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4216010/ https://www.ncbi.nlm.nih.gov/pubmed/25360770 http://dx.doi.org/10.1371/journal.pone.0110936 |
work_keys_str_mv | AT hamanehmehdibagheri relatingdiseasesbyintegratinggeneassociationsandinformationflowthroughproteininteractionnetwork AT yuyikuo relatingdiseasesbyintegratinggeneassociationsandinformationflowthroughproteininteractionnetwork |