Cargando…
Predicting diabetes mellitus genes via protein-protein interaction and protein subcellular localization information
BACKGROUND: Diabetes mellitus characterized by hyperglycemia as a result of insufficient production of or reduced sensitivity to insulin poses a growing threat to the health of people. It is a heterogeneous disorder with multiple etiologies consisting of type 1 diabetes, type 2 diabetes, gestational...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5001230/ https://www.ncbi.nlm.nih.gov/pubmed/27535125 http://dx.doi.org/10.1186/s12864-016-2795-y |
_version_ | 1782450435631611904 |
---|---|
author | Tang, Xiwei Hu, Xiaohua Yang, Xuejun Fan, Yetian Li, Yongfan Hu, Wei Liao, Yongzhong Zheng, Ming cai Peng, Wei Gao, Li |
author_facet | Tang, Xiwei Hu, Xiaohua Yang, Xuejun Fan, Yetian Li, Yongfan Hu, Wei Liao, Yongzhong Zheng, Ming cai Peng, Wei Gao, Li |
author_sort | Tang, Xiwei |
collection | PubMed |
description | BACKGROUND: Diabetes mellitus characterized by hyperglycemia as a result of insufficient production of or reduced sensitivity to insulin poses a growing threat to the health of people. It is a heterogeneous disorder with multiple etiologies consisting of type 1 diabetes, type 2 diabetes, gestational diabetes and so on. Diabetes-associated protein/gene prediction is a key step to understand the cellular mechanisms related to diabetes mellitus. Compared with experimental methods, computational predictions of candidate proteins/genes are cheaper and more effortless. Protein-protein interaction (PPI) data produced by the high-throughput technology have been used to prioritize candidate disease genes/proteins. However, the false interactions in the PPI data seriously hurt computational methods performance. In order to address that particular question, new methods are developed to identify candidate disease genes/proteins via integrating biological data from other sources. RESULTS: In this study, a new framework called PDMG is proposed to predict candidate disease genes/proteins. First, the weighted networks are building in terms of the combination of the subcellular localization information and PPI data. To form the weighted networks, the importance of each compartment is evaluated based on the number of interacted proteins in this compartment. This is because the very different roles played by different compartments in cell activities. Besides, some compartments are more important than others. Based on the evaluated compartments, the interactions between proteins are scored and the weighted PPI networks are constructed. Second, the known disease genes are extracted from OMIM database as the seed genes to expand disease-specific networks based on the weighted networks. Third, the weighted values between a protein and its neighbors in the disease-related networks are added together and the sum is as the score of the protein. Last but not least, the proteins are ranked based on descending order of their scores. The candidate proteins in the top are considered to be associated with the diseases and are potential disease-related proteins. Various types of data, such as type 2 diabetes-associated genes, subcellular localizations and protein interactions, are used to test PDMG method. CONCLUSIONS: The results show that the proteins/genes functionally exerting a direct influence over diabetes are consistently placed at the head of the queue. PDMG expands and ranks 445 candidate proteins from the seed set including original 27 type 2 diabetes proteins. Out of the top 27 proteins, 14 proteins are the real type 2 diabetes proteins. The literature extracted from the PubMed database has proved that, out of 13 novel proteins, 8 proteins are associated with diabetes. |
format | Online Article Text |
id | pubmed-5001230 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-50012302016-09-06 Predicting diabetes mellitus genes via protein-protein interaction and protein subcellular localization information Tang, Xiwei Hu, Xiaohua Yang, Xuejun Fan, Yetian Li, Yongfan Hu, Wei Liao, Yongzhong Zheng, Ming cai Peng, Wei Gao, Li BMC Genomics Research BACKGROUND: Diabetes mellitus characterized by hyperglycemia as a result of insufficient production of or reduced sensitivity to insulin poses a growing threat to the health of people. It is a heterogeneous disorder with multiple etiologies consisting of type 1 diabetes, type 2 diabetes, gestational diabetes and so on. Diabetes-associated protein/gene prediction is a key step to understand the cellular mechanisms related to diabetes mellitus. Compared with experimental methods, computational predictions of candidate proteins/genes are cheaper and more effortless. Protein-protein interaction (PPI) data produced by the high-throughput technology have been used to prioritize candidate disease genes/proteins. However, the false interactions in the PPI data seriously hurt computational methods performance. In order to address that particular question, new methods are developed to identify candidate disease genes/proteins via integrating biological data from other sources. RESULTS: In this study, a new framework called PDMG is proposed to predict candidate disease genes/proteins. First, the weighted networks are building in terms of the combination of the subcellular localization information and PPI data. To form the weighted networks, the importance of each compartment is evaluated based on the number of interacted proteins in this compartment. This is because the very different roles played by different compartments in cell activities. Besides, some compartments are more important than others. Based on the evaluated compartments, the interactions between proteins are scored and the weighted PPI networks are constructed. Second, the known disease genes are extracted from OMIM database as the seed genes to expand disease-specific networks based on the weighted networks. Third, the weighted values between a protein and its neighbors in the disease-related networks are added together and the sum is as the score of the protein. Last but not least, the proteins are ranked based on descending order of their scores. The candidate proteins in the top are considered to be associated with the diseases and are potential disease-related proteins. Various types of data, such as type 2 diabetes-associated genes, subcellular localizations and protein interactions, are used to test PDMG method. CONCLUSIONS: The results show that the proteins/genes functionally exerting a direct influence over diabetes are consistently placed at the head of the queue. PDMG expands and ranks 445 candidate proteins from the seed set including original 27 type 2 diabetes proteins. Out of the top 27 proteins, 14 proteins are the real type 2 diabetes proteins. The literature extracted from the PubMed database has proved that, out of 13 novel proteins, 8 proteins are associated with diabetes. BioMed Central 2016-08-18 /pmc/articles/PMC5001230/ /pubmed/27535125 http://dx.doi.org/10.1186/s12864-016-2795-y Text en © Tang et al. 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Tang, Xiwei Hu, Xiaohua Yang, Xuejun Fan, Yetian Li, Yongfan Hu, Wei Liao, Yongzhong Zheng, Ming cai Peng, Wei Gao, Li Predicting diabetes mellitus genes via protein-protein interaction and protein subcellular localization information |
title | Predicting diabetes mellitus genes via protein-protein interaction and protein subcellular localization information |
title_full | Predicting diabetes mellitus genes via protein-protein interaction and protein subcellular localization information |
title_fullStr | Predicting diabetes mellitus genes via protein-protein interaction and protein subcellular localization information |
title_full_unstemmed | Predicting diabetes mellitus genes via protein-protein interaction and protein subcellular localization information |
title_short | Predicting diabetes mellitus genes via protein-protein interaction and protein subcellular localization information |
title_sort | predicting diabetes mellitus genes via protein-protein interaction and protein subcellular localization information |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5001230/ https://www.ncbi.nlm.nih.gov/pubmed/27535125 http://dx.doi.org/10.1186/s12864-016-2795-y |
work_keys_str_mv | AT tangxiwei predictingdiabetesmellitusgenesviaproteinproteininteractionandproteinsubcellularlocalizationinformation AT huxiaohua predictingdiabetesmellitusgenesviaproteinproteininteractionandproteinsubcellularlocalizationinformation AT yangxuejun predictingdiabetesmellitusgenesviaproteinproteininteractionandproteinsubcellularlocalizationinformation AT fanyetian predictingdiabetesmellitusgenesviaproteinproteininteractionandproteinsubcellularlocalizationinformation AT liyongfan predictingdiabetesmellitusgenesviaproteinproteininteractionandproteinsubcellularlocalizationinformation AT huwei predictingdiabetesmellitusgenesviaproteinproteininteractionandproteinsubcellularlocalizationinformation AT liaoyongzhong predictingdiabetesmellitusgenesviaproteinproteininteractionandproteinsubcellularlocalizationinformation AT zhengmingcai predictingdiabetesmellitusgenesviaproteinproteininteractionandproteinsubcellularlocalizationinformation AT pengwei predictingdiabetesmellitusgenesviaproteinproteininteractionandproteinsubcellularlocalizationinformation AT gaoli predictingdiabetesmellitusgenesviaproteinproteininteractionandproteinsubcellularlocalizationinformation |