Cargando…

Predicting diabetes mellitus genes via protein-protein interaction and protein subcellular localization information

BACKGROUND: Diabetes mellitus characterized by hyperglycemia as a result of insufficient production of or reduced sensitivity to insulin poses a growing threat to the health of people. It is a heterogeneous disorder with multiple etiologies consisting of type 1 diabetes, type 2 diabetes, gestational...

Descripción completa

Detalles Bibliográficos
Autores principales: Tang, Xiwei, Hu, Xiaohua, Yang, Xuejun, Fan, Yetian, Li, Yongfan, Hu, Wei, Liao, Yongzhong, Zheng, Ming cai, Peng, Wei, Gao, Li
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5001230/
https://www.ncbi.nlm.nih.gov/pubmed/27535125
http://dx.doi.org/10.1186/s12864-016-2795-y
_version_ 1782450435631611904
author Tang, Xiwei
Hu, Xiaohua
Yang, Xuejun
Fan, Yetian
Li, Yongfan
Hu, Wei
Liao, Yongzhong
Zheng, Ming cai
Peng, Wei
Gao, Li
author_facet Tang, Xiwei
Hu, Xiaohua
Yang, Xuejun
Fan, Yetian
Li, Yongfan
Hu, Wei
Liao, Yongzhong
Zheng, Ming cai
Peng, Wei
Gao, Li
author_sort Tang, Xiwei
collection PubMed
description BACKGROUND: Diabetes mellitus characterized by hyperglycemia as a result of insufficient production of or reduced sensitivity to insulin poses a growing threat to the health of people. It is a heterogeneous disorder with multiple etiologies consisting of type 1 diabetes, type 2 diabetes, gestational diabetes and so on. Diabetes-associated protein/gene prediction is a key step to understand the cellular mechanisms related to diabetes mellitus. Compared with experimental methods, computational predictions of candidate proteins/genes are cheaper and more effortless. Protein-protein interaction (PPI) data produced by the high-throughput technology have been used to prioritize candidate disease genes/proteins. However, the false interactions in the PPI data seriously hurt computational methods performance. In order to address that particular question, new methods are developed to identify candidate disease genes/proteins via integrating biological data from other sources. RESULTS: In this study, a new framework called PDMG is proposed to predict candidate disease genes/proteins. First, the weighted networks are building in terms of the combination of the subcellular localization information and PPI data. To form the weighted networks, the importance of each compartment is evaluated based on the number of interacted proteins in this compartment. This is because the very different roles played by different compartments in cell activities. Besides, some compartments are more important than others. Based on the evaluated compartments, the interactions between proteins are scored and the weighted PPI networks are constructed. Second, the known disease genes are extracted from OMIM database as the seed genes to expand disease-specific networks based on the weighted networks. Third, the weighted values between a protein and its neighbors in the disease-related networks are added together and the sum is as the score of the protein. Last but not least, the proteins are ranked based on descending order of their scores. The candidate proteins in the top are considered to be associated with the diseases and are potential disease-related proteins. Various types of data, such as type 2 diabetes-associated genes, subcellular localizations and protein interactions, are used to test PDMG method. CONCLUSIONS: The results show that the proteins/genes functionally exerting a direct influence over diabetes are consistently placed at the head of the queue. PDMG expands and ranks 445 candidate proteins from the seed set including original 27 type 2 diabetes proteins. Out of the top 27 proteins, 14 proteins are the real type 2 diabetes proteins. The literature extracted from the PubMed database has proved that, out of 13 novel proteins, 8 proteins are associated with diabetes.
format Online
Article
Text
id pubmed-5001230
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-50012302016-09-06 Predicting diabetes mellitus genes via protein-protein interaction and protein subcellular localization information Tang, Xiwei Hu, Xiaohua Yang, Xuejun Fan, Yetian Li, Yongfan Hu, Wei Liao, Yongzhong Zheng, Ming cai Peng, Wei Gao, Li BMC Genomics Research BACKGROUND: Diabetes mellitus characterized by hyperglycemia as a result of insufficient production of or reduced sensitivity to insulin poses a growing threat to the health of people. It is a heterogeneous disorder with multiple etiologies consisting of type 1 diabetes, type 2 diabetes, gestational diabetes and so on. Diabetes-associated protein/gene prediction is a key step to understand the cellular mechanisms related to diabetes mellitus. Compared with experimental methods, computational predictions of candidate proteins/genes are cheaper and more effortless. Protein-protein interaction (PPI) data produced by the high-throughput technology have been used to prioritize candidate disease genes/proteins. However, the false interactions in the PPI data seriously hurt computational methods performance. In order to address that particular question, new methods are developed to identify candidate disease genes/proteins via integrating biological data from other sources. RESULTS: In this study, a new framework called PDMG is proposed to predict candidate disease genes/proteins. First, the weighted networks are building in terms of the combination of the subcellular localization information and PPI data. To form the weighted networks, the importance of each compartment is evaluated based on the number of interacted proteins in this compartment. This is because the very different roles played by different compartments in cell activities. Besides, some compartments are more important than others. Based on the evaluated compartments, the interactions between proteins are scored and the weighted PPI networks are constructed. Second, the known disease genes are extracted from OMIM database as the seed genes to expand disease-specific networks based on the weighted networks. Third, the weighted values between a protein and its neighbors in the disease-related networks are added together and the sum is as the score of the protein. Last but not least, the proteins are ranked based on descending order of their scores. The candidate proteins in the top are considered to be associated with the diseases and are potential disease-related proteins. Various types of data, such as type 2 diabetes-associated genes, subcellular localizations and protein interactions, are used to test PDMG method. CONCLUSIONS: The results show that the proteins/genes functionally exerting a direct influence over diabetes are consistently placed at the head of the queue. PDMG expands and ranks 445 candidate proteins from the seed set including original 27 type 2 diabetes proteins. Out of the top 27 proteins, 14 proteins are the real type 2 diabetes proteins. The literature extracted from the PubMed database has proved that, out of 13 novel proteins, 8 proteins are associated with diabetes. BioMed Central 2016-08-18 /pmc/articles/PMC5001230/ /pubmed/27535125 http://dx.doi.org/10.1186/s12864-016-2795-y Text en © Tang et al. 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Tang, Xiwei
Hu, Xiaohua
Yang, Xuejun
Fan, Yetian
Li, Yongfan
Hu, Wei
Liao, Yongzhong
Zheng, Ming cai
Peng, Wei
Gao, Li
Predicting diabetes mellitus genes via protein-protein interaction and protein subcellular localization information
title Predicting diabetes mellitus genes via protein-protein interaction and protein subcellular localization information
title_full Predicting diabetes mellitus genes via protein-protein interaction and protein subcellular localization information
title_fullStr Predicting diabetes mellitus genes via protein-protein interaction and protein subcellular localization information
title_full_unstemmed Predicting diabetes mellitus genes via protein-protein interaction and protein subcellular localization information
title_short Predicting diabetes mellitus genes via protein-protein interaction and protein subcellular localization information
title_sort predicting diabetes mellitus genes via protein-protein interaction and protein subcellular localization information
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5001230/
https://www.ncbi.nlm.nih.gov/pubmed/27535125
http://dx.doi.org/10.1186/s12864-016-2795-y
work_keys_str_mv AT tangxiwei predictingdiabetesmellitusgenesviaproteinproteininteractionandproteinsubcellularlocalizationinformation
AT huxiaohua predictingdiabetesmellitusgenesviaproteinproteininteractionandproteinsubcellularlocalizationinformation
AT yangxuejun predictingdiabetesmellitusgenesviaproteinproteininteractionandproteinsubcellularlocalizationinformation
AT fanyetian predictingdiabetesmellitusgenesviaproteinproteininteractionandproteinsubcellularlocalizationinformation
AT liyongfan predictingdiabetesmellitusgenesviaproteinproteininteractionandproteinsubcellularlocalizationinformation
AT huwei predictingdiabetesmellitusgenesviaproteinproteininteractionandproteinsubcellularlocalizationinformation
AT liaoyongzhong predictingdiabetesmellitusgenesviaproteinproteininteractionandproteinsubcellularlocalizationinformation
AT zhengmingcai predictingdiabetesmellitusgenesviaproteinproteininteractionandproteinsubcellularlocalizationinformation
AT pengwei predictingdiabetesmellitusgenesviaproteinproteininteractionandproteinsubcellularlocalizationinformation
AT gaoli predictingdiabetesmellitusgenesviaproteinproteininteractionandproteinsubcellularlocalizationinformation