Cargando…
Network-based association analysis to infer new disease-gene relationships using large-scale protein interactions
Protein-protein interactions integrated with disease-gene associations represent important information for revealing protein functions under disease conditions to improve the prevention, diagnosis, and treatment of complex diseases. Although several studies have attempted to identify disease-gene as...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6021074/ https://www.ncbi.nlm.nih.gov/pubmed/29949603 http://dx.doi.org/10.1371/journal.pone.0199435 |
_version_ | 1783335404675530752 |
---|---|
author | Suratanee, Apichat Plaimas, Kitiporn |
author_facet | Suratanee, Apichat Plaimas, Kitiporn |
author_sort | Suratanee, Apichat |
collection | PubMed |
description | Protein-protein interactions integrated with disease-gene associations represent important information for revealing protein functions under disease conditions to improve the prevention, diagnosis, and treatment of complex diseases. Although several studies have attempted to identify disease-gene associations, the number of possible disease-gene associations is very small. High-throughput technologies have been established experimentally to identify the association between genes and diseases. However, these techniques are still quite expensive, time consuming, and even difficult to perform. Thus, based on currently available data and knowledge, computational methods have served as alternatives to provide more possible associations to increase our understanding of disease mechanisms. Here, a new network-based algorithm, namely, Disease-Gene Association (DGA), was developed to calculate the association score of a query gene to a new possible set of diseases. First, a large-scale protein interaction network was constructed, and the relationship between two interacting proteins was calculated with regard to the disease relationship. Novel plausible disease-gene pairs were identified and statistically scored by our algorithm using neighboring protein information. The results yielded high performance for disease-gene prediction, with an F-measure of 0.78 and an AUC of 0.86. To identify promising candidates of disease-gene associations, the association coverage of genes and diseases were calculated and used with the association score to perform gene and disease selection. Based on gene selection, we identified promising pairs that exhibited evidence related to several important diseases, e.g., inflammation, lipid metabolism, inborn errors, xanthomatosis, cerebellar ataxia, cognitive deterioration, malignant neoplasms of the skin and malignant tumors of the cervix. Focusing on disease selection, we identified target genes that were important to blistering skin diseases and muscular dystrophy. In summary, our developed algorithm is simple, efficiently identifies disease–gene associations in the protein-protein interaction network and provides additional knowledge regarding disease-gene associations. This method can be generalized to other association studies to further advance biomedical science. |
format | Online Article Text |
id | pubmed-6021074 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-60210742018-07-07 Network-based association analysis to infer new disease-gene relationships using large-scale protein interactions Suratanee, Apichat Plaimas, Kitiporn PLoS One Research Article Protein-protein interactions integrated with disease-gene associations represent important information for revealing protein functions under disease conditions to improve the prevention, diagnosis, and treatment of complex diseases. Although several studies have attempted to identify disease-gene associations, the number of possible disease-gene associations is very small. High-throughput technologies have been established experimentally to identify the association between genes and diseases. However, these techniques are still quite expensive, time consuming, and even difficult to perform. Thus, based on currently available data and knowledge, computational methods have served as alternatives to provide more possible associations to increase our understanding of disease mechanisms. Here, a new network-based algorithm, namely, Disease-Gene Association (DGA), was developed to calculate the association score of a query gene to a new possible set of diseases. First, a large-scale protein interaction network was constructed, and the relationship between two interacting proteins was calculated with regard to the disease relationship. Novel plausible disease-gene pairs were identified and statistically scored by our algorithm using neighboring protein information. The results yielded high performance for disease-gene prediction, with an F-measure of 0.78 and an AUC of 0.86. To identify promising candidates of disease-gene associations, the association coverage of genes and diseases were calculated and used with the association score to perform gene and disease selection. Based on gene selection, we identified promising pairs that exhibited evidence related to several important diseases, e.g., inflammation, lipid metabolism, inborn errors, xanthomatosis, cerebellar ataxia, cognitive deterioration, malignant neoplasms of the skin and malignant tumors of the cervix. Focusing on disease selection, we identified target genes that were important to blistering skin diseases and muscular dystrophy. In summary, our developed algorithm is simple, efficiently identifies disease–gene associations in the protein-protein interaction network and provides additional knowledge regarding disease-gene associations. This method can be generalized to other association studies to further advance biomedical science. Public Library of Science 2018-06-27 /pmc/articles/PMC6021074/ /pubmed/29949603 http://dx.doi.org/10.1371/journal.pone.0199435 Text en © 2018 Suratanee, Plaimas http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Suratanee, Apichat Plaimas, Kitiporn Network-based association analysis to infer new disease-gene relationships using large-scale protein interactions |
title | Network-based association analysis to infer new disease-gene relationships using large-scale protein interactions |
title_full | Network-based association analysis to infer new disease-gene relationships using large-scale protein interactions |
title_fullStr | Network-based association analysis to infer new disease-gene relationships using large-scale protein interactions |
title_full_unstemmed | Network-based association analysis to infer new disease-gene relationships using large-scale protein interactions |
title_short | Network-based association analysis to infer new disease-gene relationships using large-scale protein interactions |
title_sort | network-based association analysis to infer new disease-gene relationships using large-scale protein interactions |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6021074/ https://www.ncbi.nlm.nih.gov/pubmed/29949603 http://dx.doi.org/10.1371/journal.pone.0199435 |
work_keys_str_mv | AT surataneeapichat networkbasedassociationanalysistoinfernewdiseasegenerelationshipsusinglargescaleproteininteractions AT plaimaskitiporn networkbasedassociationanalysistoinfernewdiseasegenerelationshipsusinglargescaleproteininteractions |