Cargando…

DADA: Degree-Aware Algorithms for Network-Based Disease Gene Prioritization

BACKGROUND: High-throughput molecular interaction data have been used effectively to prioritize candidate genes that are linked to a disease, based on the observation that the products of genes associated with similar diseases are likely to interact with each other heavily in a network of protein-pr...

Descripción completa

Detalles Bibliográficos
Autores principales:	Erten, Sinan, Bebek, Gurkan, Ewing, Rob M, Koyutürk, Mehmet
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2011
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3143097/ https://www.ncbi.nlm.nih.gov/pubmed/21699738 http://dx.doi.org/10.1186/1756-0381-4-19

_version_	1782208880922591232
author	Erten, Sinan Bebek, Gurkan Ewing, Rob M Koyutürk, Mehmet
author_facet	Erten, Sinan Bebek, Gurkan Ewing, Rob M Koyutürk, Mehmet
author_sort	Erten, Sinan
collection	PubMed
description	BACKGROUND: High-throughput molecular interaction data have been used effectively to prioritize candidate genes that are linked to a disease, based on the observation that the products of genes associated with similar diseases are likely to interact with each other heavily in a network of protein-protein interactions (PPIs). An important challenge for these applications, however, is the incomplete and noisy nature of PPI data. Information flow based methods alleviate these problems to a certain extent, by considering indirect interactions and multiplicity of paths. RESULTS: We demonstrate that existing methods are likely to favor highly connected genes, making prioritization sensitive to the skewed degree distribution of PPI networks, as well as ascertainment bias in available interaction and disease association data. Motivated by this observation, we propose several statistical adjustment methods to account for the degree distribution of known disease and candidate genes, using a PPI network with associated confidence scores for interactions. We show that the proposed methods can detect loosely connected disease genes that are missed by existing approaches, however, this improvement might come at the price of more false negatives for highly connected genes. Consequently, we develop a suite called DADA, which includes different uniform prioritization methods that effectively integrate existing approaches with the proposed statistical adjustment strategies. Comprehensive experimental results on the Online Mendelian Inheritance in Man (OMIM) database show that DADA outperforms existing methods in prioritizing candidate disease genes. CONCLUSIONS: These results demonstrate the importance of employing accurate statistical models and associated adjustment methods in network-based disease gene prioritization, as well as other network-based functional inference applications. DADA is implemented in Matlab and is freely available at http://compbio.case.edu/dada/.
format	Online Article Text
id	pubmed-3143097
institution	National Center for Biotechnology Information
language	English
publishDate	2011
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-31430972011-07-26 DADA: Degree-Aware Algorithms for Network-Based Disease Gene Prioritization Erten, Sinan Bebek, Gurkan Ewing, Rob M Koyutürk, Mehmet BioData Min Research BACKGROUND: High-throughput molecular interaction data have been used effectively to prioritize candidate genes that are linked to a disease, based on the observation that the products of genes associated with similar diseases are likely to interact with each other heavily in a network of protein-protein interactions (PPIs). An important challenge for these applications, however, is the incomplete and noisy nature of PPI data. Information flow based methods alleviate these problems to a certain extent, by considering indirect interactions and multiplicity of paths. RESULTS: We demonstrate that existing methods are likely to favor highly connected genes, making prioritization sensitive to the skewed degree distribution of PPI networks, as well as ascertainment bias in available interaction and disease association data. Motivated by this observation, we propose several statistical adjustment methods to account for the degree distribution of known disease and candidate genes, using a PPI network with associated confidence scores for interactions. We show that the proposed methods can detect loosely connected disease genes that are missed by existing approaches, however, this improvement might come at the price of more false negatives for highly connected genes. Consequently, we develop a suite called DADA, which includes different uniform prioritization methods that effectively integrate existing approaches with the proposed statistical adjustment strategies. Comprehensive experimental results on the Online Mendelian Inheritance in Man (OMIM) database show that DADA outperforms existing methods in prioritizing candidate disease genes. CONCLUSIONS: These results demonstrate the importance of employing accurate statistical models and associated adjustment methods in network-based disease gene prioritization, as well as other network-based functional inference applications. DADA is implemented in Matlab and is freely available at http://compbio.case.edu/dada/. BioMed Central 2011-06-24 /pmc/articles/PMC3143097/ /pubmed/21699738 http://dx.doi.org/10.1186/1756-0381-4-19 Text en Copyright ©2011 Erten et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Erten, Sinan Bebek, Gurkan Ewing, Rob M Koyutürk, Mehmet DADA: Degree-Aware Algorithms for Network-Based Disease Gene Prioritization
title	DADA: Degree-Aware Algorithms for Network-Based Disease Gene Prioritization
title_full	DADA: Degree-Aware Algorithms for Network-Based Disease Gene Prioritization
title_fullStr	DADA: Degree-Aware Algorithms for Network-Based Disease Gene Prioritization
title_full_unstemmed	DADA: Degree-Aware Algorithms for Network-Based Disease Gene Prioritization
title_short	DADA: Degree-Aware Algorithms for Network-Based Disease Gene Prioritization
title_sort	dada: degree-aware algorithms for network-based disease gene prioritization
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3143097/ https://www.ncbi.nlm.nih.gov/pubmed/21699738 http://dx.doi.org/10.1186/1756-0381-4-19
work_keys_str_mv	AT ertensinan dadadegreeawarealgorithmsfornetworkbaseddiseasegeneprioritization AT bebekgurkan dadadegreeawarealgorithmsfornetworkbaseddiseasegeneprioritization AT ewingrobm dadadegreeawarealgorithmsfornetworkbaseddiseasegeneprioritization AT koyuturkmehmet dadadegreeawarealgorithmsfornetworkbaseddiseasegeneprioritization

DADA: Degree-Aware Algorithms for Network-Based Disease Gene Prioritization

Ejemplares similares