Cargando…

Disease candidate gene identification and prioritization using protein interaction networks

BACKGROUND: Although most of the current disease candidate gene identification and prioritization methods depend on functional annotations, the coverage of the gene functional annotations is a limiting factor. In the current study, we describe a candidate gene prioritization method that is entirely...

Descripción completa

Detalles Bibliográficos
Autores principales:	Chen, Jing, Aronow, Bruce J, Jegga, Anil G
Formato:	Texto
Lenguaje:	English
Publicado:	BioMed Central 2009
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2657789/ https://www.ncbi.nlm.nih.gov/pubmed/19245720 http://dx.doi.org/10.1186/1471-2105-10-73

_version_	1782165615479357440
author	Chen, Jing Aronow, Bruce J Jegga, Anil G
author_facet	Chen, Jing Aronow, Bruce J Jegga, Anil G
author_sort	Chen, Jing
collection	PubMed
description	BACKGROUND: Although most of the current disease candidate gene identification and prioritization methods depend on functional annotations, the coverage of the gene functional annotations is a limiting factor. In the current study, we describe a candidate gene prioritization method that is entirely based on protein-protein interaction network (PPIN) analyses. RESULTS: For the first time, extended versions of the PageRank and HITS algorithms, and the K-Step Markov method are applied to prioritize disease candidate genes in a training-test schema. Using a list of known disease-related genes from our earlier study as a training set ("seeds"), and the rest of the known genes as a test list, we perform large-scale cross validation to rank the candidate genes and also evaluate and compare the performance of our approach. Under appropriate settings – for example, a back probability of 0.3 for PageRank with Priors and HITS with Priors, and step size 6 for K-Step Markov method – the three methods achieved a comparable AUC value, suggesting a similar performance. CONCLUSION: Even though network-based methods are generally not as effective as integrated functional annotation-based methods for disease candidate gene prioritization, in a one-to-one comparison, PPIN-based candidate gene prioritization performs better than all other gene features or annotations. Additionally, we demonstrate that methods used for studying both social and Web networks can be successfully used for disease candidate gene prioritization.
format	Text
id	pubmed-2657789
institution	National Center for Biotechnology Information
language	English
publishDate	2009
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-26577892009-03-19 Disease candidate gene identification and prioritization using protein interaction networks Chen, Jing Aronow, Bruce J Jegga, Anil G BMC Bioinformatics Research Article BACKGROUND: Although most of the current disease candidate gene identification and prioritization methods depend on functional annotations, the coverage of the gene functional annotations is a limiting factor. In the current study, we describe a candidate gene prioritization method that is entirely based on protein-protein interaction network (PPIN) analyses. RESULTS: For the first time, extended versions of the PageRank and HITS algorithms, and the K-Step Markov method are applied to prioritize disease candidate genes in a training-test schema. Using a list of known disease-related genes from our earlier study as a training set ("seeds"), and the rest of the known genes as a test list, we perform large-scale cross validation to rank the candidate genes and also evaluate and compare the performance of our approach. Under appropriate settings – for example, a back probability of 0.3 for PageRank with Priors and HITS with Priors, and step size 6 for K-Step Markov method – the three methods achieved a comparable AUC value, suggesting a similar performance. CONCLUSION: Even though network-based methods are generally not as effective as integrated functional annotation-based methods for disease candidate gene prioritization, in a one-to-one comparison, PPIN-based candidate gene prioritization performs better than all other gene features or annotations. Additionally, we demonstrate that methods used for studying both social and Web networks can be successfully used for disease candidate gene prioritization. BioMed Central 2009-02-27 /pmc/articles/PMC2657789/ /pubmed/19245720 http://dx.doi.org/10.1186/1471-2105-10-73 Text en Copyright © 2009 Chen et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Article Chen, Jing Aronow, Bruce J Jegga, Anil G Disease candidate gene identification and prioritization using protein interaction networks
title	Disease candidate gene identification and prioritization using protein interaction networks
title_full	Disease candidate gene identification and prioritization using protein interaction networks
title_fullStr	Disease candidate gene identification and prioritization using protein interaction networks
title_full_unstemmed	Disease candidate gene identification and prioritization using protein interaction networks
title_short	Disease candidate gene identification and prioritization using protein interaction networks
title_sort	disease candidate gene identification and prioritization using protein interaction networks
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2657789/ https://www.ncbi.nlm.nih.gov/pubmed/19245720 http://dx.doi.org/10.1186/1471-2105-10-73
work_keys_str_mv	AT chenjing diseasecandidategeneidentificationandprioritizationusingproteininteractionnetworks AT aronowbrucej diseasecandidategeneidentificationandprioritizationusingproteininteractionnetworks AT jeggaanilg diseasecandidategeneidentificationandprioritizationusingproteininteractionnetworks

Disease candidate gene identification and prioritization using protein interaction networks

Ejemplares similares