Cargando…

Disease candidate gene identification and prioritization using protein interaction networks

BACKGROUND: Although most of the current disease candidate gene identification and prioritization methods depend on functional annotations, the coverage of the gene functional annotations is a limiting factor. In the current study, we describe a candidate gene prioritization method that is entirely...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Jing, Aronow, Bruce J, Jegga, Anil G
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2657789/
https://www.ncbi.nlm.nih.gov/pubmed/19245720
http://dx.doi.org/10.1186/1471-2105-10-73
_version_ 1782165615479357440
author Chen, Jing
Aronow, Bruce J
Jegga, Anil G
author_facet Chen, Jing
Aronow, Bruce J
Jegga, Anil G
author_sort Chen, Jing
collection PubMed
description BACKGROUND: Although most of the current disease candidate gene identification and prioritization methods depend on functional annotations, the coverage of the gene functional annotations is a limiting factor. In the current study, we describe a candidate gene prioritization method that is entirely based on protein-protein interaction network (PPIN) analyses. RESULTS: For the first time, extended versions of the PageRank and HITS algorithms, and the K-Step Markov method are applied to prioritize disease candidate genes in a training-test schema. Using a list of known disease-related genes from our earlier study as a training set ("seeds"), and the rest of the known genes as a test list, we perform large-scale cross validation to rank the candidate genes and also evaluate and compare the performance of our approach. Under appropriate settings – for example, a back probability of 0.3 for PageRank with Priors and HITS with Priors, and step size 6 for K-Step Markov method – the three methods achieved a comparable AUC value, suggesting a similar performance. CONCLUSION: Even though network-based methods are generally not as effective as integrated functional annotation-based methods for disease candidate gene prioritization, in a one-to-one comparison, PPIN-based candidate gene prioritization performs better than all other gene features or annotations. Additionally, we demonstrate that methods used for studying both social and Web networks can be successfully used for disease candidate gene prioritization.
format Text
id pubmed-2657789
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26577892009-03-19 Disease candidate gene identification and prioritization using protein interaction networks Chen, Jing Aronow, Bruce J Jegga, Anil G BMC Bioinformatics Research Article BACKGROUND: Although most of the current disease candidate gene identification and prioritization methods depend on functional annotations, the coverage of the gene functional annotations is a limiting factor. In the current study, we describe a candidate gene prioritization method that is entirely based on protein-protein interaction network (PPIN) analyses. RESULTS: For the first time, extended versions of the PageRank and HITS algorithms, and the K-Step Markov method are applied to prioritize disease candidate genes in a training-test schema. Using a list of known disease-related genes from our earlier study as a training set ("seeds"), and the rest of the known genes as a test list, we perform large-scale cross validation to rank the candidate genes and also evaluate and compare the performance of our approach. Under appropriate settings – for example, a back probability of 0.3 for PageRank with Priors and HITS with Priors, and step size 6 for K-Step Markov method – the three methods achieved a comparable AUC value, suggesting a similar performance. CONCLUSION: Even though network-based methods are generally not as effective as integrated functional annotation-based methods for disease candidate gene prioritization, in a one-to-one comparison, PPIN-based candidate gene prioritization performs better than all other gene features or annotations. Additionally, we demonstrate that methods used for studying both social and Web networks can be successfully used for disease candidate gene prioritization. BioMed Central 2009-02-27 /pmc/articles/PMC2657789/ /pubmed/19245720 http://dx.doi.org/10.1186/1471-2105-10-73 Text en Copyright © 2009 Chen et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Chen, Jing
Aronow, Bruce J
Jegga, Anil G
Disease candidate gene identification and prioritization using protein interaction networks
title Disease candidate gene identification and prioritization using protein interaction networks
title_full Disease candidate gene identification and prioritization using protein interaction networks
title_fullStr Disease candidate gene identification and prioritization using protein interaction networks
title_full_unstemmed Disease candidate gene identification and prioritization using protein interaction networks
title_short Disease candidate gene identification and prioritization using protein interaction networks
title_sort disease candidate gene identification and prioritization using protein interaction networks
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2657789/
https://www.ncbi.nlm.nih.gov/pubmed/19245720
http://dx.doi.org/10.1186/1471-2105-10-73
work_keys_str_mv AT chenjing diseasecandidategeneidentificationandprioritizationusingproteininteractionnetworks
AT aronowbrucej diseasecandidategeneidentificationandprioritizationusingproteininteractionnetworks
AT jeggaanilg diseasecandidategeneidentificationandprioritizationusingproteininteractionnetworks