Cargando…

GeneMANIA: a real-time multiple association network integration algorithm for predicting gene function

BACKGROUND: Most successful computational approaches for protein function prediction integrate multiple genomics and proteomics data sources to make inferences about the function of unknown proteins. The most accurate of these algorithms have long running times, making them unsuitable for real-time...

Descripción completa

Detalles Bibliográficos
Autores principales: Mostafavi, Sara, Ray, Debajyoti, Warde-Farley, David, Grouios, Chris, Morris, Quaid
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2447538/
https://www.ncbi.nlm.nih.gov/pubmed/18613948
http://dx.doi.org/10.1186/gb-2008-9-s1-s4
_version_ 1782156962572533760
author Mostafavi, Sara
Ray, Debajyoti
Warde-Farley, David
Grouios, Chris
Morris, Quaid
author_facet Mostafavi, Sara
Ray, Debajyoti
Warde-Farley, David
Grouios, Chris
Morris, Quaid
author_sort Mostafavi, Sara
collection PubMed
description BACKGROUND: Most successful computational approaches for protein function prediction integrate multiple genomics and proteomics data sources to make inferences about the function of unknown proteins. The most accurate of these algorithms have long running times, making them unsuitable for real-time protein function prediction in large genomes. As a result, the predictions of these algorithms are stored in static databases that can easily become outdated. We propose a new algorithm, GeneMANIA, that is as accurate as the leading methods, while capable of predicting protein function in real-time. RESULTS: We use a fast heuristic algorithm, derived from ridge regression, to integrate multiple functional association networks and predict gene function from a single process-specific network using label propagation. Our algorithm is efficient enough to be deployed on a modern webserver and is as accurate as, or more so than, the leading methods on the MouseFunc I benchmark and a new yeast function prediction benchmark; it is robust to redundant and irrelevant data and requires, on average, less than ten seconds of computation time on tasks from these benchmarks. CONCLUSION: GeneMANIA is fast enough to predict gene function on-the-fly while achieving state-of-the-art accuracy. A prototype version of a GeneMANIA-based webserver is available at .
format Text
id pubmed-2447538
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-24475382008-07-10 GeneMANIA: a real-time multiple association network integration algorithm for predicting gene function Mostafavi, Sara Ray, Debajyoti Warde-Farley, David Grouios, Chris Morris, Quaid Genome Biol Method BACKGROUND: Most successful computational approaches for protein function prediction integrate multiple genomics and proteomics data sources to make inferences about the function of unknown proteins. The most accurate of these algorithms have long running times, making them unsuitable for real-time protein function prediction in large genomes. As a result, the predictions of these algorithms are stored in static databases that can easily become outdated. We propose a new algorithm, GeneMANIA, that is as accurate as the leading methods, while capable of predicting protein function in real-time. RESULTS: We use a fast heuristic algorithm, derived from ridge regression, to integrate multiple functional association networks and predict gene function from a single process-specific network using label propagation. Our algorithm is efficient enough to be deployed on a modern webserver and is as accurate as, or more so than, the leading methods on the MouseFunc I benchmark and a new yeast function prediction benchmark; it is robust to redundant and irrelevant data and requires, on average, less than ten seconds of computation time on tasks from these benchmarks. CONCLUSION: GeneMANIA is fast enough to predict gene function on-the-fly while achieving state-of-the-art accuracy. A prototype version of a GeneMANIA-based webserver is available at . BioMed Central 2008 2008-06-27 /pmc/articles/PMC2447538/ /pubmed/18613948 http://dx.doi.org/10.1186/gb-2008-9-s1-s4 Text en Copyright © 2008 Mostafavi et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Method
Mostafavi, Sara
Ray, Debajyoti
Warde-Farley, David
Grouios, Chris
Morris, Quaid
GeneMANIA: a real-time multiple association network integration algorithm for predicting gene function
title GeneMANIA: a real-time multiple association network integration algorithm for predicting gene function
title_full GeneMANIA: a real-time multiple association network integration algorithm for predicting gene function
title_fullStr GeneMANIA: a real-time multiple association network integration algorithm for predicting gene function
title_full_unstemmed GeneMANIA: a real-time multiple association network integration algorithm for predicting gene function
title_short GeneMANIA: a real-time multiple association network integration algorithm for predicting gene function
title_sort genemania: a real-time multiple association network integration algorithm for predicting gene function
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2447538/
https://www.ncbi.nlm.nih.gov/pubmed/18613948
http://dx.doi.org/10.1186/gb-2008-9-s1-s4
work_keys_str_mv AT mostafavisara genemaniaarealtimemultipleassociationnetworkintegrationalgorithmforpredictinggenefunction
AT raydebajyoti genemaniaarealtimemultipleassociationnetworkintegrationalgorithmforpredictinggenefunction
AT wardefarleydavid genemaniaarealtimemultipleassociationnetworkintegrationalgorithmforpredictinggenefunction
AT grouioschris genemaniaarealtimemultipleassociationnetworkintegrationalgorithmforpredictinggenefunction
AT morrisquaid genemaniaarealtimemultipleassociationnetworkintegrationalgorithmforpredictinggenefunction