Cargando…

Gene Function Prediction from Functional Association Networks Using Kernel Partial Least Squares Regression

With the growing availability of large-scale biological datasets, automated methods of extracting functionally meaningful information from this data are becoming increasingly important. Data relating to functional association between genes or proteins, such as co-expression or functional association...

Descripción completa

Detalles Bibliográficos
Autores principales: Lehtinen, Sonja, Lees, Jon, Bähler, Jürg, Shawe-Taylor, John, Orengo, Christine
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4545790/
https://www.ncbi.nlm.nih.gov/pubmed/26288239
http://dx.doi.org/10.1371/journal.pone.0134668
_version_ 1782386785056194560
author Lehtinen, Sonja
Lees, Jon
Bähler, Jürg
Shawe-Taylor, John
Orengo, Christine
author_facet Lehtinen, Sonja
Lees, Jon
Bähler, Jürg
Shawe-Taylor, John
Orengo, Christine
author_sort Lehtinen, Sonja
collection PubMed
description With the growing availability of large-scale biological datasets, automated methods of extracting functionally meaningful information from this data are becoming increasingly important. Data relating to functional association between genes or proteins, such as co-expression or functional association, is often represented in terms of gene or protein networks. Several methods of predicting gene function from these networks have been proposed. However, evaluating the relative performance of these algorithms may not be trivial: concerns have been raised over biases in different benchmarking methods and datasets, particularly relating to non-independence of functional association data and test data. In this paper we propose a new network-based gene function prediction algorithm using a commute-time kernel and partial least squares regression (Compass). We compare Compass to GeneMANIA, a leading network-based prediction algorithm, using a number of different benchmarks, and find that Compass outperforms GeneMANIA on these benchmarks. We also explicitly explore problems associated with the non-independence of functional association data and test data. We find that a benchmark based on the Gene Ontology database, which, directly or indirectly, incorporates information from other databases, may considerably overestimate the performance of algorithms exploiting functional association data for prediction.
format Online
Article
Text
id pubmed-4545790
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-45457902015-09-01 Gene Function Prediction from Functional Association Networks Using Kernel Partial Least Squares Regression Lehtinen, Sonja Lees, Jon Bähler, Jürg Shawe-Taylor, John Orengo, Christine PLoS One Research Article With the growing availability of large-scale biological datasets, automated methods of extracting functionally meaningful information from this data are becoming increasingly important. Data relating to functional association between genes or proteins, such as co-expression or functional association, is often represented in terms of gene or protein networks. Several methods of predicting gene function from these networks have been proposed. However, evaluating the relative performance of these algorithms may not be trivial: concerns have been raised over biases in different benchmarking methods and datasets, particularly relating to non-independence of functional association data and test data. In this paper we propose a new network-based gene function prediction algorithm using a commute-time kernel and partial least squares regression (Compass). We compare Compass to GeneMANIA, a leading network-based prediction algorithm, using a number of different benchmarks, and find that Compass outperforms GeneMANIA on these benchmarks. We also explicitly explore problems associated with the non-independence of functional association data and test data. We find that a benchmark based on the Gene Ontology database, which, directly or indirectly, incorporates information from other databases, may considerably overestimate the performance of algorithms exploiting functional association data for prediction. Public Library of Science 2015-08-19 /pmc/articles/PMC4545790/ /pubmed/26288239 http://dx.doi.org/10.1371/journal.pone.0134668 Text en © 2015 Lehtinen et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Lehtinen, Sonja
Lees, Jon
Bähler, Jürg
Shawe-Taylor, John
Orengo, Christine
Gene Function Prediction from Functional Association Networks Using Kernel Partial Least Squares Regression
title Gene Function Prediction from Functional Association Networks Using Kernel Partial Least Squares Regression
title_full Gene Function Prediction from Functional Association Networks Using Kernel Partial Least Squares Regression
title_fullStr Gene Function Prediction from Functional Association Networks Using Kernel Partial Least Squares Regression
title_full_unstemmed Gene Function Prediction from Functional Association Networks Using Kernel Partial Least Squares Regression
title_short Gene Function Prediction from Functional Association Networks Using Kernel Partial Least Squares Regression
title_sort gene function prediction from functional association networks using kernel partial least squares regression
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4545790/
https://www.ncbi.nlm.nih.gov/pubmed/26288239
http://dx.doi.org/10.1371/journal.pone.0134668
work_keys_str_mv AT lehtinensonja genefunctionpredictionfromfunctionalassociationnetworksusingkernelpartialleastsquaresregression
AT leesjon genefunctionpredictionfromfunctionalassociationnetworksusingkernelpartialleastsquaresregression
AT bahlerjurg genefunctionpredictionfromfunctionalassociationnetworksusingkernelpartialleastsquaresregression
AT shawetaylorjohn genefunctionpredictionfromfunctionalassociationnetworksusingkernelpartialleastsquaresregression
AT orengochristine genefunctionpredictionfromfunctionalassociationnetworksusingkernelpartialleastsquaresregression