Cargando…

FFPred 3: feature-based function prediction for all Gene Ontology domains

Predicting protein function has been a major goal of bioinformatics for several decades, and it has gained fresh momentum thanks to recent community-wide blind tests aimed at benchmarking available tools on a genomic scale. Sequence-based predictors, especially those performing homology-based transf...

Descripción completa

Detalles Bibliográficos
Autores principales: Cozzetto, Domenico, Minneci, Federico, Currant, Hannah, Jones, David T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4999993/
https://www.ncbi.nlm.nih.gov/pubmed/27561554
http://dx.doi.org/10.1038/srep31865
_version_ 1782450201027411968
author Cozzetto, Domenico
Minneci, Federico
Currant, Hannah
Jones, David T.
author_facet Cozzetto, Domenico
Minneci, Federico
Currant, Hannah
Jones, David T.
author_sort Cozzetto, Domenico
collection PubMed
description Predicting protein function has been a major goal of bioinformatics for several decades, and it has gained fresh momentum thanks to recent community-wide blind tests aimed at benchmarking available tools on a genomic scale. Sequence-based predictors, especially those performing homology-based transfers, remain the most popular but increasing understanding of their limitations has stimulated the development of complementary approaches, which mostly exploit machine learning. Here we present FFPred 3, which is intended for assigning Gene Ontology terms to human protein chains, when homology with characterized proteins can provide little aid. Predictions are made by scanning the input sequences against an array of Support Vector Machines (SVMs), each examining the relationship between protein function and biophysical attributes describing secondary structure, transmembrane helices, intrinsically disordered regions, signal peptides and other motifs. This update features a larger SVM library that extends its coverage to the cellular component sub-ontology for the first time, prompted by the establishment of a dedicated evaluation category within the Critical Assessment of Functional Annotation. The effectiveness of this approach is demonstrated through benchmarking experiments, and its usefulness is illustrated by analysing the potential functional consequences of alternative splicing in human and their relationship to patterns of biological features.
format Online
Article
Text
id pubmed-4999993
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-49999932016-09-07 FFPred 3: feature-based function prediction for all Gene Ontology domains Cozzetto, Domenico Minneci, Federico Currant, Hannah Jones, David T. Sci Rep Article Predicting protein function has been a major goal of bioinformatics for several decades, and it has gained fresh momentum thanks to recent community-wide blind tests aimed at benchmarking available tools on a genomic scale. Sequence-based predictors, especially those performing homology-based transfers, remain the most popular but increasing understanding of their limitations has stimulated the development of complementary approaches, which mostly exploit machine learning. Here we present FFPred 3, which is intended for assigning Gene Ontology terms to human protein chains, when homology with characterized proteins can provide little aid. Predictions are made by scanning the input sequences against an array of Support Vector Machines (SVMs), each examining the relationship between protein function and biophysical attributes describing secondary structure, transmembrane helices, intrinsically disordered regions, signal peptides and other motifs. This update features a larger SVM library that extends its coverage to the cellular component sub-ontology for the first time, prompted by the establishment of a dedicated evaluation category within the Critical Assessment of Functional Annotation. The effectiveness of this approach is demonstrated through benchmarking experiments, and its usefulness is illustrated by analysing the potential functional consequences of alternative splicing in human and their relationship to patterns of biological features. Nature Publishing Group 2016-08-26 /pmc/articles/PMC4999993/ /pubmed/27561554 http://dx.doi.org/10.1038/srep31865 Text en Copyright © 2016, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Cozzetto, Domenico
Minneci, Federico
Currant, Hannah
Jones, David T.
FFPred 3: feature-based function prediction for all Gene Ontology domains
title FFPred 3: feature-based function prediction for all Gene Ontology domains
title_full FFPred 3: feature-based function prediction for all Gene Ontology domains
title_fullStr FFPred 3: feature-based function prediction for all Gene Ontology domains
title_full_unstemmed FFPred 3: feature-based function prediction for all Gene Ontology domains
title_short FFPred 3: feature-based function prediction for all Gene Ontology domains
title_sort ffpred 3: feature-based function prediction for all gene ontology domains
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4999993/
https://www.ncbi.nlm.nih.gov/pubmed/27561554
http://dx.doi.org/10.1038/srep31865
work_keys_str_mv AT cozzettodomenico ffpred3featurebasedfunctionpredictionforallgeneontologydomains
AT minnecifederico ffpred3featurebasedfunctionpredictionforallgeneontologydomains
AT curranthannah ffpred3featurebasedfunctionpredictionforallgeneontologydomains
AT jonesdavidt ffpred3featurebasedfunctionpredictionforallgeneontologydomains