Cargando…

Accurate prediction of protein enzymatic class by N-to-1 Neural Networks

We present a novel ab initio predictor of protein enzymatic class. The predictor can classify proteins, solely based on their sequences, into one of six classes extracted from the enzyme commission (EC) classification scheme and is trained on a large, curated database of over 6,000 non-redundant pro...

Descripción completa

Detalles Bibliográficos
Autores principales: Volpato, Viola, Adelfio, Alessandro, Pollastri, Gianluca
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3548677/
https://www.ncbi.nlm.nih.gov/pubmed/23368876
http://dx.doi.org/10.1186/1471-2105-14-S1-S11
_version_ 1782256344573673472
author Volpato, Viola
Adelfio, Alessandro
Pollastri, Gianluca
author_facet Volpato, Viola
Adelfio, Alessandro
Pollastri, Gianluca
author_sort Volpato, Viola
collection PubMed
description We present a novel ab initio predictor of protein enzymatic class. The predictor can classify proteins, solely based on their sequences, into one of six classes extracted from the enzyme commission (EC) classification scheme and is trained on a large, curated database of over 6,000 non-redundant proteins which we have assembled in this work. The predictor is powered by an ensemble of N-to-1 Neural Network, a novel architecture which we have recently developed. N-to-1 Neural Networks operate on the full sequence and not on predefined features. All motifs of a predefined length (31 residues in this work) are considered and are compressed by an N-to-1 Neural Network into a feature vector which is automatically determined during training. We test our predictor in 10-fold cross-validation and obtain state of the art results, with a 96% correct classification and 86% generalized correlation. All six classes are predicted with a specificity of at least 80% and false positive rates never exceeding 7%. We are currently investigating enhanced input encoding schemes which include structural information, and are analyzing trained networks to mine motifs that are most informative for the prediction, hence, likely, functionally relevant.
format Online
Article
Text
id pubmed-3548677
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35486772013-02-04 Accurate prediction of protein enzymatic class by N-to-1 Neural Networks Volpato, Viola Adelfio, Alessandro Pollastri, Gianluca BMC Bioinformatics Research We present a novel ab initio predictor of protein enzymatic class. The predictor can classify proteins, solely based on their sequences, into one of six classes extracted from the enzyme commission (EC) classification scheme and is trained on a large, curated database of over 6,000 non-redundant proteins which we have assembled in this work. The predictor is powered by an ensemble of N-to-1 Neural Network, a novel architecture which we have recently developed. N-to-1 Neural Networks operate on the full sequence and not on predefined features. All motifs of a predefined length (31 residues in this work) are considered and are compressed by an N-to-1 Neural Network into a feature vector which is automatically determined during training. We test our predictor in 10-fold cross-validation and obtain state of the art results, with a 96% correct classification and 86% generalized correlation. All six classes are predicted with a specificity of at least 80% and false positive rates never exceeding 7%. We are currently investigating enhanced input encoding schemes which include structural information, and are analyzing trained networks to mine motifs that are most informative for the prediction, hence, likely, functionally relevant. BioMed Central 2013-01-14 /pmc/articles/PMC3548677/ /pubmed/23368876 http://dx.doi.org/10.1186/1471-2105-14-S1-S11 Text en Copyright ©2013 Volpato et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Volpato, Viola
Adelfio, Alessandro
Pollastri, Gianluca
Accurate prediction of protein enzymatic class by N-to-1 Neural Networks
title Accurate prediction of protein enzymatic class by N-to-1 Neural Networks
title_full Accurate prediction of protein enzymatic class by N-to-1 Neural Networks
title_fullStr Accurate prediction of protein enzymatic class by N-to-1 Neural Networks
title_full_unstemmed Accurate prediction of protein enzymatic class by N-to-1 Neural Networks
title_short Accurate prediction of protein enzymatic class by N-to-1 Neural Networks
title_sort accurate prediction of protein enzymatic class by n-to-1 neural networks
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3548677/
https://www.ncbi.nlm.nih.gov/pubmed/23368876
http://dx.doi.org/10.1186/1471-2105-14-S1-S11
work_keys_str_mv AT volpatoviola accuratepredictionofproteinenzymaticclassbynto1neuralnetworks
AT adelfioalessandro accuratepredictionofproteinenzymaticclassbynto1neuralnetworks
AT pollastrigianluca accuratepredictionofproteinenzymaticclassbynto1neuralnetworks