Cargando…
SCLpredT: Ab initio and homology-based prediction of subcellular localization by N-to-1 neural networks
ABSTRACT: The prediction of protein subcellular localization is a important step towards the prediction of protein function, and considerable effort has gone over the last decade into the development of computational predictors of protein localization. In this article we design a new predictor of pr...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer International Publishing
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3795874/ https://www.ncbi.nlm.nih.gov/pubmed/24133649 http://dx.doi.org/10.1186/2193-1801-2-502 |
_version_ | 1782287436035915776 |
---|---|
author | Adelfio, Alessandro Volpato, Viola Pollastri, Gianluca |
author_facet | Adelfio, Alessandro Volpato, Viola Pollastri, Gianluca |
author_sort | Adelfio, Alessandro |
collection | PubMed |
description | ABSTRACT: The prediction of protein subcellular localization is a important step towards the prediction of protein function, and considerable effort has gone over the last decade into the development of computational predictors of protein localization. In this article we design a new predictor of protein subcellular localization, based on a Machine Learning model (N-to-1 Neural Networks) which we have recently developed. This system, in three versions specialised, respectively, on Plants, Fungi and Animals, has a rich output which incorporates the class “organelle” alongside cytoplasm, nucleus, mitochondria and extracellular, and, additionally, chloroplast in the case of Plants. We investigate the information gain of introducing additional inputs, including predicted secondary structure, and localization information from homologous sequences. To accommodate the latter we design a new algorithm which we present here for the first time. While we do not observe any improvement when including predicted secondary structure, we measure significant overall gains when adding homology information. The final predictor including homology information correctly predicts 74%, 79% and 60% of all proteins in the case of Fungi, Animals and Plants, respectively, and outperforms our previous, state-of-the-art predictor SCLpred, and the popular predictor BaCelLo. We also observe that the contribution of homology information becomes dominant over sequence information for sequence identity values exceeding 50% for Animals and Fungi, and 60% for Plants, confirming that subcellular localization is less conserved than structure. SCLpredT is publicly available at http://distillf.ucd.ie/sclpredt/. Sequence- or template-based predictions can be obtained, and up to 32kbytes of input can be processed in a single submission. |
format | Online Article Text |
id | pubmed-3795874 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Springer International Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-37958742013-10-16 SCLpredT: Ab initio and homology-based prediction of subcellular localization by N-to-1 neural networks Adelfio, Alessandro Volpato, Viola Pollastri, Gianluca Springerplus Research ABSTRACT: The prediction of protein subcellular localization is a important step towards the prediction of protein function, and considerable effort has gone over the last decade into the development of computational predictors of protein localization. In this article we design a new predictor of protein subcellular localization, based on a Machine Learning model (N-to-1 Neural Networks) which we have recently developed. This system, in three versions specialised, respectively, on Plants, Fungi and Animals, has a rich output which incorporates the class “organelle” alongside cytoplasm, nucleus, mitochondria and extracellular, and, additionally, chloroplast in the case of Plants. We investigate the information gain of introducing additional inputs, including predicted secondary structure, and localization information from homologous sequences. To accommodate the latter we design a new algorithm which we present here for the first time. While we do not observe any improvement when including predicted secondary structure, we measure significant overall gains when adding homology information. The final predictor including homology information correctly predicts 74%, 79% and 60% of all proteins in the case of Fungi, Animals and Plants, respectively, and outperforms our previous, state-of-the-art predictor SCLpred, and the popular predictor BaCelLo. We also observe that the contribution of homology information becomes dominant over sequence information for sequence identity values exceeding 50% for Animals and Fungi, and 60% for Plants, confirming that subcellular localization is less conserved than structure. SCLpredT is publicly available at http://distillf.ucd.ie/sclpredt/. Sequence- or template-based predictions can be obtained, and up to 32kbytes of input can be processed in a single submission. Springer International Publishing 2013-10-03 /pmc/articles/PMC3795874/ /pubmed/24133649 http://dx.doi.org/10.1186/2193-1801-2-502 Text en © Adelfio et al.; licensee Springer. 2013 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Adelfio, Alessandro Volpato, Viola Pollastri, Gianluca SCLpredT: Ab initio and homology-based prediction of subcellular localization by N-to-1 neural networks |
title | SCLpredT: Ab initio and homology-based prediction of subcellular localization by N-to-1 neural networks |
title_full | SCLpredT: Ab initio and homology-based prediction of subcellular localization by N-to-1 neural networks |
title_fullStr | SCLpredT: Ab initio and homology-based prediction of subcellular localization by N-to-1 neural networks |
title_full_unstemmed | SCLpredT: Ab initio and homology-based prediction of subcellular localization by N-to-1 neural networks |
title_short | SCLpredT: Ab initio and homology-based prediction of subcellular localization by N-to-1 neural networks |
title_sort | sclpredt: ab initio and homology-based prediction of subcellular localization by n-to-1 neural networks |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3795874/ https://www.ncbi.nlm.nih.gov/pubmed/24133649 http://dx.doi.org/10.1186/2193-1801-2-502 |
work_keys_str_mv | AT adelfioalessandro sclpredtabinitioandhomologybasedpredictionofsubcellularlocalizationbynto1neuralnetworks AT volpatoviola sclpredtabinitioandhomologybasedpredictionofsubcellularlocalizationbynto1neuralnetworks AT pollastrigianluca sclpredtabinitioandhomologybasedpredictionofsubcellularlocalizationbynto1neuralnetworks |