Cargando…

Bagging with CTD – A Novel Signature for the Hierarchical Prediction of Secreted Protein Trafficking in Eukaryotes

Protein trafficking or protein sorting in eukaryotes is a complicated process and is carried out based on the information contained in the protein. Many methods reported prediction of the subcellular location of proteins from sequence information. However, most of these prediction methods use a flat...

Descripción completa

Detalles Bibliográficos
Autores principales: Govindan, Geetha, Nair, Achuthsankar S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4357838/
https://www.ncbi.nlm.nih.gov/pubmed/24316328
http://dx.doi.org/10.1016/j.gpb.2013.07.005
_version_ 1782361209013534720
author Govindan, Geetha
Nair, Achuthsankar S.
author_facet Govindan, Geetha
Nair, Achuthsankar S.
author_sort Govindan, Geetha
collection PubMed
description Protein trafficking or protein sorting in eukaryotes is a complicated process and is carried out based on the information contained in the protein. Many methods reported prediction of the subcellular location of proteins from sequence information. However, most of these prediction methods use a flat structure or parallel architecture to perform prediction. In this work, we introduce ensemble classifiers with features that are extracted directly from full length protein sequences to predict locations in the protein-sorting pathway hierarchically. Sequence driven features, sequence mapped features and sequence autocorrelation features were tested with ensemble learners and their performances were compared. When evaluated by independent data testing, ensemble based-bagging algorithms with sequence feature composition, transition and distribution (CTD) successfully classified two datasets with accuracies greater than 90%. We compared our results with similar published methods, and our method equally performed with the others at two levels in the secreted pathway. This study shows that the feature CTD extracted from protein sequences is effective in capturing biological features among compartments in secreted pathways.
format Online
Article
Text
id pubmed-4357838
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-43578382015-05-06 Bagging with CTD – A Novel Signature for the Hierarchical Prediction of Secreted Protein Trafficking in Eukaryotes Govindan, Geetha Nair, Achuthsankar S. Genomics Proteomics Bioinformatics Method Protein trafficking or protein sorting in eukaryotes is a complicated process and is carried out based on the information contained in the protein. Many methods reported prediction of the subcellular location of proteins from sequence information. However, most of these prediction methods use a flat structure or parallel architecture to perform prediction. In this work, we introduce ensemble classifiers with features that are extracted directly from full length protein sequences to predict locations in the protein-sorting pathway hierarchically. Sequence driven features, sequence mapped features and sequence autocorrelation features were tested with ensemble learners and their performances were compared. When evaluated by independent data testing, ensemble based-bagging algorithms with sequence feature composition, transition and distribution (CTD) successfully classified two datasets with accuracies greater than 90%. We compared our results with similar published methods, and our method equally performed with the others at two levels in the secreted pathway. This study shows that the feature CTD extracted from protein sequences is effective in capturing biological features among compartments in secreted pathways. Elsevier 2013-12 2013-12-06 /pmc/articles/PMC4357838/ /pubmed/24316328 http://dx.doi.org/10.1016/j.gpb.2013.07.005 Text en © 2013 Beijing Institute of Genomics, Chinese Academy of Sciences and Genetics Society of China. Production and hosting by Elsevier B.V. All rights reserved. http://creativecommons.org/licenses/by-nc-sa/3.0/ This is an open access article under the CC BY-NC-SA license (http://creativecommons.org/licenses/by-nc-sa/3.0/).
spellingShingle Method
Govindan, Geetha
Nair, Achuthsankar S.
Bagging with CTD – A Novel Signature for the Hierarchical Prediction of Secreted Protein Trafficking in Eukaryotes
title Bagging with CTD – A Novel Signature for the Hierarchical Prediction of Secreted Protein Trafficking in Eukaryotes
title_full Bagging with CTD – A Novel Signature for the Hierarchical Prediction of Secreted Protein Trafficking in Eukaryotes
title_fullStr Bagging with CTD – A Novel Signature for the Hierarchical Prediction of Secreted Protein Trafficking in Eukaryotes
title_full_unstemmed Bagging with CTD – A Novel Signature for the Hierarchical Prediction of Secreted Protein Trafficking in Eukaryotes
title_short Bagging with CTD – A Novel Signature for the Hierarchical Prediction of Secreted Protein Trafficking in Eukaryotes
title_sort bagging with ctd – a novel signature for the hierarchical prediction of secreted protein trafficking in eukaryotes
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4357838/
https://www.ncbi.nlm.nih.gov/pubmed/24316328
http://dx.doi.org/10.1016/j.gpb.2013.07.005
work_keys_str_mv AT govindangeetha baggingwithctdanovelsignatureforthehierarchicalpredictionofsecretedproteintraffickingineukaryotes
AT nairachuthsankars baggingwithctdanovelsignatureforthehierarchicalpredictionofsecretedproteintraffickingineukaryotes