Cargando…

Gram-positive and gram-negative subcellular localization using rotation forest and physicochemical-based features

BACKGROUND: The functioning of a protein relies on its location in the cell. Therefore, predicting protein subcellular localization is an important step towards protein function prediction. Recent studies have shown that relying on Gene Ontology (GO) for feature extraction can improve the prediction...

Descripción completa

Detalles Bibliográficos
Autores principales: Dehzangi, Abdollah, Sohrabi, Sohrab, Heffernan, Rhys, Sharma, Alok, Lyons, James, Paliwal, Kuldip, Sattar, Abdul
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4347615/
https://www.ncbi.nlm.nih.gov/pubmed/25734546
http://dx.doi.org/10.1186/1471-2105-16-S4-S1
_version_ 1782359849011511296
author Dehzangi, Abdollah
Sohrabi, Sohrab
Heffernan, Rhys
Sharma, Alok
Lyons, James
Paliwal, Kuldip
Sattar, Abdul
author_facet Dehzangi, Abdollah
Sohrabi, Sohrab
Heffernan, Rhys
Sharma, Alok
Lyons, James
Paliwal, Kuldip
Sattar, Abdul
author_sort Dehzangi, Abdollah
collection PubMed
description BACKGROUND: The functioning of a protein relies on its location in the cell. Therefore, predicting protein subcellular localization is an important step towards protein function prediction. Recent studies have shown that relying on Gene Ontology (GO) for feature extraction can improve the prediction performance. However, for newly sequenced proteins, the GO is not available. Therefore, for these cases, the prediction performance of GO based methods degrade significantly. RESULTS: In this study, we develop a method to effectively employ physicochemical and evolutionary-based information in the protein sequence. To do this, we propose segmentation based feature extraction method to explore potential discriminatory information based on physicochemical properties of the amino acids to tackle Gram-positive and Gram-negative subcellular localization. We explore our proposed feature extraction techniques using 10 attributes that have been experimentally selected among a wide range of physicochemical attributes. Finally by applying the Rotation Forest classification technique to our extracted features, we enhance Gram-positive and Gram-negative subcellular localization accuracies up to 3.4% better than previous studies which used GO for feature extraction. CONCLUSION: By proposing segmentation based feature extraction method to explore potential discriminatory information based on physicochemical properties of the amino acids as well as using Rotation Forest classification technique, we are able to enhance the Gram-positive and Gram-negative subcellular localization prediction accuracies, significantly.
format Online
Article
Text
id pubmed-4347615
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-43476152015-03-19 Gram-positive and gram-negative subcellular localization using rotation forest and physicochemical-based features Dehzangi, Abdollah Sohrabi, Sohrab Heffernan, Rhys Sharma, Alok Lyons, James Paliwal, Kuldip Sattar, Abdul BMC Bioinformatics Research BACKGROUND: The functioning of a protein relies on its location in the cell. Therefore, predicting protein subcellular localization is an important step towards protein function prediction. Recent studies have shown that relying on Gene Ontology (GO) for feature extraction can improve the prediction performance. However, for newly sequenced proteins, the GO is not available. Therefore, for these cases, the prediction performance of GO based methods degrade significantly. RESULTS: In this study, we develop a method to effectively employ physicochemical and evolutionary-based information in the protein sequence. To do this, we propose segmentation based feature extraction method to explore potential discriminatory information based on physicochemical properties of the amino acids to tackle Gram-positive and Gram-negative subcellular localization. We explore our proposed feature extraction techniques using 10 attributes that have been experimentally selected among a wide range of physicochemical attributes. Finally by applying the Rotation Forest classification technique to our extracted features, we enhance Gram-positive and Gram-negative subcellular localization accuracies up to 3.4% better than previous studies which used GO for feature extraction. CONCLUSION: By proposing segmentation based feature extraction method to explore potential discriminatory information based on physicochemical properties of the amino acids as well as using Rotation Forest classification technique, we are able to enhance the Gram-positive and Gram-negative subcellular localization prediction accuracies, significantly. BioMed Central 2015-02-23 /pmc/articles/PMC4347615/ /pubmed/25734546 http://dx.doi.org/10.1186/1471-2105-16-S4-S1 Text en Copyright © 2015 Dehzangi et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Dehzangi, Abdollah
Sohrabi, Sohrab
Heffernan, Rhys
Sharma, Alok
Lyons, James
Paliwal, Kuldip
Sattar, Abdul
Gram-positive and gram-negative subcellular localization using rotation forest and physicochemical-based features
title Gram-positive and gram-negative subcellular localization using rotation forest and physicochemical-based features
title_full Gram-positive and gram-negative subcellular localization using rotation forest and physicochemical-based features
title_fullStr Gram-positive and gram-negative subcellular localization using rotation forest and physicochemical-based features
title_full_unstemmed Gram-positive and gram-negative subcellular localization using rotation forest and physicochemical-based features
title_short Gram-positive and gram-negative subcellular localization using rotation forest and physicochemical-based features
title_sort gram-positive and gram-negative subcellular localization using rotation forest and physicochemical-based features
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4347615/
https://www.ncbi.nlm.nih.gov/pubmed/25734546
http://dx.doi.org/10.1186/1471-2105-16-S4-S1
work_keys_str_mv AT dehzangiabdollah grampositiveandgramnegativesubcellularlocalizationusingrotationforestandphysicochemicalbasedfeatures
AT sohrabisohrab grampositiveandgramnegativesubcellularlocalizationusingrotationforestandphysicochemicalbasedfeatures
AT heffernanrhys grampositiveandgramnegativesubcellularlocalizationusingrotationforestandphysicochemicalbasedfeatures
AT sharmaalok grampositiveandgramnegativesubcellularlocalizationusingrotationforestandphysicochemicalbasedfeatures
AT lyonsjames grampositiveandgramnegativesubcellularlocalizationusingrotationforestandphysicochemicalbasedfeatures
AT paliwalkuldip grampositiveandgramnegativesubcellularlocalizationusingrotationforestandphysicochemicalbasedfeatures
AT sattarabdul grampositiveandgramnegativesubcellularlocalizationusingrotationforestandphysicochemicalbasedfeatures