Cargando…
Gram-positive and gram-negative subcellular localization using rotation forest and physicochemical-based features
BACKGROUND: The functioning of a protein relies on its location in the cell. Therefore, predicting protein subcellular localization is an important step towards protein function prediction. Recent studies have shown that relying on Gene Ontology (GO) for feature extraction can improve the prediction...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4347615/ https://www.ncbi.nlm.nih.gov/pubmed/25734546 http://dx.doi.org/10.1186/1471-2105-16-S4-S1 |
_version_ | 1782359849011511296 |
---|---|
author | Dehzangi, Abdollah Sohrabi, Sohrab Heffernan, Rhys Sharma, Alok Lyons, James Paliwal, Kuldip Sattar, Abdul |
author_facet | Dehzangi, Abdollah Sohrabi, Sohrab Heffernan, Rhys Sharma, Alok Lyons, James Paliwal, Kuldip Sattar, Abdul |
author_sort | Dehzangi, Abdollah |
collection | PubMed |
description | BACKGROUND: The functioning of a protein relies on its location in the cell. Therefore, predicting protein subcellular localization is an important step towards protein function prediction. Recent studies have shown that relying on Gene Ontology (GO) for feature extraction can improve the prediction performance. However, for newly sequenced proteins, the GO is not available. Therefore, for these cases, the prediction performance of GO based methods degrade significantly. RESULTS: In this study, we develop a method to effectively employ physicochemical and evolutionary-based information in the protein sequence. To do this, we propose segmentation based feature extraction method to explore potential discriminatory information based on physicochemical properties of the amino acids to tackle Gram-positive and Gram-negative subcellular localization. We explore our proposed feature extraction techniques using 10 attributes that have been experimentally selected among a wide range of physicochemical attributes. Finally by applying the Rotation Forest classification technique to our extracted features, we enhance Gram-positive and Gram-negative subcellular localization accuracies up to 3.4% better than previous studies which used GO for feature extraction. CONCLUSION: By proposing segmentation based feature extraction method to explore potential discriminatory information based on physicochemical properties of the amino acids as well as using Rotation Forest classification technique, we are able to enhance the Gram-positive and Gram-negative subcellular localization prediction accuracies, significantly. |
format | Online Article Text |
id | pubmed-4347615 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-43476152015-03-19 Gram-positive and gram-negative subcellular localization using rotation forest and physicochemical-based features Dehzangi, Abdollah Sohrabi, Sohrab Heffernan, Rhys Sharma, Alok Lyons, James Paliwal, Kuldip Sattar, Abdul BMC Bioinformatics Research BACKGROUND: The functioning of a protein relies on its location in the cell. Therefore, predicting protein subcellular localization is an important step towards protein function prediction. Recent studies have shown that relying on Gene Ontology (GO) for feature extraction can improve the prediction performance. However, for newly sequenced proteins, the GO is not available. Therefore, for these cases, the prediction performance of GO based methods degrade significantly. RESULTS: In this study, we develop a method to effectively employ physicochemical and evolutionary-based information in the protein sequence. To do this, we propose segmentation based feature extraction method to explore potential discriminatory information based on physicochemical properties of the amino acids to tackle Gram-positive and Gram-negative subcellular localization. We explore our proposed feature extraction techniques using 10 attributes that have been experimentally selected among a wide range of physicochemical attributes. Finally by applying the Rotation Forest classification technique to our extracted features, we enhance Gram-positive and Gram-negative subcellular localization accuracies up to 3.4% better than previous studies which used GO for feature extraction. CONCLUSION: By proposing segmentation based feature extraction method to explore potential discriminatory information based on physicochemical properties of the amino acids as well as using Rotation Forest classification technique, we are able to enhance the Gram-positive and Gram-negative subcellular localization prediction accuracies, significantly. BioMed Central 2015-02-23 /pmc/articles/PMC4347615/ /pubmed/25734546 http://dx.doi.org/10.1186/1471-2105-16-S4-S1 Text en Copyright © 2015 Dehzangi et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Dehzangi, Abdollah Sohrabi, Sohrab Heffernan, Rhys Sharma, Alok Lyons, James Paliwal, Kuldip Sattar, Abdul Gram-positive and gram-negative subcellular localization using rotation forest and physicochemical-based features |
title | Gram-positive and gram-negative subcellular localization using rotation forest and physicochemical-based features |
title_full | Gram-positive and gram-negative subcellular localization using rotation forest and physicochemical-based features |
title_fullStr | Gram-positive and gram-negative subcellular localization using rotation forest and physicochemical-based features |
title_full_unstemmed | Gram-positive and gram-negative subcellular localization using rotation forest and physicochemical-based features |
title_short | Gram-positive and gram-negative subcellular localization using rotation forest and physicochemical-based features |
title_sort | gram-positive and gram-negative subcellular localization using rotation forest and physicochemical-based features |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4347615/ https://www.ncbi.nlm.nih.gov/pubmed/25734546 http://dx.doi.org/10.1186/1471-2105-16-S4-S1 |
work_keys_str_mv | AT dehzangiabdollah grampositiveandgramnegativesubcellularlocalizationusingrotationforestandphysicochemicalbasedfeatures AT sohrabisohrab grampositiveandgramnegativesubcellularlocalizationusingrotationforestandphysicochemicalbasedfeatures AT heffernanrhys grampositiveandgramnegativesubcellularlocalizationusingrotationforestandphysicochemicalbasedfeatures AT sharmaalok grampositiveandgramnegativesubcellularlocalizationusingrotationforestandphysicochemicalbasedfeatures AT lyonsjames grampositiveandgramnegativesubcellularlocalizationusingrotationforestandphysicochemicalbasedfeatures AT paliwalkuldip grampositiveandgramnegativesubcellularlocalizationusingrotationforestandphysicochemicalbasedfeatures AT sattarabdul grampositiveandgramnegativesubcellularlocalizationusingrotationforestandphysicochemicalbasedfeatures |