Cargando…

PSO-LocBact: A Consensus Method for Optimizing Multiple Classifier Results for Predicting the Subcellular Localization of Bacterial Proteins

Several computational approaches for predicting subcellular localization have been developed and proposed. These approaches provide diverse performance because of their different combinations of protein features, training datasets, training strategies, and computational machine learning algorithms....

Descripción completa

Detalles Bibliográficos
Autores principales: Lertampaiporn, Supatcha, Nuannimnoi, Sirapop, Vorapreeda, Tayvich, Chokesajjawatee, Nipa, Visessanguan, Wonnop, Thammarongtham, Chinae
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6925685/
https://www.ncbi.nlm.nih.gov/pubmed/31886228
http://dx.doi.org/10.1155/2019/5617153
_version_ 1783481951476252672
author Lertampaiporn, Supatcha
Nuannimnoi, Sirapop
Vorapreeda, Tayvich
Chokesajjawatee, Nipa
Visessanguan, Wonnop
Thammarongtham, Chinae
author_facet Lertampaiporn, Supatcha
Nuannimnoi, Sirapop
Vorapreeda, Tayvich
Chokesajjawatee, Nipa
Visessanguan, Wonnop
Thammarongtham, Chinae
author_sort Lertampaiporn, Supatcha
collection PubMed
description Several computational approaches for predicting subcellular localization have been developed and proposed. These approaches provide diverse performance because of their different combinations of protein features, training datasets, training strategies, and computational machine learning algorithms. In some cases, these tools may yield inconsistent and conflicting prediction results. It is important to consider such conflicting or contradictory predictions from multiple prediction programs during protein annotation, especially in the case of a multiclass classification problem such as subcellular localization. Hence, to address this issue, this work proposes the use of the particle swarm optimization (PSO) algorithm to combine the prediction outputs from multiple different subcellular localization predictors with the aim of integrating diverse prediction models to enhance the final predictions. Herein, we present PSO-LocBact, a consensus classifier based on PSO that can be used to combine the strengths of several preexisting protein localization predictors specially designed for bacteria. Our experimental results indicate that the proposed method can resolve inconsistency problems in subcellular localization prediction for both Gram-negative and Gram-positive bacterial proteins. The average accuracy achieved on each test dataset is over 98%, higher than that achieved with any individual predictor.
format Online
Article
Text
id pubmed-6925685
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-69256852019-12-29 PSO-LocBact: A Consensus Method for Optimizing Multiple Classifier Results for Predicting the Subcellular Localization of Bacterial Proteins Lertampaiporn, Supatcha Nuannimnoi, Sirapop Vorapreeda, Tayvich Chokesajjawatee, Nipa Visessanguan, Wonnop Thammarongtham, Chinae Biomed Res Int Research Article Several computational approaches for predicting subcellular localization have been developed and proposed. These approaches provide diverse performance because of their different combinations of protein features, training datasets, training strategies, and computational machine learning algorithms. In some cases, these tools may yield inconsistent and conflicting prediction results. It is important to consider such conflicting or contradictory predictions from multiple prediction programs during protein annotation, especially in the case of a multiclass classification problem such as subcellular localization. Hence, to address this issue, this work proposes the use of the particle swarm optimization (PSO) algorithm to combine the prediction outputs from multiple different subcellular localization predictors with the aim of integrating diverse prediction models to enhance the final predictions. Herein, we present PSO-LocBact, a consensus classifier based on PSO that can be used to combine the strengths of several preexisting protein localization predictors specially designed for bacteria. Our experimental results indicate that the proposed method can resolve inconsistency problems in subcellular localization prediction for both Gram-negative and Gram-positive bacterial proteins. The average accuracy achieved on each test dataset is over 98%, higher than that achieved with any individual predictor. Hindawi 2019-11-19 /pmc/articles/PMC6925685/ /pubmed/31886228 http://dx.doi.org/10.1155/2019/5617153 Text en Copyright © 2019 Supatcha Lertampaiporn et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Lertampaiporn, Supatcha
Nuannimnoi, Sirapop
Vorapreeda, Tayvich
Chokesajjawatee, Nipa
Visessanguan, Wonnop
Thammarongtham, Chinae
PSO-LocBact: A Consensus Method for Optimizing Multiple Classifier Results for Predicting the Subcellular Localization of Bacterial Proteins
title PSO-LocBact: A Consensus Method for Optimizing Multiple Classifier Results for Predicting the Subcellular Localization of Bacterial Proteins
title_full PSO-LocBact: A Consensus Method for Optimizing Multiple Classifier Results for Predicting the Subcellular Localization of Bacterial Proteins
title_fullStr PSO-LocBact: A Consensus Method for Optimizing Multiple Classifier Results for Predicting the Subcellular Localization of Bacterial Proteins
title_full_unstemmed PSO-LocBact: A Consensus Method for Optimizing Multiple Classifier Results for Predicting the Subcellular Localization of Bacterial Proteins
title_short PSO-LocBact: A Consensus Method for Optimizing Multiple Classifier Results for Predicting the Subcellular Localization of Bacterial Proteins
title_sort pso-locbact: a consensus method for optimizing multiple classifier results for predicting the subcellular localization of bacterial proteins
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6925685/
https://www.ncbi.nlm.nih.gov/pubmed/31886228
http://dx.doi.org/10.1155/2019/5617153
work_keys_str_mv AT lertampaipornsupatcha psolocbactaconsensusmethodforoptimizingmultipleclassifierresultsforpredictingthesubcellularlocalizationofbacterialproteins
AT nuannimnoisirapop psolocbactaconsensusmethodforoptimizingmultipleclassifierresultsforpredictingthesubcellularlocalizationofbacterialproteins
AT vorapreedatayvich psolocbactaconsensusmethodforoptimizingmultipleclassifierresultsforpredictingthesubcellularlocalizationofbacterialproteins
AT chokesajjawateenipa psolocbactaconsensusmethodforoptimizingmultipleclassifierresultsforpredictingthesubcellularlocalizationofbacterialproteins
AT visessanguanwonnop psolocbactaconsensusmethodforoptimizingmultipleclassifierresultsforpredictingthesubcellularlocalizationofbacterialproteins
AT thammarongthamchinae psolocbactaconsensusmethodforoptimizingmultipleclassifierresultsforpredictingthesubcellularlocalizationofbacterialproteins