Cargando…

BacHbpred: Support Vector Machine Methods for the Prediction of Bacterial Hemoglobin-Like Proteins

The recent upsurge in microbial genome data has revealed that hemoglobin-like (HbL) proteins may be widely distributed among bacteria and that some organisms may carry more than one HbL encoding gene. However, the discovery of HbL proteins has been limited to a small number of bacteria only. This st...

Descripción completa

Detalles Bibliográficos
Autores principales: Selvaraj, MuthuKrishnan, Puri, Munish, Dikshit, Kanak L., Lefevre, Christophe
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4789356/
https://www.ncbi.nlm.nih.gov/pubmed/27034664
http://dx.doi.org/10.1155/2016/8150784
_version_ 1782420839042383872
author Selvaraj, MuthuKrishnan
Puri, Munish
Dikshit, Kanak L.
Lefevre, Christophe
author_facet Selvaraj, MuthuKrishnan
Puri, Munish
Dikshit, Kanak L.
Lefevre, Christophe
author_sort Selvaraj, MuthuKrishnan
collection PubMed
description The recent upsurge in microbial genome data has revealed that hemoglobin-like (HbL) proteins may be widely distributed among bacteria and that some organisms may carry more than one HbL encoding gene. However, the discovery of HbL proteins has been limited to a small number of bacteria only. This study describes the prediction of HbL proteins and their domain classification using a machine learning approach. Support vector machine (SVM) models were developed for predicting HbL proteins based upon amino acid composition (AC), dipeptide composition (DC), hybrid method (AC + DC), and position specific scoring matrix (PSSM). In addition, we introduce for the first time a new prediction method based on max to min amino acid residue (MM) profiles. The average accuracy, standard deviation (SD), false positive rate (FPR), confusion matrix, and receiver operating characteristic (ROC) were analyzed. We also compared the performance of our proposed models in homology detection databases. The performance of the different approaches was estimated using fivefold cross-validation techniques. Prediction accuracy was further investigated through confusion matrix and ROC curve analysis. All experimental results indicate that the proposed BacHbpred can be a perspective predictor for determination of HbL related proteins. BacHbpred, a web tool, has been developed for HbL prediction.
format Online
Article
Text
id pubmed-4789356
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-47893562016-03-31 BacHbpred: Support Vector Machine Methods for the Prediction of Bacterial Hemoglobin-Like Proteins Selvaraj, MuthuKrishnan Puri, Munish Dikshit, Kanak L. Lefevre, Christophe Adv Bioinformatics Research Article The recent upsurge in microbial genome data has revealed that hemoglobin-like (HbL) proteins may be widely distributed among bacteria and that some organisms may carry more than one HbL encoding gene. However, the discovery of HbL proteins has been limited to a small number of bacteria only. This study describes the prediction of HbL proteins and their domain classification using a machine learning approach. Support vector machine (SVM) models were developed for predicting HbL proteins based upon amino acid composition (AC), dipeptide composition (DC), hybrid method (AC + DC), and position specific scoring matrix (PSSM). In addition, we introduce for the first time a new prediction method based on max to min amino acid residue (MM) profiles. The average accuracy, standard deviation (SD), false positive rate (FPR), confusion matrix, and receiver operating characteristic (ROC) were analyzed. We also compared the performance of our proposed models in homology detection databases. The performance of the different approaches was estimated using fivefold cross-validation techniques. Prediction accuracy was further investigated through confusion matrix and ROC curve analysis. All experimental results indicate that the proposed BacHbpred can be a perspective predictor for determination of HbL related proteins. BacHbpred, a web tool, has been developed for HbL prediction. Hindawi Publishing Corporation 2016 2016-02-29 /pmc/articles/PMC4789356/ /pubmed/27034664 http://dx.doi.org/10.1155/2016/8150784 Text en Copyright © 2016 MuthuKrishnan Selvaraj et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Selvaraj, MuthuKrishnan
Puri, Munish
Dikshit, Kanak L.
Lefevre, Christophe
BacHbpred: Support Vector Machine Methods for the Prediction of Bacterial Hemoglobin-Like Proteins
title BacHbpred: Support Vector Machine Methods for the Prediction of Bacterial Hemoglobin-Like Proteins
title_full BacHbpred: Support Vector Machine Methods for the Prediction of Bacterial Hemoglobin-Like Proteins
title_fullStr BacHbpred: Support Vector Machine Methods for the Prediction of Bacterial Hemoglobin-Like Proteins
title_full_unstemmed BacHbpred: Support Vector Machine Methods for the Prediction of Bacterial Hemoglobin-Like Proteins
title_short BacHbpred: Support Vector Machine Methods for the Prediction of Bacterial Hemoglobin-Like Proteins
title_sort bachbpred: support vector machine methods for the prediction of bacterial hemoglobin-like proteins
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4789356/
https://www.ncbi.nlm.nih.gov/pubmed/27034664
http://dx.doi.org/10.1155/2016/8150784
work_keys_str_mv AT selvarajmuthukrishnan bachbpredsupportvectormachinemethodsforthepredictionofbacterialhemoglobinlikeproteins
AT purimunish bachbpredsupportvectormachinemethodsforthepredictionofbacterialhemoglobinlikeproteins
AT dikshitkanakl bachbpredsupportvectormachinemethodsforthepredictionofbacterialhemoglobinlikeproteins
AT lefevrechristophe bachbpredsupportvectormachinemethodsforthepredictionofbacterialhemoglobinlikeproteins