Cargando…

PVPred-SCM: Improved Prediction and Analysis of Phage Virion Proteins Using a Scoring Card Method

Although, existing methods have been successful in predicting phage (or bacteriophage) virion proteins (PVPs) using various types of protein features and complex classifiers, such as support vector machine and naïve Bayes, these two methods do not allow interpretability. However, the characterizatio...

Descripción completa

Detalles Bibliográficos
Autores principales: Charoenkwan, Phasit, Kanthawong, Sakawrat, Schaduangrat, Nalini, Yana, Janchai, Shoombuatong, Watshara
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7072630/
https://www.ncbi.nlm.nih.gov/pubmed/32028709
http://dx.doi.org/10.3390/cells9020353
_version_ 1783506451267846144
author Charoenkwan, Phasit
Kanthawong, Sakawrat
Schaduangrat, Nalini
Yana, Janchai
Shoombuatong, Watshara
author_facet Charoenkwan, Phasit
Kanthawong, Sakawrat
Schaduangrat, Nalini
Yana, Janchai
Shoombuatong, Watshara
author_sort Charoenkwan, Phasit
collection PubMed
description Although, existing methods have been successful in predicting phage (or bacteriophage) virion proteins (PVPs) using various types of protein features and complex classifiers, such as support vector machine and naïve Bayes, these two methods do not allow interpretability. However, the characterization and analysis of PVPs might be of great significance to understanding the molecular mechanisms of bacteriophage genetics and the development of antibacterial drugs. Hence, we herein proposed a novel method (PVPred-SCM) based on the scoring card method (SCM) in conjunction with dipeptide composition to identify and characterize PVPs. In PVPred-SCM, the propensity scores of 400 dipeptides were calculated using the statistical discrimination approach. Rigorous independent validation test showed that PVPred-SCM utilizing only dipeptide composition yielded an accuracy of 77.56%, indicating that PVPred-SCM performed well relative to the state-of-the-art method utilizing a number of protein features. Furthermore, the propensity scores of dipeptides were used to provide insights into the biochemical and biophysical properties of PVPs. Upon comparison, it was found that PVPred-SCM was superior to the existing methods considering its simplicity, interpretability, and implementation. Finally, in an effort to facilitate high-throughput prediction of PVPs, we provided a user-friendly web-server for identifying the likelihood of whether or not these sequences are PVPs. It is anticipated that PVPred-SCM will become a useful tool or at least a complementary existing method for predicting and analyzing PVPs.
format Online
Article
Text
id pubmed-7072630
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-70726302020-03-19 PVPred-SCM: Improved Prediction and Analysis of Phage Virion Proteins Using a Scoring Card Method Charoenkwan, Phasit Kanthawong, Sakawrat Schaduangrat, Nalini Yana, Janchai Shoombuatong, Watshara Cells Article Although, existing methods have been successful in predicting phage (or bacteriophage) virion proteins (PVPs) using various types of protein features and complex classifiers, such as support vector machine and naïve Bayes, these two methods do not allow interpretability. However, the characterization and analysis of PVPs might be of great significance to understanding the molecular mechanisms of bacteriophage genetics and the development of antibacterial drugs. Hence, we herein proposed a novel method (PVPred-SCM) based on the scoring card method (SCM) in conjunction with dipeptide composition to identify and characterize PVPs. In PVPred-SCM, the propensity scores of 400 dipeptides were calculated using the statistical discrimination approach. Rigorous independent validation test showed that PVPred-SCM utilizing only dipeptide composition yielded an accuracy of 77.56%, indicating that PVPred-SCM performed well relative to the state-of-the-art method utilizing a number of protein features. Furthermore, the propensity scores of dipeptides were used to provide insights into the biochemical and biophysical properties of PVPs. Upon comparison, it was found that PVPred-SCM was superior to the existing methods considering its simplicity, interpretability, and implementation. Finally, in an effort to facilitate high-throughput prediction of PVPs, we provided a user-friendly web-server for identifying the likelihood of whether or not these sequences are PVPs. It is anticipated that PVPred-SCM will become a useful tool or at least a complementary existing method for predicting and analyzing PVPs. MDPI 2020-02-03 /pmc/articles/PMC7072630/ /pubmed/32028709 http://dx.doi.org/10.3390/cells9020353 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Charoenkwan, Phasit
Kanthawong, Sakawrat
Schaduangrat, Nalini
Yana, Janchai
Shoombuatong, Watshara
PVPred-SCM: Improved Prediction and Analysis of Phage Virion Proteins Using a Scoring Card Method
title PVPred-SCM: Improved Prediction and Analysis of Phage Virion Proteins Using a Scoring Card Method
title_full PVPred-SCM: Improved Prediction and Analysis of Phage Virion Proteins Using a Scoring Card Method
title_fullStr PVPred-SCM: Improved Prediction and Analysis of Phage Virion Proteins Using a Scoring Card Method
title_full_unstemmed PVPred-SCM: Improved Prediction and Analysis of Phage Virion Proteins Using a Scoring Card Method
title_short PVPred-SCM: Improved Prediction and Analysis of Phage Virion Proteins Using a Scoring Card Method
title_sort pvpred-scm: improved prediction and analysis of phage virion proteins using a scoring card method
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7072630/
https://www.ncbi.nlm.nih.gov/pubmed/32028709
http://dx.doi.org/10.3390/cells9020353
work_keys_str_mv AT charoenkwanphasit pvpredscmimprovedpredictionandanalysisofphagevirionproteinsusingascoringcardmethod
AT kanthawongsakawrat pvpredscmimprovedpredictionandanalysisofphagevirionproteinsusingascoringcardmethod
AT schaduangratnalini pvpredscmimprovedpredictionandanalysisofphagevirionproteinsusingascoringcardmethod
AT yanajanchai pvpredscmimprovedpredictionandanalysisofphagevirionproteinsusingascoringcardmethod
AT shoombuatongwatshara pvpredscmimprovedpredictionandanalysisofphagevirionproteinsusingascoringcardmethod