Cargando…
PVPred-SCM: Improved Prediction and Analysis of Phage Virion Proteins Using a Scoring Card Method
Although, existing methods have been successful in predicting phage (or bacteriophage) virion proteins (PVPs) using various types of protein features and complex classifiers, such as support vector machine and naïve Bayes, these two methods do not allow interpretability. However, the characterizatio...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7072630/ https://www.ncbi.nlm.nih.gov/pubmed/32028709 http://dx.doi.org/10.3390/cells9020353 |
_version_ | 1783506451267846144 |
---|---|
author | Charoenkwan, Phasit Kanthawong, Sakawrat Schaduangrat, Nalini Yana, Janchai Shoombuatong, Watshara |
author_facet | Charoenkwan, Phasit Kanthawong, Sakawrat Schaduangrat, Nalini Yana, Janchai Shoombuatong, Watshara |
author_sort | Charoenkwan, Phasit |
collection | PubMed |
description | Although, existing methods have been successful in predicting phage (or bacteriophage) virion proteins (PVPs) using various types of protein features and complex classifiers, such as support vector machine and naïve Bayes, these two methods do not allow interpretability. However, the characterization and analysis of PVPs might be of great significance to understanding the molecular mechanisms of bacteriophage genetics and the development of antibacterial drugs. Hence, we herein proposed a novel method (PVPred-SCM) based on the scoring card method (SCM) in conjunction with dipeptide composition to identify and characterize PVPs. In PVPred-SCM, the propensity scores of 400 dipeptides were calculated using the statistical discrimination approach. Rigorous independent validation test showed that PVPred-SCM utilizing only dipeptide composition yielded an accuracy of 77.56%, indicating that PVPred-SCM performed well relative to the state-of-the-art method utilizing a number of protein features. Furthermore, the propensity scores of dipeptides were used to provide insights into the biochemical and biophysical properties of PVPs. Upon comparison, it was found that PVPred-SCM was superior to the existing methods considering its simplicity, interpretability, and implementation. Finally, in an effort to facilitate high-throughput prediction of PVPs, we provided a user-friendly web-server for identifying the likelihood of whether or not these sequences are PVPs. It is anticipated that PVPred-SCM will become a useful tool or at least a complementary existing method for predicting and analyzing PVPs. |
format | Online Article Text |
id | pubmed-7072630 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-70726302020-03-19 PVPred-SCM: Improved Prediction and Analysis of Phage Virion Proteins Using a Scoring Card Method Charoenkwan, Phasit Kanthawong, Sakawrat Schaduangrat, Nalini Yana, Janchai Shoombuatong, Watshara Cells Article Although, existing methods have been successful in predicting phage (or bacteriophage) virion proteins (PVPs) using various types of protein features and complex classifiers, such as support vector machine and naïve Bayes, these two methods do not allow interpretability. However, the characterization and analysis of PVPs might be of great significance to understanding the molecular mechanisms of bacteriophage genetics and the development of antibacterial drugs. Hence, we herein proposed a novel method (PVPred-SCM) based on the scoring card method (SCM) in conjunction with dipeptide composition to identify and characterize PVPs. In PVPred-SCM, the propensity scores of 400 dipeptides were calculated using the statistical discrimination approach. Rigorous independent validation test showed that PVPred-SCM utilizing only dipeptide composition yielded an accuracy of 77.56%, indicating that PVPred-SCM performed well relative to the state-of-the-art method utilizing a number of protein features. Furthermore, the propensity scores of dipeptides were used to provide insights into the biochemical and biophysical properties of PVPs. Upon comparison, it was found that PVPred-SCM was superior to the existing methods considering its simplicity, interpretability, and implementation. Finally, in an effort to facilitate high-throughput prediction of PVPs, we provided a user-friendly web-server for identifying the likelihood of whether or not these sequences are PVPs. It is anticipated that PVPred-SCM will become a useful tool or at least a complementary existing method for predicting and analyzing PVPs. MDPI 2020-02-03 /pmc/articles/PMC7072630/ /pubmed/32028709 http://dx.doi.org/10.3390/cells9020353 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Charoenkwan, Phasit Kanthawong, Sakawrat Schaduangrat, Nalini Yana, Janchai Shoombuatong, Watshara PVPred-SCM: Improved Prediction and Analysis of Phage Virion Proteins Using a Scoring Card Method |
title | PVPred-SCM: Improved Prediction and Analysis of Phage Virion Proteins Using a Scoring Card Method |
title_full | PVPred-SCM: Improved Prediction and Analysis of Phage Virion Proteins Using a Scoring Card Method |
title_fullStr | PVPred-SCM: Improved Prediction and Analysis of Phage Virion Proteins Using a Scoring Card Method |
title_full_unstemmed | PVPred-SCM: Improved Prediction and Analysis of Phage Virion Proteins Using a Scoring Card Method |
title_short | PVPred-SCM: Improved Prediction and Analysis of Phage Virion Proteins Using a Scoring Card Method |
title_sort | pvpred-scm: improved prediction and analysis of phage virion proteins using a scoring card method |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7072630/ https://www.ncbi.nlm.nih.gov/pubmed/32028709 http://dx.doi.org/10.3390/cells9020353 |
work_keys_str_mv | AT charoenkwanphasit pvpredscmimprovedpredictionandanalysisofphagevirionproteinsusingascoringcardmethod AT kanthawongsakawrat pvpredscmimprovedpredictionandanalysisofphagevirionproteinsusingascoringcardmethod AT schaduangratnalini pvpredscmimprovedpredictionandanalysisofphagevirionproteinsusingascoringcardmethod AT yanajanchai pvpredscmimprovedpredictionandanalysisofphagevirionproteinsusingascoringcardmethod AT shoombuatongwatshara pvpredscmimprovedpredictionandanalysisofphagevirionproteinsusingascoringcardmethod |