Cargando…

Identification of Bacteriophage Virion Proteins Using Multinomial Naïve Bayes with g-Gap Feature Tree

Bacteriophages, which are tremendously important to the ecology and evolution of bacteria, play a key role in the development of genetic engineering. Bacteriophage virion proteins are essential materials of the infectious viral particles and in charge of several of biological functions. The correct...

Descripción completa

Detalles Bibliográficos
Autores principales: Pan, Yanyuan, Gao, Hui, Lin, Hao, Liu, Zhen, Tang, Lixia, Li, Songtao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6032154/
https://www.ncbi.nlm.nih.gov/pubmed/29914091
http://dx.doi.org/10.3390/ijms19061779
_version_ 1783337448229568512
author Pan, Yanyuan
Gao, Hui
Lin, Hao
Liu, Zhen
Tang, Lixia
Li, Songtao
author_facet Pan, Yanyuan
Gao, Hui
Lin, Hao
Liu, Zhen
Tang, Lixia
Li, Songtao
author_sort Pan, Yanyuan
collection PubMed
description Bacteriophages, which are tremendously important to the ecology and evolution of bacteria, play a key role in the development of genetic engineering. Bacteriophage virion proteins are essential materials of the infectious viral particles and in charge of several of biological functions. The correct identification of bacteriophage virion proteins is of great importance for understanding both life at the molecular level and genetic evolution. However, few computational methods are available for identifying bacteriophage virion proteins. In this paper, we proposed a new method to predict bacteriophage virion proteins using a Multinomial Naïve Bayes classification model based on discrete feature generated from the g-gap feature tree. The accuracy of the proposed model reaches 98.37% with MCC of 96.27% in 10-fold cross-validation. This result suggests that the proposed method can be a useful approach in identifying bacteriophage virion proteins from sequence information. For the convenience of experimental scientists, a web server (PhagePred) that implements the proposed predictor is available, which can be freely accessed on the Internet.
format Online
Article
Text
id pubmed-6032154
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-60321542018-07-13 Identification of Bacteriophage Virion Proteins Using Multinomial Naïve Bayes with g-Gap Feature Tree Pan, Yanyuan Gao, Hui Lin, Hao Liu, Zhen Tang, Lixia Li, Songtao Int J Mol Sci Article Bacteriophages, which are tremendously important to the ecology and evolution of bacteria, play a key role in the development of genetic engineering. Bacteriophage virion proteins are essential materials of the infectious viral particles and in charge of several of biological functions. The correct identification of bacteriophage virion proteins is of great importance for understanding both life at the molecular level and genetic evolution. However, few computational methods are available for identifying bacteriophage virion proteins. In this paper, we proposed a new method to predict bacteriophage virion proteins using a Multinomial Naïve Bayes classification model based on discrete feature generated from the g-gap feature tree. The accuracy of the proposed model reaches 98.37% with MCC of 96.27% in 10-fold cross-validation. This result suggests that the proposed method can be a useful approach in identifying bacteriophage virion proteins from sequence information. For the convenience of experimental scientists, a web server (PhagePred) that implements the proposed predictor is available, which can be freely accessed on the Internet. MDPI 2018-06-15 /pmc/articles/PMC6032154/ /pubmed/29914091 http://dx.doi.org/10.3390/ijms19061779 Text en © 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Pan, Yanyuan
Gao, Hui
Lin, Hao
Liu, Zhen
Tang, Lixia
Li, Songtao
Identification of Bacteriophage Virion Proteins Using Multinomial Naïve Bayes with g-Gap Feature Tree
title Identification of Bacteriophage Virion Proteins Using Multinomial Naïve Bayes with g-Gap Feature Tree
title_full Identification of Bacteriophage Virion Proteins Using Multinomial Naïve Bayes with g-Gap Feature Tree
title_fullStr Identification of Bacteriophage Virion Proteins Using Multinomial Naïve Bayes with g-Gap Feature Tree
title_full_unstemmed Identification of Bacteriophage Virion Proteins Using Multinomial Naïve Bayes with g-Gap Feature Tree
title_short Identification of Bacteriophage Virion Proteins Using Multinomial Naïve Bayes with g-Gap Feature Tree
title_sort identification of bacteriophage virion proteins using multinomial naïve bayes with g-gap feature tree
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6032154/
https://www.ncbi.nlm.nih.gov/pubmed/29914091
http://dx.doi.org/10.3390/ijms19061779
work_keys_str_mv AT panyanyuan identificationofbacteriophagevirionproteinsusingmultinomialnaivebayeswithggapfeaturetree
AT gaohui identificationofbacteriophagevirionproteinsusingmultinomialnaivebayeswithggapfeaturetree
AT linhao identificationofbacteriophagevirionproteinsusingmultinomialnaivebayeswithggapfeaturetree
AT liuzhen identificationofbacteriophagevirionproteinsusingmultinomialnaivebayeswithggapfeaturetree
AT tanglixia identificationofbacteriophagevirionproteinsusingmultinomialnaivebayeswithggapfeaturetree
AT lisongtao identificationofbacteriophagevirionproteinsusingmultinomialnaivebayeswithggapfeaturetree