Cargando…

Identifying Phage Virion Proteins by Using Two-Step Feature Selection Methods

Accurate identification of phage virion protein is not only a key step for understanding the function of the phage virion protein but also helpful for further understanding the lysis mechanism of the bacterial cell. Since traditional experimental methods are time-consuming and costly for identifying...

Descripción completa

Detalles Bibliográficos
Autores principales: Tan, Jiu-Xin, Dao, Fu-Ying, Lv, Hao, Feng, Peng-Mian, Ding, Hui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6222849/
https://www.ncbi.nlm.nih.gov/pubmed/30103458
http://dx.doi.org/10.3390/molecules23082000
_version_ 1783369302959718400
author Tan, Jiu-Xin
Dao, Fu-Ying
Lv, Hao
Feng, Peng-Mian
Ding, Hui
author_facet Tan, Jiu-Xin
Dao, Fu-Ying
Lv, Hao
Feng, Peng-Mian
Ding, Hui
author_sort Tan, Jiu-Xin
collection PubMed
description Accurate identification of phage virion protein is not only a key step for understanding the function of the phage virion protein but also helpful for further understanding the lysis mechanism of the bacterial cell. Since traditional experimental methods are time-consuming and costly for identifying phage virion proteins, it is extremely urgent to apply machine learning methods to accurately and efficiently identify phage virion proteins. In this work, a support vector machine (SVM) based method was proposed by mixing multiple sets of optimal g-gap dipeptide compositions. The analysis of variance (ANOVA) and the minimal-redundancy-maximal-relevance (mRMR) with an increment feature selection (IFS) were applied to single out the optimal feature set. In the five-fold cross-validation test, the proposed method achieved an overall accuracy of 87.95%. We believe that the proposed method will become an efficient and powerful method for scientists concerning phage virion proteins.
format Online
Article
Text
id pubmed-6222849
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-62228492018-11-13 Identifying Phage Virion Proteins by Using Two-Step Feature Selection Methods Tan, Jiu-Xin Dao, Fu-Ying Lv, Hao Feng, Peng-Mian Ding, Hui Molecules Article Accurate identification of phage virion protein is not only a key step for understanding the function of the phage virion protein but also helpful for further understanding the lysis mechanism of the bacterial cell. Since traditional experimental methods are time-consuming and costly for identifying phage virion proteins, it is extremely urgent to apply machine learning methods to accurately and efficiently identify phage virion proteins. In this work, a support vector machine (SVM) based method was proposed by mixing multiple sets of optimal g-gap dipeptide compositions. The analysis of variance (ANOVA) and the minimal-redundancy-maximal-relevance (mRMR) with an increment feature selection (IFS) were applied to single out the optimal feature set. In the five-fold cross-validation test, the proposed method achieved an overall accuracy of 87.95%. We believe that the proposed method will become an efficient and powerful method for scientists concerning phage virion proteins. MDPI 2018-08-10 /pmc/articles/PMC6222849/ /pubmed/30103458 http://dx.doi.org/10.3390/molecules23082000 Text en © 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Tan, Jiu-Xin
Dao, Fu-Ying
Lv, Hao
Feng, Peng-Mian
Ding, Hui
Identifying Phage Virion Proteins by Using Two-Step Feature Selection Methods
title Identifying Phage Virion Proteins by Using Two-Step Feature Selection Methods
title_full Identifying Phage Virion Proteins by Using Two-Step Feature Selection Methods
title_fullStr Identifying Phage Virion Proteins by Using Two-Step Feature Selection Methods
title_full_unstemmed Identifying Phage Virion Proteins by Using Two-Step Feature Selection Methods
title_short Identifying Phage Virion Proteins by Using Two-Step Feature Selection Methods
title_sort identifying phage virion proteins by using two-step feature selection methods
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6222849/
https://www.ncbi.nlm.nih.gov/pubmed/30103458
http://dx.doi.org/10.3390/molecules23082000
work_keys_str_mv AT tanjiuxin identifyingphagevirionproteinsbyusingtwostepfeatureselectionmethods
AT daofuying identifyingphagevirionproteinsbyusingtwostepfeatureselectionmethods
AT lvhao identifyingphagevirionproteinsbyusingtwostepfeatureselectionmethods
AT fengpengmian identifyingphagevirionproteinsbyusingtwostepfeatureselectionmethods
AT dinghui identifyingphagevirionproteinsbyusingtwostepfeatureselectionmethods