Cargando…
Identifying Phage Virion Proteins by Using Two-Step Feature Selection Methods
Accurate identification of phage virion protein is not only a key step for understanding the function of the phage virion protein but also helpful for further understanding the lysis mechanism of the bacterial cell. Since traditional experimental methods are time-consuming and costly for identifying...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6222849/ https://www.ncbi.nlm.nih.gov/pubmed/30103458 http://dx.doi.org/10.3390/molecules23082000 |
_version_ | 1783369302959718400 |
---|---|
author | Tan, Jiu-Xin Dao, Fu-Ying Lv, Hao Feng, Peng-Mian Ding, Hui |
author_facet | Tan, Jiu-Xin Dao, Fu-Ying Lv, Hao Feng, Peng-Mian Ding, Hui |
author_sort | Tan, Jiu-Xin |
collection | PubMed |
description | Accurate identification of phage virion protein is not only a key step for understanding the function of the phage virion protein but also helpful for further understanding the lysis mechanism of the bacterial cell. Since traditional experimental methods are time-consuming and costly for identifying phage virion proteins, it is extremely urgent to apply machine learning methods to accurately and efficiently identify phage virion proteins. In this work, a support vector machine (SVM) based method was proposed by mixing multiple sets of optimal g-gap dipeptide compositions. The analysis of variance (ANOVA) and the minimal-redundancy-maximal-relevance (mRMR) with an increment feature selection (IFS) were applied to single out the optimal feature set. In the five-fold cross-validation test, the proposed method achieved an overall accuracy of 87.95%. We believe that the proposed method will become an efficient and powerful method for scientists concerning phage virion proteins. |
format | Online Article Text |
id | pubmed-6222849 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-62228492018-11-13 Identifying Phage Virion Proteins by Using Two-Step Feature Selection Methods Tan, Jiu-Xin Dao, Fu-Ying Lv, Hao Feng, Peng-Mian Ding, Hui Molecules Article Accurate identification of phage virion protein is not only a key step for understanding the function of the phage virion protein but also helpful for further understanding the lysis mechanism of the bacterial cell. Since traditional experimental methods are time-consuming and costly for identifying phage virion proteins, it is extremely urgent to apply machine learning methods to accurately and efficiently identify phage virion proteins. In this work, a support vector machine (SVM) based method was proposed by mixing multiple sets of optimal g-gap dipeptide compositions. The analysis of variance (ANOVA) and the minimal-redundancy-maximal-relevance (mRMR) with an increment feature selection (IFS) were applied to single out the optimal feature set. In the five-fold cross-validation test, the proposed method achieved an overall accuracy of 87.95%. We believe that the proposed method will become an efficient and powerful method for scientists concerning phage virion proteins. MDPI 2018-08-10 /pmc/articles/PMC6222849/ /pubmed/30103458 http://dx.doi.org/10.3390/molecules23082000 Text en © 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Tan, Jiu-Xin Dao, Fu-Ying Lv, Hao Feng, Peng-Mian Ding, Hui Identifying Phage Virion Proteins by Using Two-Step Feature Selection Methods |
title | Identifying Phage Virion Proteins by Using Two-Step Feature Selection Methods |
title_full | Identifying Phage Virion Proteins by Using Two-Step Feature Selection Methods |
title_fullStr | Identifying Phage Virion Proteins by Using Two-Step Feature Selection Methods |
title_full_unstemmed | Identifying Phage Virion Proteins by Using Two-Step Feature Selection Methods |
title_short | Identifying Phage Virion Proteins by Using Two-Step Feature Selection Methods |
title_sort | identifying phage virion proteins by using two-step feature selection methods |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6222849/ https://www.ncbi.nlm.nih.gov/pubmed/30103458 http://dx.doi.org/10.3390/molecules23082000 |
work_keys_str_mv | AT tanjiuxin identifyingphagevirionproteinsbyusingtwostepfeatureselectionmethods AT daofuying identifyingphagevirionproteinsbyusingtwostepfeatureselectionmethods AT lvhao identifyingphagevirionproteinsbyusingtwostepfeatureselectionmethods AT fengpengmian identifyingphagevirionproteinsbyusingtwostepfeatureselectionmethods AT dinghui identifyingphagevirionproteinsbyusingtwostepfeatureselectionmethods |