Cargando…
RF_phage virion: Classification of phage virion proteins with a random forest model
Introduction: Phages play essential roles in biological procession, and the virion proteins encoded by the phage genome constitute critical elements of the assembled phage particle. Methods: This study uses machine learning methods to classify phage virion proteins. We proposed a novel approach, RF_...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9945117/ https://www.ncbi.nlm.nih.gov/pubmed/36846294 http://dx.doi.org/10.3389/fgene.2022.1103783 |
_version_ | 1784892070473760768 |
---|---|
author | Zhang, Yanqin Li, Zhiyuan |
author_facet | Zhang, Yanqin Li, Zhiyuan |
author_sort | Zhang, Yanqin |
collection | PubMed |
description | Introduction: Phages play essential roles in biological procession, and the virion proteins encoded by the phage genome constitute critical elements of the assembled phage particle. Methods: This study uses machine learning methods to classify phage virion proteins. We proposed a novel approach, RF_phage virion, for the effective classification of the virion and non-virion proteins. The model uses four protein sequence coding methods as features, and the random forest algorithm was employed to solve the classification problem. Results: The performance of the RF_phage virion model was analyzed by comparing the performance of this algorithm with that of classical machine learning methods. The proposed method achieved a specificity (Sp) of 93.37%%, sensitivity (Sn) of 90.30%, accuracy (Acc) of 91.84%, Matthews correlation coefficient (MCC) of .8371, and an F1 score of .9196. |
format | Online Article Text |
id | pubmed-9945117 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-99451172023-02-23 RF_phage virion: Classification of phage virion proteins with a random forest model Zhang, Yanqin Li, Zhiyuan Front Genet Genetics Introduction: Phages play essential roles in biological procession, and the virion proteins encoded by the phage genome constitute critical elements of the assembled phage particle. Methods: This study uses machine learning methods to classify phage virion proteins. We proposed a novel approach, RF_phage virion, for the effective classification of the virion and non-virion proteins. The model uses four protein sequence coding methods as features, and the random forest algorithm was employed to solve the classification problem. Results: The performance of the RF_phage virion model was analyzed by comparing the performance of this algorithm with that of classical machine learning methods. The proposed method achieved a specificity (Sp) of 93.37%%, sensitivity (Sn) of 90.30%, accuracy (Acc) of 91.84%, Matthews correlation coefficient (MCC) of .8371, and an F1 score of .9196. Frontiers Media S.A. 2023-02-08 /pmc/articles/PMC9945117/ /pubmed/36846294 http://dx.doi.org/10.3389/fgene.2022.1103783 Text en Copyright © 2023 Zhang and Li. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Zhang, Yanqin Li, Zhiyuan RF_phage virion: Classification of phage virion proteins with a random forest model |
title | RF_phage virion: Classification of phage virion proteins with a random forest model |
title_full | RF_phage virion: Classification of phage virion proteins with a random forest model |
title_fullStr | RF_phage virion: Classification of phage virion proteins with a random forest model |
title_full_unstemmed | RF_phage virion: Classification of phage virion proteins with a random forest model |
title_short | RF_phage virion: Classification of phage virion proteins with a random forest model |
title_sort | rf_phage virion: classification of phage virion proteins with a random forest model |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9945117/ https://www.ncbi.nlm.nih.gov/pubmed/36846294 http://dx.doi.org/10.3389/fgene.2022.1103783 |
work_keys_str_mv | AT zhangyanqin rfphagevirionclassificationofphagevirionproteinswitharandomforestmodel AT lizhiyuan rfphagevirionclassificationofphagevirionproteinswitharandomforestmodel |