Cargando…

RF_phage virion: Classification of phage virion proteins with a random forest model

Introduction: Phages play essential roles in biological procession, and the virion proteins encoded by the phage genome constitute critical elements of the assembled phage particle. Methods: This study uses machine learning methods to classify phage virion proteins. We proposed a novel approach, RF_...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Yanqin, Li, Zhiyuan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9945117/
https://www.ncbi.nlm.nih.gov/pubmed/36846294
http://dx.doi.org/10.3389/fgene.2022.1103783
_version_ 1784892070473760768
author Zhang, Yanqin
Li, Zhiyuan
author_facet Zhang, Yanqin
Li, Zhiyuan
author_sort Zhang, Yanqin
collection PubMed
description Introduction: Phages play essential roles in biological procession, and the virion proteins encoded by the phage genome constitute critical elements of the assembled phage particle. Methods: This study uses machine learning methods to classify phage virion proteins. We proposed a novel approach, RF_phage virion, for the effective classification of the virion and non-virion proteins. The model uses four protein sequence coding methods as features, and the random forest algorithm was employed to solve the classification problem. Results: The performance of the RF_phage virion model was analyzed by comparing the performance of this algorithm with that of classical machine learning methods. The proposed method achieved a specificity (Sp) of 93.37%%, sensitivity (Sn) of 90.30%, accuracy (Acc) of 91.84%, Matthews correlation coefficient (MCC) of .8371, and an F1 score of .9196.
format Online
Article
Text
id pubmed-9945117
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-99451172023-02-23 RF_phage virion: Classification of phage virion proteins with a random forest model Zhang, Yanqin Li, Zhiyuan Front Genet Genetics Introduction: Phages play essential roles in biological procession, and the virion proteins encoded by the phage genome constitute critical elements of the assembled phage particle. Methods: This study uses machine learning methods to classify phage virion proteins. We proposed a novel approach, RF_phage virion, for the effective classification of the virion and non-virion proteins. The model uses four protein sequence coding methods as features, and the random forest algorithm was employed to solve the classification problem. Results: The performance of the RF_phage virion model was analyzed by comparing the performance of this algorithm with that of classical machine learning methods. The proposed method achieved a specificity (Sp) of 93.37%%, sensitivity (Sn) of 90.30%, accuracy (Acc) of 91.84%, Matthews correlation coefficient (MCC) of .8371, and an F1 score of .9196. Frontiers Media S.A. 2023-02-08 /pmc/articles/PMC9945117/ /pubmed/36846294 http://dx.doi.org/10.3389/fgene.2022.1103783 Text en Copyright © 2023 Zhang and Li. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Zhang, Yanqin
Li, Zhiyuan
RF_phage virion: Classification of phage virion proteins with a random forest model
title RF_phage virion: Classification of phage virion proteins with a random forest model
title_full RF_phage virion: Classification of phage virion proteins with a random forest model
title_fullStr RF_phage virion: Classification of phage virion proteins with a random forest model
title_full_unstemmed RF_phage virion: Classification of phage virion proteins with a random forest model
title_short RF_phage virion: Classification of phage virion proteins with a random forest model
title_sort rf_phage virion: classification of phage virion proteins with a random forest model
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9945117/
https://www.ncbi.nlm.nih.gov/pubmed/36846294
http://dx.doi.org/10.3389/fgene.2022.1103783
work_keys_str_mv AT zhangyanqin rfphagevirionclassificationofphagevirionproteinswitharandomforestmodel
AT lizhiyuan rfphagevirionclassificationofphagevirionproteinswitharandomforestmodel