Cargando…
A classification algorithm based on dynamic ensemble selection to predict mutational patterns of the envelope protein in HIV-infected patients
BACKGROUND: Therapeutics against the envelope (Env) proteins of human immunodeficiency virus type 1 (HIV-1) effectively reduce viral loads in patients. However, due to mutations, new therapy-resistant Env variants frequently emerge. The sites of mutations on Env that appear in each patient are consi...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10280944/ https://www.ncbi.nlm.nih.gov/pubmed/37337202 http://dx.doi.org/10.1186/s13015-023-00228-0 |
_version_ | 1785060909934182400 |
---|---|
author | Fili, Mohammad Hu, Guiping Han, Changze Kort, Alexa Trettin, John Haim, Hillel |
author_facet | Fili, Mohammad Hu, Guiping Han, Changze Kort, Alexa Trettin, John Haim, Hillel |
author_sort | Fili, Mohammad |
collection | PubMed |
description | BACKGROUND: Therapeutics against the envelope (Env) proteins of human immunodeficiency virus type 1 (HIV-1) effectively reduce viral loads in patients. However, due to mutations, new therapy-resistant Env variants frequently emerge. The sites of mutations on Env that appear in each patient are considered random and unpredictable. Here we developed an algorithm to estimate for each patient the mutational state of each position based on the mutational state of adjacent positions on the three-dimensional structure of the protein. METHODS: We developed a dynamic ensemble selection algorithm designated k-best classifiers. It identifies the best classifiers within the neighborhood of a new observation and applies them to predict the variability state of each observation. To evaluate the algorithm, we applied amino acid sequences of Envs from 300 HIV-1-infected individuals (at least six sequences per patient). For each patient, amino acid variability values at all Env positions were mapped onto the three-dimensional structure of the protein. Then, the variability state of each position was estimated by the variability at adjacent positions of the protein. RESULTS: The proposed algorithm showed higher performance than the base learner and a panel of classification algorithms. The mutational state of positions in the high-mannose patch and CD4-binding site of Env, which are targeted by multiple therapeutics, was predicted well. Importantly, the algorithm outperformed other classification techniques for predicting the variability state at multi-position footprints of therapeutics on Env. CONCLUSIONS: The proposed algorithm applies a dynamic classifier-scoring approach that increases its performance relative to other classification methods. Better understanding of the spatiotemporal patterns of variability across Env may lead to new treatment strategies that are tailored to the unique mutational patterns of each patient. More generally, we propose the algorithm as a new high-performance dynamic ensemble selection technique. |
format | Online Article Text |
id | pubmed-10280944 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-102809442023-06-21 A classification algorithm based on dynamic ensemble selection to predict mutational patterns of the envelope protein in HIV-infected patients Fili, Mohammad Hu, Guiping Han, Changze Kort, Alexa Trettin, John Haim, Hillel Algorithms Mol Biol Research BACKGROUND: Therapeutics against the envelope (Env) proteins of human immunodeficiency virus type 1 (HIV-1) effectively reduce viral loads in patients. However, due to mutations, new therapy-resistant Env variants frequently emerge. The sites of mutations on Env that appear in each patient are considered random and unpredictable. Here we developed an algorithm to estimate for each patient the mutational state of each position based on the mutational state of adjacent positions on the three-dimensional structure of the protein. METHODS: We developed a dynamic ensemble selection algorithm designated k-best classifiers. It identifies the best classifiers within the neighborhood of a new observation and applies them to predict the variability state of each observation. To evaluate the algorithm, we applied amino acid sequences of Envs from 300 HIV-1-infected individuals (at least six sequences per patient). For each patient, amino acid variability values at all Env positions were mapped onto the three-dimensional structure of the protein. Then, the variability state of each position was estimated by the variability at adjacent positions of the protein. RESULTS: The proposed algorithm showed higher performance than the base learner and a panel of classification algorithms. The mutational state of positions in the high-mannose patch and CD4-binding site of Env, which are targeted by multiple therapeutics, was predicted well. Importantly, the algorithm outperformed other classification techniques for predicting the variability state at multi-position footprints of therapeutics on Env. CONCLUSIONS: The proposed algorithm applies a dynamic classifier-scoring approach that increases its performance relative to other classification methods. Better understanding of the spatiotemporal patterns of variability across Env may lead to new treatment strategies that are tailored to the unique mutational patterns of each patient. More generally, we propose the algorithm as a new high-performance dynamic ensemble selection technique. BioMed Central 2023-06-19 /pmc/articles/PMC10280944/ /pubmed/37337202 http://dx.doi.org/10.1186/s13015-023-00228-0 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Fili, Mohammad Hu, Guiping Han, Changze Kort, Alexa Trettin, John Haim, Hillel A classification algorithm based on dynamic ensemble selection to predict mutational patterns of the envelope protein in HIV-infected patients |
title | A classification algorithm based on dynamic ensemble selection to predict mutational patterns of the envelope protein in HIV-infected patients |
title_full | A classification algorithm based on dynamic ensemble selection to predict mutational patterns of the envelope protein in HIV-infected patients |
title_fullStr | A classification algorithm based on dynamic ensemble selection to predict mutational patterns of the envelope protein in HIV-infected patients |
title_full_unstemmed | A classification algorithm based on dynamic ensemble selection to predict mutational patterns of the envelope protein in HIV-infected patients |
title_short | A classification algorithm based on dynamic ensemble selection to predict mutational patterns of the envelope protein in HIV-infected patients |
title_sort | classification algorithm based on dynamic ensemble selection to predict mutational patterns of the envelope protein in hiv-infected patients |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10280944/ https://www.ncbi.nlm.nih.gov/pubmed/37337202 http://dx.doi.org/10.1186/s13015-023-00228-0 |
work_keys_str_mv | AT filimohammad aclassificationalgorithmbasedondynamicensembleselectiontopredictmutationalpatternsoftheenvelopeproteininhivinfectedpatients AT huguiping aclassificationalgorithmbasedondynamicensembleselectiontopredictmutationalpatternsoftheenvelopeproteininhivinfectedpatients AT hanchangze aclassificationalgorithmbasedondynamicensembleselectiontopredictmutationalpatternsoftheenvelopeproteininhivinfectedpatients AT kortalexa aclassificationalgorithmbasedondynamicensembleselectiontopredictmutationalpatternsoftheenvelopeproteininhivinfectedpatients AT trettinjohn aclassificationalgorithmbasedondynamicensembleselectiontopredictmutationalpatternsoftheenvelopeproteininhivinfectedpatients AT haimhillel aclassificationalgorithmbasedondynamicensembleselectiontopredictmutationalpatternsoftheenvelopeproteininhivinfectedpatients AT filimohammad classificationalgorithmbasedondynamicensembleselectiontopredictmutationalpatternsoftheenvelopeproteininhivinfectedpatients AT huguiping classificationalgorithmbasedondynamicensembleselectiontopredictmutationalpatternsoftheenvelopeproteininhivinfectedpatients AT hanchangze classificationalgorithmbasedondynamicensembleselectiontopredictmutationalpatternsoftheenvelopeproteininhivinfectedpatients AT kortalexa classificationalgorithmbasedondynamicensembleselectiontopredictmutationalpatternsoftheenvelopeproteininhivinfectedpatients AT trettinjohn classificationalgorithmbasedondynamicensembleselectiontopredictmutationalpatternsoftheenvelopeproteininhivinfectedpatients AT haimhillel classificationalgorithmbasedondynamicensembleselectiontopredictmutationalpatternsoftheenvelopeproteininhivinfectedpatients |