Cargando…

A classification algorithm based on dynamic ensemble selection to predict mutational patterns of the envelope protein in HIV-infected patients

BACKGROUND: Therapeutics against the envelope (Env) proteins of human immunodeficiency virus type 1 (HIV-1) effectively reduce viral loads in patients. However, due to mutations, new therapy-resistant Env variants frequently emerge. The sites of mutations on Env that appear in each patient are consi...

Descripción completa

Detalles Bibliográficos
Autores principales: Fili, Mohammad, Hu, Guiping, Han, Changze, Kort, Alexa, Trettin, John, Haim, Hillel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10280944/
https://www.ncbi.nlm.nih.gov/pubmed/37337202
http://dx.doi.org/10.1186/s13015-023-00228-0
_version_ 1785060909934182400
author Fili, Mohammad
Hu, Guiping
Han, Changze
Kort, Alexa
Trettin, John
Haim, Hillel
author_facet Fili, Mohammad
Hu, Guiping
Han, Changze
Kort, Alexa
Trettin, John
Haim, Hillel
author_sort Fili, Mohammad
collection PubMed
description BACKGROUND: Therapeutics against the envelope (Env) proteins of human immunodeficiency virus type 1 (HIV-1) effectively reduce viral loads in patients. However, due to mutations, new therapy-resistant Env variants frequently emerge. The sites of mutations on Env that appear in each patient are considered random and unpredictable. Here we developed an algorithm to estimate for each patient the mutational state of each position based on the mutational state of adjacent positions on the three-dimensional structure of the protein. METHODS: We developed a dynamic ensemble selection algorithm designated k-best classifiers. It identifies the best classifiers within the neighborhood of a new observation and applies them to predict the variability state of each observation. To evaluate the algorithm, we applied amino acid sequences of Envs from 300 HIV-1-infected individuals (at least six sequences per patient). For each patient, amino acid variability values at all Env positions were mapped onto the three-dimensional structure of the protein. Then, the variability state of each position was estimated by the variability at adjacent positions of the protein. RESULTS: The proposed algorithm showed higher performance than the base learner and a panel of classification algorithms. The mutational state of positions in the high-mannose patch and CD4-binding site of Env, which are targeted by multiple therapeutics, was predicted well. Importantly, the algorithm outperformed other classification techniques for predicting the variability state at multi-position footprints of therapeutics on Env. CONCLUSIONS: The proposed algorithm applies a dynamic classifier-scoring approach that increases its performance relative to other classification methods. Better understanding of the spatiotemporal patterns of variability across Env may lead to new treatment strategies that are tailored to the unique mutational patterns of each patient. More generally, we propose the algorithm as a new high-performance dynamic ensemble selection technique.
format Online
Article
Text
id pubmed-10280944
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-102809442023-06-21 A classification algorithm based on dynamic ensemble selection to predict mutational patterns of the envelope protein in HIV-infected patients Fili, Mohammad Hu, Guiping Han, Changze Kort, Alexa Trettin, John Haim, Hillel Algorithms Mol Biol Research BACKGROUND: Therapeutics against the envelope (Env) proteins of human immunodeficiency virus type 1 (HIV-1) effectively reduce viral loads in patients. However, due to mutations, new therapy-resistant Env variants frequently emerge. The sites of mutations on Env that appear in each patient are considered random and unpredictable. Here we developed an algorithm to estimate for each patient the mutational state of each position based on the mutational state of adjacent positions on the three-dimensional structure of the protein. METHODS: We developed a dynamic ensemble selection algorithm designated k-best classifiers. It identifies the best classifiers within the neighborhood of a new observation and applies them to predict the variability state of each observation. To evaluate the algorithm, we applied amino acid sequences of Envs from 300 HIV-1-infected individuals (at least six sequences per patient). For each patient, amino acid variability values at all Env positions were mapped onto the three-dimensional structure of the protein. Then, the variability state of each position was estimated by the variability at adjacent positions of the protein. RESULTS: The proposed algorithm showed higher performance than the base learner and a panel of classification algorithms. The mutational state of positions in the high-mannose patch and CD4-binding site of Env, which are targeted by multiple therapeutics, was predicted well. Importantly, the algorithm outperformed other classification techniques for predicting the variability state at multi-position footprints of therapeutics on Env. CONCLUSIONS: The proposed algorithm applies a dynamic classifier-scoring approach that increases its performance relative to other classification methods. Better understanding of the spatiotemporal patterns of variability across Env may lead to new treatment strategies that are tailored to the unique mutational patterns of each patient. More generally, we propose the algorithm as a new high-performance dynamic ensemble selection technique. BioMed Central 2023-06-19 /pmc/articles/PMC10280944/ /pubmed/37337202 http://dx.doi.org/10.1186/s13015-023-00228-0 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Fili, Mohammad
Hu, Guiping
Han, Changze
Kort, Alexa
Trettin, John
Haim, Hillel
A classification algorithm based on dynamic ensemble selection to predict mutational patterns of the envelope protein in HIV-infected patients
title A classification algorithm based on dynamic ensemble selection to predict mutational patterns of the envelope protein in HIV-infected patients
title_full A classification algorithm based on dynamic ensemble selection to predict mutational patterns of the envelope protein in HIV-infected patients
title_fullStr A classification algorithm based on dynamic ensemble selection to predict mutational patterns of the envelope protein in HIV-infected patients
title_full_unstemmed A classification algorithm based on dynamic ensemble selection to predict mutational patterns of the envelope protein in HIV-infected patients
title_short A classification algorithm based on dynamic ensemble selection to predict mutational patterns of the envelope protein in HIV-infected patients
title_sort classification algorithm based on dynamic ensemble selection to predict mutational patterns of the envelope protein in hiv-infected patients
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10280944/
https://www.ncbi.nlm.nih.gov/pubmed/37337202
http://dx.doi.org/10.1186/s13015-023-00228-0
work_keys_str_mv AT filimohammad aclassificationalgorithmbasedondynamicensembleselectiontopredictmutationalpatternsoftheenvelopeproteininhivinfectedpatients
AT huguiping aclassificationalgorithmbasedondynamicensembleselectiontopredictmutationalpatternsoftheenvelopeproteininhivinfectedpatients
AT hanchangze aclassificationalgorithmbasedondynamicensembleselectiontopredictmutationalpatternsoftheenvelopeproteininhivinfectedpatients
AT kortalexa aclassificationalgorithmbasedondynamicensembleselectiontopredictmutationalpatternsoftheenvelopeproteininhivinfectedpatients
AT trettinjohn aclassificationalgorithmbasedondynamicensembleselectiontopredictmutationalpatternsoftheenvelopeproteininhivinfectedpatients
AT haimhillel aclassificationalgorithmbasedondynamicensembleselectiontopredictmutationalpatternsoftheenvelopeproteininhivinfectedpatients
AT filimohammad classificationalgorithmbasedondynamicensembleselectiontopredictmutationalpatternsoftheenvelopeproteininhivinfectedpatients
AT huguiping classificationalgorithmbasedondynamicensembleselectiontopredictmutationalpatternsoftheenvelopeproteininhivinfectedpatients
AT hanchangze classificationalgorithmbasedondynamicensembleselectiontopredictmutationalpatternsoftheenvelopeproteininhivinfectedpatients
AT kortalexa classificationalgorithmbasedondynamicensembleselectiontopredictmutationalpatternsoftheenvelopeproteininhivinfectedpatients
AT trettinjohn classificationalgorithmbasedondynamicensembleselectiontopredictmutationalpatternsoftheenvelopeproteininhivinfectedpatients
AT haimhillel classificationalgorithmbasedondynamicensembleselectiontopredictmutationalpatternsoftheenvelopeproteininhivinfectedpatients