Cargando…

Quantitative prediction of integrase inhibitor resistance from genotype through consensus linear regression modeling

BACKGROUND: Integrase inhibitors (INI) form a new drug class in the treatment of HIV-1 patients. We developed a linear regression modeling approach to make a quantitative raltegravir (RAL) resistance phenotype prediction, as Fold Change in IC50 against a wild type virus, from mutations in the integr...

Descripción completa

Detalles Bibliográficos
Autores principales: Van der Borght, Koen, Verheyen, Ann, Feyaerts, Maxim, Van Wesenbeeck, Liesbeth, Verlinden, Yvan, Van Craenenbroeck, Elke, van Vlijmen, Herman
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3551713/
https://www.ncbi.nlm.nih.gov/pubmed/23282253
http://dx.doi.org/10.1186/1743-422X-10-8
_version_ 1782256598412951552
author Van der Borght, Koen
Verheyen, Ann
Feyaerts, Maxim
Van Wesenbeeck, Liesbeth
Verlinden, Yvan
Van Craenenbroeck, Elke
van Vlijmen, Herman
author_facet Van der Borght, Koen
Verheyen, Ann
Feyaerts, Maxim
Van Wesenbeeck, Liesbeth
Verlinden, Yvan
Van Craenenbroeck, Elke
van Vlijmen, Herman
author_sort Van der Borght, Koen
collection PubMed
description BACKGROUND: Integrase inhibitors (INI) form a new drug class in the treatment of HIV-1 patients. We developed a linear regression modeling approach to make a quantitative raltegravir (RAL) resistance phenotype prediction, as Fold Change in IC50 against a wild type virus, from mutations in the integrase genotype. METHODS: We developed a clonal genotype-phenotype database with 991 clones from 153 clinical isolates of INI naïve and RAL treated patients, and 28 site-directed mutants. We did the development of the RAL linear regression model in two stages, employing a genetic algorithm (GA) to select integrase mutations by consensus. First, we ran multiple GAs to generate first order linear regression models (GA models) that were stochastically optimized to reach a goal R(2 )accuracy, and consisted of a fixed-length subset of integrase mutations to estimate INI resistance. Secondly, we derived a consensus linear regression model in a forward stepwise regression procedure, considering integrase mutations or mutation pairs by descending prevalence in the GA models. RESULTS: The most frequently occurring mutations in the GA models were 92Q, 97A, 143R and 155H (all 100%), 143G (90%), 148H/R (89%), 148K (88%), 151I (81%), 121Y (75%), 143C (72%), and 74M (69%). The RAL second order model contained 30 single mutations and five mutation pairs (p < 0.01): 143C/R&97A, 155H&97A/151I and 74M&151I. The R(2 )performance of this model on the clonal training data was 0.97, and 0.78 on an unseen population genotype-phenotype dataset of 171 clinical isolates from RAL treated and INI naïve patients. CONCLUSIONS: We describe a systematic approach to derive a model for predicting INI resistance from a limited amount of clonal samples. Our RAL second order model is made available as an Additional file for calculating a resistance phenotype as the sum of integrase mutations and mutation pairs.
format Online
Article
Text
id pubmed-3551713
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35517132013-01-24 Quantitative prediction of integrase inhibitor resistance from genotype through consensus linear regression modeling Van der Borght, Koen Verheyen, Ann Feyaerts, Maxim Van Wesenbeeck, Liesbeth Verlinden, Yvan Van Craenenbroeck, Elke van Vlijmen, Herman Virol J Methodology BACKGROUND: Integrase inhibitors (INI) form a new drug class in the treatment of HIV-1 patients. We developed a linear regression modeling approach to make a quantitative raltegravir (RAL) resistance phenotype prediction, as Fold Change in IC50 against a wild type virus, from mutations in the integrase genotype. METHODS: We developed a clonal genotype-phenotype database with 991 clones from 153 clinical isolates of INI naïve and RAL treated patients, and 28 site-directed mutants. We did the development of the RAL linear regression model in two stages, employing a genetic algorithm (GA) to select integrase mutations by consensus. First, we ran multiple GAs to generate first order linear regression models (GA models) that were stochastically optimized to reach a goal R(2 )accuracy, and consisted of a fixed-length subset of integrase mutations to estimate INI resistance. Secondly, we derived a consensus linear regression model in a forward stepwise regression procedure, considering integrase mutations or mutation pairs by descending prevalence in the GA models. RESULTS: The most frequently occurring mutations in the GA models were 92Q, 97A, 143R and 155H (all 100%), 143G (90%), 148H/R (89%), 148K (88%), 151I (81%), 121Y (75%), 143C (72%), and 74M (69%). The RAL second order model contained 30 single mutations and five mutation pairs (p < 0.01): 143C/R&97A, 155H&97A/151I and 74M&151I. The R(2 )performance of this model on the clonal training data was 0.97, and 0.78 on an unseen population genotype-phenotype dataset of 171 clinical isolates from RAL treated and INI naïve patients. CONCLUSIONS: We describe a systematic approach to derive a model for predicting INI resistance from a limited amount of clonal samples. Our RAL second order model is made available as an Additional file for calculating a resistance phenotype as the sum of integrase mutations and mutation pairs. BioMed Central 2013-01-03 /pmc/articles/PMC3551713/ /pubmed/23282253 http://dx.doi.org/10.1186/1743-422X-10-8 Text en Copyright ©2013 Van der Borght et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology
Van der Borght, Koen
Verheyen, Ann
Feyaerts, Maxim
Van Wesenbeeck, Liesbeth
Verlinden, Yvan
Van Craenenbroeck, Elke
van Vlijmen, Herman
Quantitative prediction of integrase inhibitor resistance from genotype through consensus linear regression modeling
title Quantitative prediction of integrase inhibitor resistance from genotype through consensus linear regression modeling
title_full Quantitative prediction of integrase inhibitor resistance from genotype through consensus linear regression modeling
title_fullStr Quantitative prediction of integrase inhibitor resistance from genotype through consensus linear regression modeling
title_full_unstemmed Quantitative prediction of integrase inhibitor resistance from genotype through consensus linear regression modeling
title_short Quantitative prediction of integrase inhibitor resistance from genotype through consensus linear regression modeling
title_sort quantitative prediction of integrase inhibitor resistance from genotype through consensus linear regression modeling
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3551713/
https://www.ncbi.nlm.nih.gov/pubmed/23282253
http://dx.doi.org/10.1186/1743-422X-10-8
work_keys_str_mv AT vanderborghtkoen quantitativepredictionofintegraseinhibitorresistancefromgenotypethroughconsensuslinearregressionmodeling
AT verheyenann quantitativepredictionofintegraseinhibitorresistancefromgenotypethroughconsensuslinearregressionmodeling
AT feyaertsmaxim quantitativepredictionofintegraseinhibitorresistancefromgenotypethroughconsensuslinearregressionmodeling
AT vanwesenbeeckliesbeth quantitativepredictionofintegraseinhibitorresistancefromgenotypethroughconsensuslinearregressionmodeling
AT verlindenyvan quantitativepredictionofintegraseinhibitorresistancefromgenotypethroughconsensuslinearregressionmodeling
AT vancraenenbroeckelke quantitativepredictionofintegraseinhibitorresistancefromgenotypethroughconsensuslinearregressionmodeling
AT vanvlijmenherman quantitativepredictionofintegraseinhibitorresistancefromgenotypethroughconsensuslinearregressionmodeling