Cargando…

Investigation of Super Learner Methodology on HIV-1 Small Sample: Application on Jaguar Trial Data

Background. Many statistical models have been tested to predict phenotypic or virological response from genotypic data. A statistical framework called Super Learner has been introduced either to compare different methods/learners (discrete Super Learner) or to combine them in a Super Learner predict...

Descripción completa

Detalles Bibliográficos
Autores principales: Houssaïni, Allal, Assoumou, Lambert, Marcelin, Anne Geneviève, Molina, Jean Michel, Calvez, Vincent, Flandre, Philippe
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3324131/
https://www.ncbi.nlm.nih.gov/pubmed/22550568
http://dx.doi.org/10.1155/2012/478467
_version_ 1782229280539803648
author Houssaïni, Allal
Assoumou, Lambert
Marcelin, Anne Geneviève
Molina, Jean Michel
Calvez, Vincent
Flandre, Philippe
author_facet Houssaïni, Allal
Assoumou, Lambert
Marcelin, Anne Geneviève
Molina, Jean Michel
Calvez, Vincent
Flandre, Philippe
author_sort Houssaïni, Allal
collection PubMed
description Background. Many statistical models have been tested to predict phenotypic or virological response from genotypic data. A statistical framework called Super Learner has been introduced either to compare different methods/learners (discrete Super Learner) or to combine them in a Super Learner prediction method. Methods. The Jaguar trial is used to apply the Super Learner framework. The Jaguar study is an “add-on” trial comparing the efficacy of adding didanosine to an on-going failing regimen. Our aim was also to investigate the impact on the use of different cross-validation strategies and different loss functions. Four different repartitions between training set and validations set were tested through two loss functions. Six statistical methods were compared. We assess performance by evaluating R (2) values and accuracy by calculating the rates of patients being correctly classified. Results. Our results indicated that the more recent Super Learner methodology of building a new predictor based on a weighted combination of different methods/learners provided good performance. A simple linear model provided similar results to those of this new predictor. Slight discrepancy arises between the two loss functions investigated, and slight difference arises also between results based on cross-validated risks and results from full dataset. The Super Learner methodology and linear model provided around 80% of patients correctly classified. The difference between the lower and higher rates is around 10 percent. The number of mutations retained in different learners also varys from one to 41. Conclusions. The more recent Super Learner methodology combining the prediction of many learners provided good performance on our small dataset.
format Online
Article
Text
id pubmed-3324131
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-33241312012-05-01 Investigation of Super Learner Methodology on HIV-1 Small Sample: Application on Jaguar Trial Data Houssaïni, Allal Assoumou, Lambert Marcelin, Anne Geneviève Molina, Jean Michel Calvez, Vincent Flandre, Philippe AIDS Res Treat Research Article Background. Many statistical models have been tested to predict phenotypic or virological response from genotypic data. A statistical framework called Super Learner has been introduced either to compare different methods/learners (discrete Super Learner) or to combine them in a Super Learner prediction method. Methods. The Jaguar trial is used to apply the Super Learner framework. The Jaguar study is an “add-on” trial comparing the efficacy of adding didanosine to an on-going failing regimen. Our aim was also to investigate the impact on the use of different cross-validation strategies and different loss functions. Four different repartitions between training set and validations set were tested through two loss functions. Six statistical methods were compared. We assess performance by evaluating R (2) values and accuracy by calculating the rates of patients being correctly classified. Results. Our results indicated that the more recent Super Learner methodology of building a new predictor based on a weighted combination of different methods/learners provided good performance. A simple linear model provided similar results to those of this new predictor. Slight discrepancy arises between the two loss functions investigated, and slight difference arises also between results based on cross-validated risks and results from full dataset. The Super Learner methodology and linear model provided around 80% of patients correctly classified. The difference between the lower and higher rates is around 10 percent. The number of mutations retained in different learners also varys from one to 41. Conclusions. The more recent Super Learner methodology combining the prediction of many learners provided good performance on our small dataset. Hindawi Publishing Corporation 2012 2012-04-03 /pmc/articles/PMC3324131/ /pubmed/22550568 http://dx.doi.org/10.1155/2012/478467 Text en Copyright © 2012 Allal Houssaïni et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Houssaïni, Allal
Assoumou, Lambert
Marcelin, Anne Geneviève
Molina, Jean Michel
Calvez, Vincent
Flandre, Philippe
Investigation of Super Learner Methodology on HIV-1 Small Sample: Application on Jaguar Trial Data
title Investigation of Super Learner Methodology on HIV-1 Small Sample: Application on Jaguar Trial Data
title_full Investigation of Super Learner Methodology on HIV-1 Small Sample: Application on Jaguar Trial Data
title_fullStr Investigation of Super Learner Methodology on HIV-1 Small Sample: Application on Jaguar Trial Data
title_full_unstemmed Investigation of Super Learner Methodology on HIV-1 Small Sample: Application on Jaguar Trial Data
title_short Investigation of Super Learner Methodology on HIV-1 Small Sample: Application on Jaguar Trial Data
title_sort investigation of super learner methodology on hiv-1 small sample: application on jaguar trial data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3324131/
https://www.ncbi.nlm.nih.gov/pubmed/22550568
http://dx.doi.org/10.1155/2012/478467
work_keys_str_mv AT houssainiallal investigationofsuperlearnermethodologyonhiv1smallsampleapplicationonjaguartrialdata
AT assoumoulambert investigationofsuperlearnermethodologyonhiv1smallsampleapplicationonjaguartrialdata
AT marcelinannegenevieve investigationofsuperlearnermethodologyonhiv1smallsampleapplicationonjaguartrialdata
AT molinajeanmichel investigationofsuperlearnermethodologyonhiv1smallsampleapplicationonjaguartrialdata
AT calvezvincent investigationofsuperlearnermethodologyonhiv1smallsampleapplicationonjaguartrialdata
AT flandrephilippe investigationofsuperlearnermethodologyonhiv1smallsampleapplicationonjaguartrialdata