Cargando…

Combining Epidemiologic and Biostatistical Tools to Enhance Variable Selection in HIV Cohort Analyses

BACKGROUND: Variable selection is an important step in building a multivariate regression model for which several methods and statistical packages are available. A comprehensive approach for variable selection in complex multivariate regression analyses within HIV cohorts is explored by utilizing bo...

Descripción completa

Detalles Bibliográficos
Autores principales: Rentsch, Christopher, Bebu, Ionut, Guest, Jodie L., Rimland, David, Agan, Brian K., Marconi, Vincent
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3906149/
https://www.ncbi.nlm.nih.gov/pubmed/24489902
http://dx.doi.org/10.1371/journal.pone.0087352
_version_ 1782301448529248256
author Rentsch, Christopher
Bebu, Ionut
Guest, Jodie L.
Rimland, David
Agan, Brian K.
Marconi, Vincent
author_facet Rentsch, Christopher
Bebu, Ionut
Guest, Jodie L.
Rimland, David
Agan, Brian K.
Marconi, Vincent
author_sort Rentsch, Christopher
collection PubMed
description BACKGROUND: Variable selection is an important step in building a multivariate regression model for which several methods and statistical packages are available. A comprehensive approach for variable selection in complex multivariate regression analyses within HIV cohorts is explored by utilizing both epidemiological and biostatistical procedures. METHODS: Three different methods for variable selection were illustrated in a study comparing survival time between subjects in the Department of Defense’s National History Study and the Atlanta Veterans Affairs Medical Center’s HIV Atlanta VA Cohort Study. The first two methods were stepwise selection procedures, based either on significance tests (Score test), or on information theory (Akaike Information Criterion), while the third method employed a Bayesian argument (Bayesian Model Averaging). RESULTS: All three methods resulted in a similar parsimonious survival model. Three of the covariates previously used in the multivariate model were not included in the final model suggested by the three approaches. When comparing the parsimonious model to the previously published model, there was evidence of less variance in the main survival estimates. CONCLUSIONS: The variable selection approaches considered in this study allowed building a model based on significance tests, on an information criterion, and on averaging models using their posterior probabilities. A parsimonious model that balanced these three approaches was found to provide a better fit than the previously reported model.
format Online
Article
Text
id pubmed-3906149
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-39061492014-01-31 Combining Epidemiologic and Biostatistical Tools to Enhance Variable Selection in HIV Cohort Analyses Rentsch, Christopher Bebu, Ionut Guest, Jodie L. Rimland, David Agan, Brian K. Marconi, Vincent PLoS One Research Article BACKGROUND: Variable selection is an important step in building a multivariate regression model for which several methods and statistical packages are available. A comprehensive approach for variable selection in complex multivariate regression analyses within HIV cohorts is explored by utilizing both epidemiological and biostatistical procedures. METHODS: Three different methods for variable selection were illustrated in a study comparing survival time between subjects in the Department of Defense’s National History Study and the Atlanta Veterans Affairs Medical Center’s HIV Atlanta VA Cohort Study. The first two methods were stepwise selection procedures, based either on significance tests (Score test), or on information theory (Akaike Information Criterion), while the third method employed a Bayesian argument (Bayesian Model Averaging). RESULTS: All three methods resulted in a similar parsimonious survival model. Three of the covariates previously used in the multivariate model were not included in the final model suggested by the three approaches. When comparing the parsimonious model to the previously published model, there was evidence of less variance in the main survival estimates. CONCLUSIONS: The variable selection approaches considered in this study allowed building a model based on significance tests, on an information criterion, and on averaging models using their posterior probabilities. A parsimonious model that balanced these three approaches was found to provide a better fit than the previously reported model. Public Library of Science 2014-01-29 /pmc/articles/PMC3906149/ /pubmed/24489902 http://dx.doi.org/10.1371/journal.pone.0087352 Text en https://creativecommons.org/publicdomain/zero/1.0/ This is an open-access article distributed under the terms of the Creative Commons Public Domain declaration, which stipulates that, once placed in the public domain, this work may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose.
spellingShingle Research Article
Rentsch, Christopher
Bebu, Ionut
Guest, Jodie L.
Rimland, David
Agan, Brian K.
Marconi, Vincent
Combining Epidemiologic and Biostatistical Tools to Enhance Variable Selection in HIV Cohort Analyses
title Combining Epidemiologic and Biostatistical Tools to Enhance Variable Selection in HIV Cohort Analyses
title_full Combining Epidemiologic and Biostatistical Tools to Enhance Variable Selection in HIV Cohort Analyses
title_fullStr Combining Epidemiologic and Biostatistical Tools to Enhance Variable Selection in HIV Cohort Analyses
title_full_unstemmed Combining Epidemiologic and Biostatistical Tools to Enhance Variable Selection in HIV Cohort Analyses
title_short Combining Epidemiologic and Biostatistical Tools to Enhance Variable Selection in HIV Cohort Analyses
title_sort combining epidemiologic and biostatistical tools to enhance variable selection in hiv cohort analyses
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3906149/
https://www.ncbi.nlm.nih.gov/pubmed/24489902
http://dx.doi.org/10.1371/journal.pone.0087352
work_keys_str_mv AT rentschchristopher combiningepidemiologicandbiostatisticaltoolstoenhancevariableselectioninhivcohortanalyses
AT bebuionut combiningepidemiologicandbiostatisticaltoolstoenhancevariableselectioninhivcohortanalyses
AT guestjodiel combiningepidemiologicandbiostatisticaltoolstoenhancevariableselectioninhivcohortanalyses
AT rimlanddavid combiningepidemiologicandbiostatisticaltoolstoenhancevariableselectioninhivcohortanalyses
AT aganbriank combiningepidemiologicandbiostatisticaltoolstoenhancevariableselectioninhivcohortanalyses
AT marconivincent combiningepidemiologicandbiostatisticaltoolstoenhancevariableselectioninhivcohortanalyses