Cargando…
Combining Epidemiologic and Biostatistical Tools to Enhance Variable Selection in HIV Cohort Analyses
BACKGROUND: Variable selection is an important step in building a multivariate regression model for which several methods and statistical packages are available. A comprehensive approach for variable selection in complex multivariate regression analyses within HIV cohorts is explored by utilizing bo...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3906149/ https://www.ncbi.nlm.nih.gov/pubmed/24489902 http://dx.doi.org/10.1371/journal.pone.0087352 |
_version_ | 1782301448529248256 |
---|---|
author | Rentsch, Christopher Bebu, Ionut Guest, Jodie L. Rimland, David Agan, Brian K. Marconi, Vincent |
author_facet | Rentsch, Christopher Bebu, Ionut Guest, Jodie L. Rimland, David Agan, Brian K. Marconi, Vincent |
author_sort | Rentsch, Christopher |
collection | PubMed |
description | BACKGROUND: Variable selection is an important step in building a multivariate regression model for which several methods and statistical packages are available. A comprehensive approach for variable selection in complex multivariate regression analyses within HIV cohorts is explored by utilizing both epidemiological and biostatistical procedures. METHODS: Three different methods for variable selection were illustrated in a study comparing survival time between subjects in the Department of Defense’s National History Study and the Atlanta Veterans Affairs Medical Center’s HIV Atlanta VA Cohort Study. The first two methods were stepwise selection procedures, based either on significance tests (Score test), or on information theory (Akaike Information Criterion), while the third method employed a Bayesian argument (Bayesian Model Averaging). RESULTS: All three methods resulted in a similar parsimonious survival model. Three of the covariates previously used in the multivariate model were not included in the final model suggested by the three approaches. When comparing the parsimonious model to the previously published model, there was evidence of less variance in the main survival estimates. CONCLUSIONS: The variable selection approaches considered in this study allowed building a model based on significance tests, on an information criterion, and on averaging models using their posterior probabilities. A parsimonious model that balanced these three approaches was found to provide a better fit than the previously reported model. |
format | Online Article Text |
id | pubmed-3906149 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-39061492014-01-31 Combining Epidemiologic and Biostatistical Tools to Enhance Variable Selection in HIV Cohort Analyses Rentsch, Christopher Bebu, Ionut Guest, Jodie L. Rimland, David Agan, Brian K. Marconi, Vincent PLoS One Research Article BACKGROUND: Variable selection is an important step in building a multivariate regression model for which several methods and statistical packages are available. A comprehensive approach for variable selection in complex multivariate regression analyses within HIV cohorts is explored by utilizing both epidemiological and biostatistical procedures. METHODS: Three different methods for variable selection were illustrated in a study comparing survival time between subjects in the Department of Defense’s National History Study and the Atlanta Veterans Affairs Medical Center’s HIV Atlanta VA Cohort Study. The first two methods were stepwise selection procedures, based either on significance tests (Score test), or on information theory (Akaike Information Criterion), while the third method employed a Bayesian argument (Bayesian Model Averaging). RESULTS: All three methods resulted in a similar parsimonious survival model. Three of the covariates previously used in the multivariate model were not included in the final model suggested by the three approaches. When comparing the parsimonious model to the previously published model, there was evidence of less variance in the main survival estimates. CONCLUSIONS: The variable selection approaches considered in this study allowed building a model based on significance tests, on an information criterion, and on averaging models using their posterior probabilities. A parsimonious model that balanced these three approaches was found to provide a better fit than the previously reported model. Public Library of Science 2014-01-29 /pmc/articles/PMC3906149/ /pubmed/24489902 http://dx.doi.org/10.1371/journal.pone.0087352 Text en https://creativecommons.org/publicdomain/zero/1.0/ This is an open-access article distributed under the terms of the Creative Commons Public Domain declaration, which stipulates that, once placed in the public domain, this work may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. |
spellingShingle | Research Article Rentsch, Christopher Bebu, Ionut Guest, Jodie L. Rimland, David Agan, Brian K. Marconi, Vincent Combining Epidemiologic and Biostatistical Tools to Enhance Variable Selection in HIV Cohort Analyses |
title | Combining Epidemiologic and Biostatistical Tools to Enhance Variable Selection in HIV Cohort Analyses |
title_full | Combining Epidemiologic and Biostatistical Tools to Enhance Variable Selection in HIV Cohort Analyses |
title_fullStr | Combining Epidemiologic and Biostatistical Tools to Enhance Variable Selection in HIV Cohort Analyses |
title_full_unstemmed | Combining Epidemiologic and Biostatistical Tools to Enhance Variable Selection in HIV Cohort Analyses |
title_short | Combining Epidemiologic and Biostatistical Tools to Enhance Variable Selection in HIV Cohort Analyses |
title_sort | combining epidemiologic and biostatistical tools to enhance variable selection in hiv cohort analyses |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3906149/ https://www.ncbi.nlm.nih.gov/pubmed/24489902 http://dx.doi.org/10.1371/journal.pone.0087352 |
work_keys_str_mv | AT rentschchristopher combiningepidemiologicandbiostatisticaltoolstoenhancevariableselectioninhivcohortanalyses AT bebuionut combiningepidemiologicandbiostatisticaltoolstoenhancevariableselectioninhivcohortanalyses AT guestjodiel combiningepidemiologicandbiostatisticaltoolstoenhancevariableselectioninhivcohortanalyses AT rimlanddavid combiningepidemiologicandbiostatisticaltoolstoenhancevariableselectioninhivcohortanalyses AT aganbriank combiningepidemiologicandbiostatisticaltoolstoenhancevariableselectioninhivcohortanalyses AT marconivincent combiningepidemiologicandbiostatisticaltoolstoenhancevariableselectioninhivcohortanalyses |