Cargando…
The PIT-trap—A “model-free” bootstrap procedure for inference about regression models with discrete, multivariate responses
Bootstrap methods are widely used in statistics, and bootstrapping of residuals can be especially useful in the regression context. However, difficulties are encountered extending residual resampling to regression settings where residuals are not identically distributed (thus not amenable to bootstr...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5524334/ https://www.ncbi.nlm.nih.gov/pubmed/28738071 http://dx.doi.org/10.1371/journal.pone.0181790 |
_version_ | 1783252448304955392 |
---|---|
author | Warton, David I. Thibaut, Loïc Wang, Yi Alice |
author_facet | Warton, David I. Thibaut, Loïc Wang, Yi Alice |
author_sort | Warton, David I. |
collection | PubMed |
description | Bootstrap methods are widely used in statistics, and bootstrapping of residuals can be especially useful in the regression context. However, difficulties are encountered extending residual resampling to regression settings where residuals are not identically distributed (thus not amenable to bootstrapping)—common examples including logistic or Poisson regression and generalizations to handle clustered or multivariate data, such as generalised estimating equations. We propose a bootstrap method based on probability integral transform (PIT-) residuals, which we call the PIT-trap, which assumes data come from some marginal distribution F of known parametric form. This method can be understood as a type of “model-free bootstrap”, adapted to the problem of discrete and highly multivariate data. PIT-residuals have the key property that they are (asymptotically) pivotal. The PIT-trap thus inherits the key property, not afforded by any other residual resampling approach, that the marginal distribution of data can be preserved under PIT-trapping. This in turn enables the derivation of some standard bootstrap properties, including second-order correctness of pivotal PIT-trap test statistics. In multivariate data, bootstrapping rows of PIT-residuals affords the property that it preserves correlation in data without the need for it to be modelled, a key point of difference as compared to a parametric bootstrap. The proposed method is illustrated on an example involving multivariate abundance data in ecology, and demonstrated via simulation to have improved properties as compared to competing resampling methods. |
format | Online Article Text |
id | pubmed-5524334 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-55243342017-08-07 The PIT-trap—A “model-free” bootstrap procedure for inference about regression models with discrete, multivariate responses Warton, David I. Thibaut, Loïc Wang, Yi Alice PLoS One Research Article Bootstrap methods are widely used in statistics, and bootstrapping of residuals can be especially useful in the regression context. However, difficulties are encountered extending residual resampling to regression settings where residuals are not identically distributed (thus not amenable to bootstrapping)—common examples including logistic or Poisson regression and generalizations to handle clustered or multivariate data, such as generalised estimating equations. We propose a bootstrap method based on probability integral transform (PIT-) residuals, which we call the PIT-trap, which assumes data come from some marginal distribution F of known parametric form. This method can be understood as a type of “model-free bootstrap”, adapted to the problem of discrete and highly multivariate data. PIT-residuals have the key property that they are (asymptotically) pivotal. The PIT-trap thus inherits the key property, not afforded by any other residual resampling approach, that the marginal distribution of data can be preserved under PIT-trapping. This in turn enables the derivation of some standard bootstrap properties, including second-order correctness of pivotal PIT-trap test statistics. In multivariate data, bootstrapping rows of PIT-residuals affords the property that it preserves correlation in data without the need for it to be modelled, a key point of difference as compared to a parametric bootstrap. The proposed method is illustrated on an example involving multivariate abundance data in ecology, and demonstrated via simulation to have improved properties as compared to competing resampling methods. Public Library of Science 2017-07-24 /pmc/articles/PMC5524334/ /pubmed/28738071 http://dx.doi.org/10.1371/journal.pone.0181790 Text en © 2017 Warton et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Warton, David I. Thibaut, Loïc Wang, Yi Alice The PIT-trap—A “model-free” bootstrap procedure for inference about regression models with discrete, multivariate responses |
title | The PIT-trap—A “model-free” bootstrap procedure for inference about regression models with discrete, multivariate responses |
title_full | The PIT-trap—A “model-free” bootstrap procedure for inference about regression models with discrete, multivariate responses |
title_fullStr | The PIT-trap—A “model-free” bootstrap procedure for inference about regression models with discrete, multivariate responses |
title_full_unstemmed | The PIT-trap—A “model-free” bootstrap procedure for inference about regression models with discrete, multivariate responses |
title_short | The PIT-trap—A “model-free” bootstrap procedure for inference about regression models with discrete, multivariate responses |
title_sort | pit-trap—a “model-free” bootstrap procedure for inference about regression models with discrete, multivariate responses |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5524334/ https://www.ncbi.nlm.nih.gov/pubmed/28738071 http://dx.doi.org/10.1371/journal.pone.0181790 |
work_keys_str_mv | AT wartondavidi thepittrapamodelfreebootstrapprocedureforinferenceaboutregressionmodelswithdiscretemultivariateresponses AT thibautloic thepittrapamodelfreebootstrapprocedureforinferenceaboutregressionmodelswithdiscretemultivariateresponses AT wangyialice thepittrapamodelfreebootstrapprocedureforinferenceaboutregressionmodelswithdiscretemultivariateresponses AT wartondavidi pittrapamodelfreebootstrapprocedureforinferenceaboutregressionmodelswithdiscretemultivariateresponses AT thibautloic pittrapamodelfreebootstrapprocedureforinferenceaboutregressionmodelswithdiscretemultivariateresponses AT wangyialice pittrapamodelfreebootstrapprocedureforinferenceaboutregressionmodelswithdiscretemultivariateresponses |