Cargando…

The PIT-trap—A “model-free” bootstrap procedure for inference about regression models with discrete, multivariate responses

Bootstrap methods are widely used in statistics, and bootstrapping of residuals can be especially useful in the regression context. However, difficulties are encountered extending residual resampling to regression settings where residuals are not identically distributed (thus not amenable to bootstr...

Descripción completa

Detalles Bibliográficos
Autores principales: Warton, David I., Thibaut, Loïc, Wang, Yi Alice
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5524334/
https://www.ncbi.nlm.nih.gov/pubmed/28738071
http://dx.doi.org/10.1371/journal.pone.0181790
_version_ 1783252448304955392
author Warton, David I.
Thibaut, Loïc
Wang, Yi Alice
author_facet Warton, David I.
Thibaut, Loïc
Wang, Yi Alice
author_sort Warton, David I.
collection PubMed
description Bootstrap methods are widely used in statistics, and bootstrapping of residuals can be especially useful in the regression context. However, difficulties are encountered extending residual resampling to regression settings where residuals are not identically distributed (thus not amenable to bootstrapping)—common examples including logistic or Poisson regression and generalizations to handle clustered or multivariate data, such as generalised estimating equations. We propose a bootstrap method based on probability integral transform (PIT-) residuals, which we call the PIT-trap, which assumes data come from some marginal distribution F of known parametric form. This method can be understood as a type of “model-free bootstrap”, adapted to the problem of discrete and highly multivariate data. PIT-residuals have the key property that they are (asymptotically) pivotal. The PIT-trap thus inherits the key property, not afforded by any other residual resampling approach, that the marginal distribution of data can be preserved under PIT-trapping. This in turn enables the derivation of some standard bootstrap properties, including second-order correctness of pivotal PIT-trap test statistics. In multivariate data, bootstrapping rows of PIT-residuals affords the property that it preserves correlation in data without the need for it to be modelled, a key point of difference as compared to a parametric bootstrap. The proposed method is illustrated on an example involving multivariate abundance data in ecology, and demonstrated via simulation to have improved properties as compared to competing resampling methods.
format Online
Article
Text
id pubmed-5524334
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-55243342017-08-07 The PIT-trap—A “model-free” bootstrap procedure for inference about regression models with discrete, multivariate responses Warton, David I. Thibaut, Loïc Wang, Yi Alice PLoS One Research Article Bootstrap methods are widely used in statistics, and bootstrapping of residuals can be especially useful in the regression context. However, difficulties are encountered extending residual resampling to regression settings where residuals are not identically distributed (thus not amenable to bootstrapping)—common examples including logistic or Poisson regression and generalizations to handle clustered or multivariate data, such as generalised estimating equations. We propose a bootstrap method based on probability integral transform (PIT-) residuals, which we call the PIT-trap, which assumes data come from some marginal distribution F of known parametric form. This method can be understood as a type of “model-free bootstrap”, adapted to the problem of discrete and highly multivariate data. PIT-residuals have the key property that they are (asymptotically) pivotal. The PIT-trap thus inherits the key property, not afforded by any other residual resampling approach, that the marginal distribution of data can be preserved under PIT-trapping. This in turn enables the derivation of some standard bootstrap properties, including second-order correctness of pivotal PIT-trap test statistics. In multivariate data, bootstrapping rows of PIT-residuals affords the property that it preserves correlation in data without the need for it to be modelled, a key point of difference as compared to a parametric bootstrap. The proposed method is illustrated on an example involving multivariate abundance data in ecology, and demonstrated via simulation to have improved properties as compared to competing resampling methods. Public Library of Science 2017-07-24 /pmc/articles/PMC5524334/ /pubmed/28738071 http://dx.doi.org/10.1371/journal.pone.0181790 Text en © 2017 Warton et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Warton, David I.
Thibaut, Loïc
Wang, Yi Alice
The PIT-trap—A “model-free” bootstrap procedure for inference about regression models with discrete, multivariate responses
title The PIT-trap—A “model-free” bootstrap procedure for inference about regression models with discrete, multivariate responses
title_full The PIT-trap—A “model-free” bootstrap procedure for inference about regression models with discrete, multivariate responses
title_fullStr The PIT-trap—A “model-free” bootstrap procedure for inference about regression models with discrete, multivariate responses
title_full_unstemmed The PIT-trap—A “model-free” bootstrap procedure for inference about regression models with discrete, multivariate responses
title_short The PIT-trap—A “model-free” bootstrap procedure for inference about regression models with discrete, multivariate responses
title_sort pit-trap—a “model-free” bootstrap procedure for inference about regression models with discrete, multivariate responses
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5524334/
https://www.ncbi.nlm.nih.gov/pubmed/28738071
http://dx.doi.org/10.1371/journal.pone.0181790
work_keys_str_mv AT wartondavidi thepittrapamodelfreebootstrapprocedureforinferenceaboutregressionmodelswithdiscretemultivariateresponses
AT thibautloic thepittrapamodelfreebootstrapprocedureforinferenceaboutregressionmodelswithdiscretemultivariateresponses
AT wangyialice thepittrapamodelfreebootstrapprocedureforinferenceaboutregressionmodelswithdiscretemultivariateresponses
AT wartondavidi pittrapamodelfreebootstrapprocedureforinferenceaboutregressionmodelswithdiscretemultivariateresponses
AT thibautloic pittrapamodelfreebootstrapprocedureforinferenceaboutregressionmodelswithdiscretemultivariateresponses
AT wangyialice pittrapamodelfreebootstrapprocedureforinferenceaboutregressionmodelswithdiscretemultivariateresponses