Cargando…

Correction of Selection Bias in Survey Data: Is the Statistical Cure Worse Than the Bias?

In previous articles in the American Journal of Epidemiology (Am J Epidemiol. 2013;177(5):431–442) and American Journal of Public Health (Am J Public Health. 2013;103(10):1895–1901), Masters et al. reported age-specific hazard ratios for the contrasts in mortality rates between obesity categories. T...

Descripción completa

Detalles Bibliográficos
Autor principal:	Hanley, James A.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Oxford University Press 2017
Materias:	Commentary
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5962938/ https://www.ncbi.nlm.nih.gov/pubmed/27923798 http://dx.doi.org/10.1093/aje/kww175

_version_	1783324968647393280
author	Hanley, James A.
author_facet	Hanley, James A.
author_sort	Hanley, James A.
collection	PubMed
description	In previous articles in the American Journal of Epidemiology (Am J Epidemiol. 2013;177(5):431–442) and American Journal of Public Health (Am J Public Health. 2013;103(10):1895–1901), Masters et al. reported age-specific hazard ratios for the contrasts in mortality rates between obesity categories. They corrected the observed hazard ratios for selection bias caused by what they postulated was the nonrepresentativeness of the participants in the National Health Interview Study that increased with age, obesity, and ill health. However, it is possible that their regression approach to remove the alleged bias has not produced, and in general cannot produce, sensible hazard ratio estimates. First, one must consider how many nonparticipants there might have been in each category of obesity and of age at entry and how much higher the mortality rates would have to be in nonparticipants than in participants in these same categories. What plausible set of numerical values would convert the (“biased”) decreasing-with-age hazard ratios seen in the data into the (“unbiased”) increasing-with-age ratios that they computed? Can these values be encapsulated in (and can sensible values be recovered from) 1 additional internal variable in a regression model? Second, one must examine the age pattern of the hazard ratios that have been adjusted for selection. Without the correction, the hazard ratios are attenuated with increasing age. With it, the hazard ratios at older ages are considerably higher, but those at younger ages are well below 1. Third, one must test whether the regression approach suggested by Masters et al. would correct the nonrepresentativeness that increased with age and ill health that I introduced into real and hypothetical data sets. I found that the approach did not recover the hazard ratio patterns present in the unselected data sets: The corrections overshot the target at older ages and undershot it at lower ages.
format	Online Article Text
id	pubmed-5962938
institution	National Center for Biotechnology Information
language	English
publishDate	2017
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-59629382018-06-04 Correction of Selection Bias in Survey Data: Is the Statistical Cure Worse Than the Bias? Hanley, James A. Am J Epidemiol Commentary In previous articles in the American Journal of Epidemiology (Am J Epidemiol. 2013;177(5):431–442) and American Journal of Public Health (Am J Public Health. 2013;103(10):1895–1901), Masters et al. reported age-specific hazard ratios for the contrasts in mortality rates between obesity categories. They corrected the observed hazard ratios for selection bias caused by what they postulated was the nonrepresentativeness of the participants in the National Health Interview Study that increased with age, obesity, and ill health. However, it is possible that their regression approach to remove the alleged bias has not produced, and in general cannot produce, sensible hazard ratio estimates. First, one must consider how many nonparticipants there might have been in each category of obesity and of age at entry and how much higher the mortality rates would have to be in nonparticipants than in participants in these same categories. What plausible set of numerical values would convert the (“biased”) decreasing-with-age hazard ratios seen in the data into the (“unbiased”) increasing-with-age ratios that they computed? Can these values be encapsulated in (and can sensible values be recovered from) 1 additional internal variable in a regression model? Second, one must examine the age pattern of the hazard ratios that have been adjusted for selection. Without the correction, the hazard ratios are attenuated with increasing age. With it, the hazard ratios at older ages are considerably higher, but those at younger ages are well below 1. Third, one must test whether the regression approach suggested by Masters et al. would correct the nonrepresentativeness that increased with age and ill health that I introduced into real and hypothetical data sets. I found that the approach did not recover the hazard ratio patterns present in the unselected data sets: The corrections overshot the target at older ages and undershot it at lower ages. Oxford University Press 2017-03-15 2017-03-08 /pmc/articles/PMC5962938/ /pubmed/27923798 http://dx.doi.org/10.1093/aje/kww175 Text en © The Author 2017. Published by Oxford University Press on behalf of the Johns Hopkins Bloomberg School of Public Health. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journalpermissions@oup.com.
spellingShingle	Commentary Hanley, James A. Correction of Selection Bias in Survey Data: Is the Statistical Cure Worse Than the Bias?
title	Correction of Selection Bias in Survey Data: Is the Statistical Cure Worse Than the Bias?
title_full	Correction of Selection Bias in Survey Data: Is the Statistical Cure Worse Than the Bias?
title_fullStr	Correction of Selection Bias in Survey Data: Is the Statistical Cure Worse Than the Bias?
title_full_unstemmed	Correction of Selection Bias in Survey Data: Is the Statistical Cure Worse Than the Bias?
title_short	Correction of Selection Bias in Survey Data: Is the Statistical Cure Worse Than the Bias?
title_sort	correction of selection bias in survey data: is the statistical cure worse than the bias?
topic	Commentary
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5962938/ https://www.ncbi.nlm.nih.gov/pubmed/27923798 http://dx.doi.org/10.1093/aje/kww175
work_keys_str_mv	AT hanleyjamesa correctionofselectionbiasinsurveydataisthestatisticalcureworsethanthebias

Correction of Selection Bias in Survey Data: Is the Statistical Cure Worse Than the Bias?

Ejemplares similares