Cargando…

The Australian longitudinal study on male health sampling design and survey weighting: implications for analysis and interpretation of clustered data

BACKGROUND: The Australian Longitudinal Study on Male Health (Ten to Men) used a complex sampling scheme to identify potential participants for the baseline survey. This raises important questions about when and how to adjust for the sampling design when analyzing data from the baseline survey. METH...

Descripción completa

Detalles Bibliográficos
Autores principales: Spittal, Matthew J., Carlin, John B., Currier, Dianne, Downes, Marnie, English, Dallas R., Gordon, Ian, Pirkis, Jane, Gurrin, Lyle
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5103251/
https://www.ncbi.nlm.nih.gov/pubmed/28185562
http://dx.doi.org/10.1186/s12889-016-3699-0
_version_ 1782466560555745280
author Spittal, Matthew J.
Carlin, John B.
Currier, Dianne
Downes, Marnie
English, Dallas R.
Gordon, Ian
Pirkis, Jane
Gurrin, Lyle
author_facet Spittal, Matthew J.
Carlin, John B.
Currier, Dianne
Downes, Marnie
English, Dallas R.
Gordon, Ian
Pirkis, Jane
Gurrin, Lyle
author_sort Spittal, Matthew J.
collection PubMed
description BACKGROUND: The Australian Longitudinal Study on Male Health (Ten to Men) used a complex sampling scheme to identify potential participants for the baseline survey. This raises important questions about when and how to adjust for the sampling design when analyzing data from the baseline survey. METHODS: We describe the sampling scheme used in Ten to Men focusing on four important elements: stratification, multi-stage sampling, clustering and sample weights. We discuss how these elements fit together when using baseline data to estimate a population parameter (e.g., population mean or prevalence) or to estimate the association between an exposure and an outcome (e.g., an odds ratio). We illustrate this with examples using a continuous outcome (weight in kilograms) and a binary outcome (smoking status). RESULTS: Estimates of a population mean or disease prevalence using Ten to Men baseline data are influenced by the extent to which the sampling design is addressed in an analysis. Estimates of mean weight and smoking prevalence are larger in unweighted analyses than weighted analyses (e.g., mean = 83.9 kg vs. 81.4 kg; prevalence = 18.0 % vs. 16.7 %, for unweighted and weighted analyses respectively) and the standard error of the mean is 1.03 times larger in an analysis that acknowledges the hierarchical (clustered) structure of the data compared with one that does not. For smoking prevalence, the corresponding standard error is 1.07 times larger. Measures of association (mean group differences, odds ratios) are generally similar in unweighted or weighted analyses and whether or not adjustment is made for clustering. CONCLUSIONS: The extent to which the Ten to Men sampling design is accounted for in any analysis of the baseline data will depend on the research question. When the goals of the analysis are to estimate the prevalence of a disease or risk factor in the population or the magnitude of a population-level exposure-outcome association, our advice is to adopt an analysis that respects the sampling design.
format Online
Article
Text
id pubmed-5103251
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-51032512016-11-10 The Australian longitudinal study on male health sampling design and survey weighting: implications for analysis and interpretation of clustered data Spittal, Matthew J. Carlin, John B. Currier, Dianne Downes, Marnie English, Dallas R. Gordon, Ian Pirkis, Jane Gurrin, Lyle BMC Public Health Research BACKGROUND: The Australian Longitudinal Study on Male Health (Ten to Men) used a complex sampling scheme to identify potential participants for the baseline survey. This raises important questions about when and how to adjust for the sampling design when analyzing data from the baseline survey. METHODS: We describe the sampling scheme used in Ten to Men focusing on four important elements: stratification, multi-stage sampling, clustering and sample weights. We discuss how these elements fit together when using baseline data to estimate a population parameter (e.g., population mean or prevalence) or to estimate the association between an exposure and an outcome (e.g., an odds ratio). We illustrate this with examples using a continuous outcome (weight in kilograms) and a binary outcome (smoking status). RESULTS: Estimates of a population mean or disease prevalence using Ten to Men baseline data are influenced by the extent to which the sampling design is addressed in an analysis. Estimates of mean weight and smoking prevalence are larger in unweighted analyses than weighted analyses (e.g., mean = 83.9 kg vs. 81.4 kg; prevalence = 18.0 % vs. 16.7 %, for unweighted and weighted analyses respectively) and the standard error of the mean is 1.03 times larger in an analysis that acknowledges the hierarchical (clustered) structure of the data compared with one that does not. For smoking prevalence, the corresponding standard error is 1.07 times larger. Measures of association (mean group differences, odds ratios) are generally similar in unweighted or weighted analyses and whether or not adjustment is made for clustering. CONCLUSIONS: The extent to which the Ten to Men sampling design is accounted for in any analysis of the baseline data will depend on the research question. When the goals of the analysis are to estimate the prevalence of a disease or risk factor in the population or the magnitude of a population-level exposure-outcome association, our advice is to adopt an analysis that respects the sampling design. BioMed Central 2016-10-31 /pmc/articles/PMC5103251/ /pubmed/28185562 http://dx.doi.org/10.1186/s12889-016-3699-0 Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Spittal, Matthew J.
Carlin, John B.
Currier, Dianne
Downes, Marnie
English, Dallas R.
Gordon, Ian
Pirkis, Jane
Gurrin, Lyle
The Australian longitudinal study on male health sampling design and survey weighting: implications for analysis and interpretation of clustered data
title The Australian longitudinal study on male health sampling design and survey weighting: implications for analysis and interpretation of clustered data
title_full The Australian longitudinal study on male health sampling design and survey weighting: implications for analysis and interpretation of clustered data
title_fullStr The Australian longitudinal study on male health sampling design and survey weighting: implications for analysis and interpretation of clustered data
title_full_unstemmed The Australian longitudinal study on male health sampling design and survey weighting: implications for analysis and interpretation of clustered data
title_short The Australian longitudinal study on male health sampling design and survey weighting: implications for analysis and interpretation of clustered data
title_sort australian longitudinal study on male health sampling design and survey weighting: implications for analysis and interpretation of clustered data
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5103251/
https://www.ncbi.nlm.nih.gov/pubmed/28185562
http://dx.doi.org/10.1186/s12889-016-3699-0
work_keys_str_mv AT spittalmatthewj theaustralianlongitudinalstudyonmalehealthsamplingdesignandsurveyweightingimplicationsforanalysisandinterpretationofclustereddata
AT carlinjohnb theaustralianlongitudinalstudyonmalehealthsamplingdesignandsurveyweightingimplicationsforanalysisandinterpretationofclustereddata
AT currierdianne theaustralianlongitudinalstudyonmalehealthsamplingdesignandsurveyweightingimplicationsforanalysisandinterpretationofclustereddata
AT downesmarnie theaustralianlongitudinalstudyonmalehealthsamplingdesignandsurveyweightingimplicationsforanalysisandinterpretationofclustereddata
AT englishdallasr theaustralianlongitudinalstudyonmalehealthsamplingdesignandsurveyweightingimplicationsforanalysisandinterpretationofclustereddata
AT gordonian theaustralianlongitudinalstudyonmalehealthsamplingdesignandsurveyweightingimplicationsforanalysisandinterpretationofclustereddata
AT pirkisjane theaustralianlongitudinalstudyonmalehealthsamplingdesignandsurveyweightingimplicationsforanalysisandinterpretationofclustereddata
AT gurrinlyle theaustralianlongitudinalstudyonmalehealthsamplingdesignandsurveyweightingimplicationsforanalysisandinterpretationofclustereddata
AT spittalmatthewj australianlongitudinalstudyonmalehealthsamplingdesignandsurveyweightingimplicationsforanalysisandinterpretationofclustereddata
AT carlinjohnb australianlongitudinalstudyonmalehealthsamplingdesignandsurveyweightingimplicationsforanalysisandinterpretationofclustereddata
AT currierdianne australianlongitudinalstudyonmalehealthsamplingdesignandsurveyweightingimplicationsforanalysisandinterpretationofclustereddata
AT downesmarnie australianlongitudinalstudyonmalehealthsamplingdesignandsurveyweightingimplicationsforanalysisandinterpretationofclustereddata
AT englishdallasr australianlongitudinalstudyonmalehealthsamplingdesignandsurveyweightingimplicationsforanalysisandinterpretationofclustereddata
AT gordonian australianlongitudinalstudyonmalehealthsamplingdesignandsurveyweightingimplicationsforanalysisandinterpretationofclustereddata
AT pirkisjane australianlongitudinalstudyonmalehealthsamplingdesignandsurveyweightingimplicationsforanalysisandinterpretationofclustereddata
AT gurrinlyle australianlongitudinalstudyonmalehealthsamplingdesignandsurveyweightingimplicationsforanalysisandinterpretationofclustereddata