Cargando…
The Australian longitudinal study on male health sampling design and survey weighting: implications for analysis and interpretation of clustered data
BACKGROUND: The Australian Longitudinal Study on Male Health (Ten to Men) used a complex sampling scheme to identify potential participants for the baseline survey. This raises important questions about when and how to adjust for the sampling design when analyzing data from the baseline survey. METH...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5103251/ https://www.ncbi.nlm.nih.gov/pubmed/28185562 http://dx.doi.org/10.1186/s12889-016-3699-0 |
_version_ | 1782466560555745280 |
---|---|
author | Spittal, Matthew J. Carlin, John B. Currier, Dianne Downes, Marnie English, Dallas R. Gordon, Ian Pirkis, Jane Gurrin, Lyle |
author_facet | Spittal, Matthew J. Carlin, John B. Currier, Dianne Downes, Marnie English, Dallas R. Gordon, Ian Pirkis, Jane Gurrin, Lyle |
author_sort | Spittal, Matthew J. |
collection | PubMed |
description | BACKGROUND: The Australian Longitudinal Study on Male Health (Ten to Men) used a complex sampling scheme to identify potential participants for the baseline survey. This raises important questions about when and how to adjust for the sampling design when analyzing data from the baseline survey. METHODS: We describe the sampling scheme used in Ten to Men focusing on four important elements: stratification, multi-stage sampling, clustering and sample weights. We discuss how these elements fit together when using baseline data to estimate a population parameter (e.g., population mean or prevalence) or to estimate the association between an exposure and an outcome (e.g., an odds ratio). We illustrate this with examples using a continuous outcome (weight in kilograms) and a binary outcome (smoking status). RESULTS: Estimates of a population mean or disease prevalence using Ten to Men baseline data are influenced by the extent to which the sampling design is addressed in an analysis. Estimates of mean weight and smoking prevalence are larger in unweighted analyses than weighted analyses (e.g., mean = 83.9 kg vs. 81.4 kg; prevalence = 18.0 % vs. 16.7 %, for unweighted and weighted analyses respectively) and the standard error of the mean is 1.03 times larger in an analysis that acknowledges the hierarchical (clustered) structure of the data compared with one that does not. For smoking prevalence, the corresponding standard error is 1.07 times larger. Measures of association (mean group differences, odds ratios) are generally similar in unweighted or weighted analyses and whether or not adjustment is made for clustering. CONCLUSIONS: The extent to which the Ten to Men sampling design is accounted for in any analysis of the baseline data will depend on the research question. When the goals of the analysis are to estimate the prevalence of a disease or risk factor in the population or the magnitude of a population-level exposure-outcome association, our advice is to adopt an analysis that respects the sampling design. |
format | Online Article Text |
id | pubmed-5103251 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-51032512016-11-10 The Australian longitudinal study on male health sampling design and survey weighting: implications for analysis and interpretation of clustered data Spittal, Matthew J. Carlin, John B. Currier, Dianne Downes, Marnie English, Dallas R. Gordon, Ian Pirkis, Jane Gurrin, Lyle BMC Public Health Research BACKGROUND: The Australian Longitudinal Study on Male Health (Ten to Men) used a complex sampling scheme to identify potential participants for the baseline survey. This raises important questions about when and how to adjust for the sampling design when analyzing data from the baseline survey. METHODS: We describe the sampling scheme used in Ten to Men focusing on four important elements: stratification, multi-stage sampling, clustering and sample weights. We discuss how these elements fit together when using baseline data to estimate a population parameter (e.g., population mean or prevalence) or to estimate the association between an exposure and an outcome (e.g., an odds ratio). We illustrate this with examples using a continuous outcome (weight in kilograms) and a binary outcome (smoking status). RESULTS: Estimates of a population mean or disease prevalence using Ten to Men baseline data are influenced by the extent to which the sampling design is addressed in an analysis. Estimates of mean weight and smoking prevalence are larger in unweighted analyses than weighted analyses (e.g., mean = 83.9 kg vs. 81.4 kg; prevalence = 18.0 % vs. 16.7 %, for unweighted and weighted analyses respectively) and the standard error of the mean is 1.03 times larger in an analysis that acknowledges the hierarchical (clustered) structure of the data compared with one that does not. For smoking prevalence, the corresponding standard error is 1.07 times larger. Measures of association (mean group differences, odds ratios) are generally similar in unweighted or weighted analyses and whether or not adjustment is made for clustering. CONCLUSIONS: The extent to which the Ten to Men sampling design is accounted for in any analysis of the baseline data will depend on the research question. When the goals of the analysis are to estimate the prevalence of a disease or risk factor in the population or the magnitude of a population-level exposure-outcome association, our advice is to adopt an analysis that respects the sampling design. BioMed Central 2016-10-31 /pmc/articles/PMC5103251/ /pubmed/28185562 http://dx.doi.org/10.1186/s12889-016-3699-0 Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Spittal, Matthew J. Carlin, John B. Currier, Dianne Downes, Marnie English, Dallas R. Gordon, Ian Pirkis, Jane Gurrin, Lyle The Australian longitudinal study on male health sampling design and survey weighting: implications for analysis and interpretation of clustered data |
title | The Australian longitudinal study on male health sampling design and survey weighting: implications for analysis and interpretation of clustered data |
title_full | The Australian longitudinal study on male health sampling design and survey weighting: implications for analysis and interpretation of clustered data |
title_fullStr | The Australian longitudinal study on male health sampling design and survey weighting: implications for analysis and interpretation of clustered data |
title_full_unstemmed | The Australian longitudinal study on male health sampling design and survey weighting: implications for analysis and interpretation of clustered data |
title_short | The Australian longitudinal study on male health sampling design and survey weighting: implications for analysis and interpretation of clustered data |
title_sort | australian longitudinal study on male health sampling design and survey weighting: implications for analysis and interpretation of clustered data |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5103251/ https://www.ncbi.nlm.nih.gov/pubmed/28185562 http://dx.doi.org/10.1186/s12889-016-3699-0 |
work_keys_str_mv | AT spittalmatthewj theaustralianlongitudinalstudyonmalehealthsamplingdesignandsurveyweightingimplicationsforanalysisandinterpretationofclustereddata AT carlinjohnb theaustralianlongitudinalstudyonmalehealthsamplingdesignandsurveyweightingimplicationsforanalysisandinterpretationofclustereddata AT currierdianne theaustralianlongitudinalstudyonmalehealthsamplingdesignandsurveyweightingimplicationsforanalysisandinterpretationofclustereddata AT downesmarnie theaustralianlongitudinalstudyonmalehealthsamplingdesignandsurveyweightingimplicationsforanalysisandinterpretationofclustereddata AT englishdallasr theaustralianlongitudinalstudyonmalehealthsamplingdesignandsurveyweightingimplicationsforanalysisandinterpretationofclustereddata AT gordonian theaustralianlongitudinalstudyonmalehealthsamplingdesignandsurveyweightingimplicationsforanalysisandinterpretationofclustereddata AT pirkisjane theaustralianlongitudinalstudyonmalehealthsamplingdesignandsurveyweightingimplicationsforanalysisandinterpretationofclustereddata AT gurrinlyle theaustralianlongitudinalstudyonmalehealthsamplingdesignandsurveyweightingimplicationsforanalysisandinterpretationofclustereddata AT spittalmatthewj australianlongitudinalstudyonmalehealthsamplingdesignandsurveyweightingimplicationsforanalysisandinterpretationofclustereddata AT carlinjohnb australianlongitudinalstudyonmalehealthsamplingdesignandsurveyweightingimplicationsforanalysisandinterpretationofclustereddata AT currierdianne australianlongitudinalstudyonmalehealthsamplingdesignandsurveyweightingimplicationsforanalysisandinterpretationofclustereddata AT downesmarnie australianlongitudinalstudyonmalehealthsamplingdesignandsurveyweightingimplicationsforanalysisandinterpretationofclustereddata AT englishdallasr australianlongitudinalstudyonmalehealthsamplingdesignandsurveyweightingimplicationsforanalysisandinterpretationofclustereddata AT gordonian australianlongitudinalstudyonmalehealthsamplingdesignandsurveyweightingimplicationsforanalysisandinterpretationofclustereddata AT pirkisjane australianlongitudinalstudyonmalehealthsamplingdesignandsurveyweightingimplicationsforanalysisandinterpretationofclustereddata AT gurrinlyle australianlongitudinalstudyonmalehealthsamplingdesignandsurveyweightingimplicationsforanalysisandinterpretationofclustereddata |