Cargando…

Using variable importance measures from causal inference to rank risk factors of schistosomiasis infection in a rural setting in China

BACKGROUND: Schistosomiasis infection, contracted through contact with contaminated water, is a global public health concern. In this paper we analyze data from a retrospective study reporting water contact and schistosomiasis infection status among 1011 individuals in rural China. We present semi-p...

Descripción completa

Detalles Bibliográficos
Autores principales: Sudat, Sylvia EK, Carlton, Elizabeth J, Seto, Edmund YW, Spear, Robert C, Hubbard, Alan E
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2913928/
https://www.ncbi.nlm.nih.gov/pubmed/20626918
http://dx.doi.org/10.1186/1742-5573-7-3
_version_ 1782184707097624576
author Sudat, Sylvia EK
Carlton, Elizabeth J
Seto, Edmund YW
Spear, Robert C
Hubbard, Alan E
author_facet Sudat, Sylvia EK
Carlton, Elizabeth J
Seto, Edmund YW
Spear, Robert C
Hubbard, Alan E
author_sort Sudat, Sylvia EK
collection PubMed
description BACKGROUND: Schistosomiasis infection, contracted through contact with contaminated water, is a global public health concern. In this paper we analyze data from a retrospective study reporting water contact and schistosomiasis infection status among 1011 individuals in rural China. We present semi-parametric methods for identifying risk factors through a comparison of three analysis approaches: a prediction-focused machine learning algorithm, a simple main-effects multivariable regression, and a semi-parametric variable importance (VI) estimate inspired by a causal population intervention parameter. RESULTS: The multivariable regression found only tool washing to be associated with the outcome, with a relative risk of 1.03 and a 95% confidence interval (CI) of 1.01-1.05. Three types of water contact were found to be associated with the outcome in the semi-parametric VI analysis: July water contact (VI estimate 0.16, 95% CI 0.11-0.22), water contact from tool washing (VI estimate 0.88, 95% CI 0.80-0.97), and water contact from rice planting (VI estimate 0.71, 95% CI 0.53-0.96). The July VI result, in particular, indicated a strong association with infection status - its causal interpretation implies that eliminating water contact in July would reduce the prevalence of schistosomiasis in our study population by 84%, or from 0.3 to 0.05 (95% CI 78%-89%). CONCLUSIONS: The July VI estimate suggests possible within-season variability in schistosomiasis infection risk, an association not detected by the regression analysis. Though there are many limitations to this study that temper the potential for causal interpretations, if a high-risk time period could be detected in something close to real time, new prevention options would be opened. Most importantly, we emphasize that traditional regression approaches are usually based on arbitrary pre-specified models, making their parameters difficult to interpret in the context of real-world applications. Our results support the practical application of analysis approaches that, in contrast, do not require arbitrary model pre-specification, estimate parameters that have simple public health interpretations, and apply inference that considers model selection as a source of variation.
format Text
id pubmed-2913928
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-29139282010-08-11 Using variable importance measures from causal inference to rank risk factors of schistosomiasis infection in a rural setting in China Sudat, Sylvia EK Carlton, Elizabeth J Seto, Edmund YW Spear, Robert C Hubbard, Alan E Epidemiol Perspect Innov Research BACKGROUND: Schistosomiasis infection, contracted through contact with contaminated water, is a global public health concern. In this paper we analyze data from a retrospective study reporting water contact and schistosomiasis infection status among 1011 individuals in rural China. We present semi-parametric methods for identifying risk factors through a comparison of three analysis approaches: a prediction-focused machine learning algorithm, a simple main-effects multivariable regression, and a semi-parametric variable importance (VI) estimate inspired by a causal population intervention parameter. RESULTS: The multivariable regression found only tool washing to be associated with the outcome, with a relative risk of 1.03 and a 95% confidence interval (CI) of 1.01-1.05. Three types of water contact were found to be associated with the outcome in the semi-parametric VI analysis: July water contact (VI estimate 0.16, 95% CI 0.11-0.22), water contact from tool washing (VI estimate 0.88, 95% CI 0.80-0.97), and water contact from rice planting (VI estimate 0.71, 95% CI 0.53-0.96). The July VI result, in particular, indicated a strong association with infection status - its causal interpretation implies that eliminating water contact in July would reduce the prevalence of schistosomiasis in our study population by 84%, or from 0.3 to 0.05 (95% CI 78%-89%). CONCLUSIONS: The July VI estimate suggests possible within-season variability in schistosomiasis infection risk, an association not detected by the regression analysis. Though there are many limitations to this study that temper the potential for causal interpretations, if a high-risk time period could be detected in something close to real time, new prevention options would be opened. Most importantly, we emphasize that traditional regression approaches are usually based on arbitrary pre-specified models, making their parameters difficult to interpret in the context of real-world applications. Our results support the practical application of analysis approaches that, in contrast, do not require arbitrary model pre-specification, estimate parameters that have simple public health interpretations, and apply inference that considers model selection as a source of variation. BioMed Central 2010-07-14 /pmc/articles/PMC2913928/ /pubmed/20626918 http://dx.doi.org/10.1186/1742-5573-7-3 Text en Copyright ©2010 Sudat et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Sudat, Sylvia EK
Carlton, Elizabeth J
Seto, Edmund YW
Spear, Robert C
Hubbard, Alan E
Using variable importance measures from causal inference to rank risk factors of schistosomiasis infection in a rural setting in China
title Using variable importance measures from causal inference to rank risk factors of schistosomiasis infection in a rural setting in China
title_full Using variable importance measures from causal inference to rank risk factors of schistosomiasis infection in a rural setting in China
title_fullStr Using variable importance measures from causal inference to rank risk factors of schistosomiasis infection in a rural setting in China
title_full_unstemmed Using variable importance measures from causal inference to rank risk factors of schistosomiasis infection in a rural setting in China
title_short Using variable importance measures from causal inference to rank risk factors of schistosomiasis infection in a rural setting in China
title_sort using variable importance measures from causal inference to rank risk factors of schistosomiasis infection in a rural setting in china
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2913928/
https://www.ncbi.nlm.nih.gov/pubmed/20626918
http://dx.doi.org/10.1186/1742-5573-7-3
work_keys_str_mv AT sudatsylviaek usingvariableimportancemeasuresfromcausalinferencetorankriskfactorsofschistosomiasisinfectioninaruralsettinginchina
AT carltonelizabethj usingvariableimportancemeasuresfromcausalinferencetorankriskfactorsofschistosomiasisinfectioninaruralsettinginchina
AT setoedmundyw usingvariableimportancemeasuresfromcausalinferencetorankriskfactorsofschistosomiasisinfectioninaruralsettinginchina
AT spearrobertc usingvariableimportancemeasuresfromcausalinferencetorankriskfactorsofschistosomiasisinfectioninaruralsettinginchina
AT hubbardalane usingvariableimportancemeasuresfromcausalinferencetorankriskfactorsofschistosomiasisinfectioninaruralsettinginchina