Cargando…
A causal inference perspective on the analysis of compositional data
BACKGROUND: Compositional data comprise the parts of some whole, for which all parts sum to that whole. They are prevalent in many epidemiological contexts. Although many of the challenges associated with analysing compositional data have been discussed previously, we do so within a formal causal fr...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7660155/ https://www.ncbi.nlm.nih.gov/pubmed/32154892 http://dx.doi.org/10.1093/ije/dyaa021 |
_version_ | 1783608951158341632 |
---|---|
author | Arnold, Kellyn F Berrie, Laurie Tennant, Peter W G Gilthorpe, Mark S |
author_facet | Arnold, Kellyn F Berrie, Laurie Tennant, Peter W G Gilthorpe, Mark S |
author_sort | Arnold, Kellyn F |
collection | PubMed |
description | BACKGROUND: Compositional data comprise the parts of some whole, for which all parts sum to that whole. They are prevalent in many epidemiological contexts. Although many of the challenges associated with analysing compositional data have been discussed previously, we do so within a formal causal framework by utilizing directed acyclic graphs (DAGs). METHODS: We depict compositional data using DAGs and identify two distinct effect estimands in the generic case: (i) the total effect, and (ii) the relative effect. We consider each in the context of three specific example scenarios involving compositional data: (1) the relationship between the economically active population and area-level gross domestic product; (2) the relationship between fat consumption and body weight; and (3) the relationship between time spent sedentary and body weight. For each, we consider the distinct interpretation of each effect, and the resulting implications for related analyses. RESULTS: For scenarios (1) and (2), both the total and relative effects may be identifiable and causally meaningful, depending upon the specific question of interest. For scenario (3), only the relative effect is identifiable. In all scenarios, the relative effect represents a joint effect, and thus requires careful interpretation. CONCLUSIONS: DAGs are useful for considering causal effects for compositional data. In all analyses involving compositional data, researchers should explicitly consider and declare which causal effect is sought and how it should be interpreted. |
format | Online Article Text |
id | pubmed-7660155 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-76601552020-11-18 A causal inference perspective on the analysis of compositional data Arnold, Kellyn F Berrie, Laurie Tennant, Peter W G Gilthorpe, Mark S Int J Epidemiol Methods BACKGROUND: Compositional data comprise the parts of some whole, for which all parts sum to that whole. They are prevalent in many epidemiological contexts. Although many of the challenges associated with analysing compositional data have been discussed previously, we do so within a formal causal framework by utilizing directed acyclic graphs (DAGs). METHODS: We depict compositional data using DAGs and identify two distinct effect estimands in the generic case: (i) the total effect, and (ii) the relative effect. We consider each in the context of three specific example scenarios involving compositional data: (1) the relationship between the economically active population and area-level gross domestic product; (2) the relationship between fat consumption and body weight; and (3) the relationship between time spent sedentary and body weight. For each, we consider the distinct interpretation of each effect, and the resulting implications for related analyses. RESULTS: For scenarios (1) and (2), both the total and relative effects may be identifiable and causally meaningful, depending upon the specific question of interest. For scenario (3), only the relative effect is identifiable. In all scenarios, the relative effect represents a joint effect, and thus requires careful interpretation. CONCLUSIONS: DAGs are useful for considering causal effects for compositional data. In all analyses involving compositional data, researchers should explicitly consider and declare which causal effect is sought and how it should be interpreted. Oxford University Press 2020-03-10 /pmc/articles/PMC7660155/ /pubmed/32154892 http://dx.doi.org/10.1093/ije/dyaa021 Text en © The Author(s) 2020. Published by Oxford University Press on behalf of the International Epidemiological Association. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methods Arnold, Kellyn F Berrie, Laurie Tennant, Peter W G Gilthorpe, Mark S A causal inference perspective on the analysis of compositional data |
title | A causal inference perspective on the analysis of compositional data |
title_full | A causal inference perspective on the analysis of compositional data |
title_fullStr | A causal inference perspective on the analysis of compositional data |
title_full_unstemmed | A causal inference perspective on the analysis of compositional data |
title_short | A causal inference perspective on the analysis of compositional data |
title_sort | causal inference perspective on the analysis of compositional data |
topic | Methods |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7660155/ https://www.ncbi.nlm.nih.gov/pubmed/32154892 http://dx.doi.org/10.1093/ije/dyaa021 |
work_keys_str_mv | AT arnoldkellynf acausalinferenceperspectiveontheanalysisofcompositionaldata AT berrielaurie acausalinferenceperspectiveontheanalysisofcompositionaldata AT tennantpeterwg acausalinferenceperspectiveontheanalysisofcompositionaldata AT gilthorpemarks acausalinferenceperspectiveontheanalysisofcompositionaldata AT arnoldkellynf causalinferenceperspectiveontheanalysisofcompositionaldata AT berrielaurie causalinferenceperspectiveontheanalysisofcompositionaldata AT tennantpeterwg causalinferenceperspectiveontheanalysisofcompositionaldata AT gilthorpemarks causalinferenceperspectiveontheanalysisofcompositionaldata |