Cargando…

A causal inference perspective on the analysis of compositional data

BACKGROUND: Compositional data comprise the parts of some whole, for which all parts sum to that whole. They are prevalent in many epidemiological contexts. Although many of the challenges associated with analysing compositional data have been discussed previously, we do so within a formal causal fr...

Descripción completa

Detalles Bibliográficos
Autores principales: Arnold, Kellyn F, Berrie, Laurie, Tennant, Peter W G, Gilthorpe, Mark S
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7660155/
https://www.ncbi.nlm.nih.gov/pubmed/32154892
http://dx.doi.org/10.1093/ije/dyaa021
_version_ 1783608951158341632
author Arnold, Kellyn F
Berrie, Laurie
Tennant, Peter W G
Gilthorpe, Mark S
author_facet Arnold, Kellyn F
Berrie, Laurie
Tennant, Peter W G
Gilthorpe, Mark S
author_sort Arnold, Kellyn F
collection PubMed
description BACKGROUND: Compositional data comprise the parts of some whole, for which all parts sum to that whole. They are prevalent in many epidemiological contexts. Although many of the challenges associated with analysing compositional data have been discussed previously, we do so within a formal causal framework by utilizing directed acyclic graphs (DAGs). METHODS: We depict compositional data using DAGs and identify two distinct effect estimands in the generic case: (i) the total effect, and (ii) the relative effect. We consider each in the context of three specific example scenarios involving compositional data: (1) the relationship between the economically active population and area-level gross domestic product; (2) the relationship between fat consumption and body weight; and (3) the relationship between time spent sedentary and body weight. For each, we consider the distinct interpretation of each effect, and the resulting implications for related analyses. RESULTS: For scenarios (1) and (2), both the total and relative effects may be identifiable and causally meaningful, depending upon the specific question of interest. For scenario (3), only the relative effect is identifiable. In all scenarios, the relative effect represents a joint effect, and thus requires careful interpretation. CONCLUSIONS: DAGs are useful for considering causal effects for compositional data. In all analyses involving compositional data, researchers should explicitly consider and declare which causal effect is sought and how it should be interpreted.
format Online
Article
Text
id pubmed-7660155
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-76601552020-11-18 A causal inference perspective on the analysis of compositional data Arnold, Kellyn F Berrie, Laurie Tennant, Peter W G Gilthorpe, Mark S Int J Epidemiol Methods BACKGROUND: Compositional data comprise the parts of some whole, for which all parts sum to that whole. They are prevalent in many epidemiological contexts. Although many of the challenges associated with analysing compositional data have been discussed previously, we do so within a formal causal framework by utilizing directed acyclic graphs (DAGs). METHODS: We depict compositional data using DAGs and identify two distinct effect estimands in the generic case: (i) the total effect, and (ii) the relative effect. We consider each in the context of three specific example scenarios involving compositional data: (1) the relationship between the economically active population and area-level gross domestic product; (2) the relationship between fat consumption and body weight; and (3) the relationship between time spent sedentary and body weight. For each, we consider the distinct interpretation of each effect, and the resulting implications for related analyses. RESULTS: For scenarios (1) and (2), both the total and relative effects may be identifiable and causally meaningful, depending upon the specific question of interest. For scenario (3), only the relative effect is identifiable. In all scenarios, the relative effect represents a joint effect, and thus requires careful interpretation. CONCLUSIONS: DAGs are useful for considering causal effects for compositional data. In all analyses involving compositional data, researchers should explicitly consider and declare which causal effect is sought and how it should be interpreted. Oxford University Press 2020-03-10 /pmc/articles/PMC7660155/ /pubmed/32154892 http://dx.doi.org/10.1093/ije/dyaa021 Text en © The Author(s) 2020. Published by Oxford University Press on behalf of the International Epidemiological Association. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods
Arnold, Kellyn F
Berrie, Laurie
Tennant, Peter W G
Gilthorpe, Mark S
A causal inference perspective on the analysis of compositional data
title A causal inference perspective on the analysis of compositional data
title_full A causal inference perspective on the analysis of compositional data
title_fullStr A causal inference perspective on the analysis of compositional data
title_full_unstemmed A causal inference perspective on the analysis of compositional data
title_short A causal inference perspective on the analysis of compositional data
title_sort causal inference perspective on the analysis of compositional data
topic Methods
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7660155/
https://www.ncbi.nlm.nih.gov/pubmed/32154892
http://dx.doi.org/10.1093/ije/dyaa021
work_keys_str_mv AT arnoldkellynf acausalinferenceperspectiveontheanalysisofcompositionaldata
AT berrielaurie acausalinferenceperspectiveontheanalysisofcompositionaldata
AT tennantpeterwg acausalinferenceperspectiveontheanalysisofcompositionaldata
AT gilthorpemarks acausalinferenceperspectiveontheanalysisofcompositionaldata
AT arnoldkellynf causalinferenceperspectiveontheanalysisofcompositionaldata
AT berrielaurie causalinferenceperspectiveontheanalysisofcompositionaldata
AT tennantpeterwg causalinferenceperspectiveontheanalysisofcompositionaldata
AT gilthorpemarks causalinferenceperspectiveontheanalysisofcompositionaldata