Cargando…

Challenges Raised by Mediation Analysis in a High-Dimension Setting

BACKGROUND: Mediation analysis is used in epidemiology to identify pathways through which exposures influence health. The advent of high-throughput (omics) technologies gives opportunities to perform mediation analysis with a high-dimension pool of covariates. OBJECTIVE: We aimed to highlight some b...

Descripción completa

Detalles Bibliográficos
Autores principales: Blum, Michaël G.B., Valeri, Linda, François, Olivier, Cadiou, Solène, Siroux, Valérie, Lepeule, Johanna, Slama, Rémy
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Environmental Health Perspectives 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7263455/
https://www.ncbi.nlm.nih.gov/pubmed/32379489
http://dx.doi.org/10.1289/EHP6240
_version_ 1783540799951077376
author Blum, Michaël G.B.
Valeri, Linda
François, Olivier
Cadiou, Solène
Siroux, Valérie
Lepeule, Johanna
Slama, Rémy
author_facet Blum, Michaël G.B.
Valeri, Linda
François, Olivier
Cadiou, Solène
Siroux, Valérie
Lepeule, Johanna
Slama, Rémy
author_sort Blum, Michaël G.B.
collection PubMed
description BACKGROUND: Mediation analysis is used in epidemiology to identify pathways through which exposures influence health. The advent of high-throughput (omics) technologies gives opportunities to perform mediation analysis with a high-dimension pool of covariates. OBJECTIVE: We aimed to highlight some biostatistical issues of this expanding field of high-dimension mediation. DISCUSSION: The mediation techniques used for a single mediator cannot be generalized in a straightforward manner to high-dimension mediation. Causal knowledge on the relation between covariates is required for mediation analysis, and it is expected to be more limited as dimension and system complexity increase. The methods developed in high dimension can be distinguished according to whether mediators are considered separately or as a whole. Methods considering each potential mediator separately do not allow efficient identification of the indirect effects when mutual influences exist among the mediators, which is expected for many biological (e.g., epigenetic) parameters. In this context, methods considering all potential mediators simultaneously, based, for example, on data reduction techniques, are more adapted to the causal inference framework. Their cost is a possible lack of ability to single out the causal mediators. Moreover, the ability of the mediators to predict the outcome can be overestimated, in particular because many machine-learning algorithms are optimized to increase predictive ability rather than their aptitude to make causal inference. Given the lack of overarching validated framework and the generally complex causal structure of high-dimension data, analysis of high-dimension mediation currently requires great caution and effort to incorporate a priori biological knowledge. https://doi.org/10.1289/EHP6240
format Online
Article
Text
id pubmed-7263455
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Environmental Health Perspectives
record_format MEDLINE/PubMed
spelling pubmed-72634552020-06-04 Challenges Raised by Mediation Analysis in a High-Dimension Setting Blum, Michaël G.B. Valeri, Linda François, Olivier Cadiou, Solène Siroux, Valérie Lepeule, Johanna Slama, Rémy Environ Health Perspect Commentary BACKGROUND: Mediation analysis is used in epidemiology to identify pathways through which exposures influence health. The advent of high-throughput (omics) technologies gives opportunities to perform mediation analysis with a high-dimension pool of covariates. OBJECTIVE: We aimed to highlight some biostatistical issues of this expanding field of high-dimension mediation. DISCUSSION: The mediation techniques used for a single mediator cannot be generalized in a straightforward manner to high-dimension mediation. Causal knowledge on the relation between covariates is required for mediation analysis, and it is expected to be more limited as dimension and system complexity increase. The methods developed in high dimension can be distinguished according to whether mediators are considered separately or as a whole. Methods considering each potential mediator separately do not allow efficient identification of the indirect effects when mutual influences exist among the mediators, which is expected for many biological (e.g., epigenetic) parameters. In this context, methods considering all potential mediators simultaneously, based, for example, on data reduction techniques, are more adapted to the causal inference framework. Their cost is a possible lack of ability to single out the causal mediators. Moreover, the ability of the mediators to predict the outcome can be overestimated, in particular because many machine-learning algorithms are optimized to increase predictive ability rather than their aptitude to make causal inference. Given the lack of overarching validated framework and the generally complex causal structure of high-dimension data, analysis of high-dimension mediation currently requires great caution and effort to incorporate a priori biological knowledge. https://doi.org/10.1289/EHP6240 Environmental Health Perspectives 2020-05-06 /pmc/articles/PMC7263455/ /pubmed/32379489 http://dx.doi.org/10.1289/EHP6240 Text en https://ehp.niehs.nih.gov/about-ehp/license EHP is an open-access journal published with support from the National Institute of Environmental Health Sciences, National Institutes of Health. All content is public domain unless otherwise noted.
spellingShingle Commentary
Blum, Michaël G.B.
Valeri, Linda
François, Olivier
Cadiou, Solène
Siroux, Valérie
Lepeule, Johanna
Slama, Rémy
Challenges Raised by Mediation Analysis in a High-Dimension Setting
title Challenges Raised by Mediation Analysis in a High-Dimension Setting
title_full Challenges Raised by Mediation Analysis in a High-Dimension Setting
title_fullStr Challenges Raised by Mediation Analysis in a High-Dimension Setting
title_full_unstemmed Challenges Raised by Mediation Analysis in a High-Dimension Setting
title_short Challenges Raised by Mediation Analysis in a High-Dimension Setting
title_sort challenges raised by mediation analysis in a high-dimension setting
topic Commentary
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7263455/
https://www.ncbi.nlm.nih.gov/pubmed/32379489
http://dx.doi.org/10.1289/EHP6240
work_keys_str_mv AT blummichaelgb challengesraisedbymediationanalysisinahighdimensionsetting
AT valerilinda challengesraisedbymediationanalysisinahighdimensionsetting
AT francoisolivier challengesraisedbymediationanalysisinahighdimensionsetting
AT cadiousolene challengesraisedbymediationanalysisinahighdimensionsetting
AT sirouxvalerie challengesraisedbymediationanalysisinahighdimensionsetting
AT lepeulejohanna challengesraisedbymediationanalysisinahighdimensionsetting
AT slamaremy challengesraisedbymediationanalysisinahighdimensionsetting