Cargando…

Estimation of total mediation effect for high-dimensional omics mediators

BACKGROUND: Environmental exposures can regulate intermediate molecular phenotypes, such as gene expression, by different mechanisms and thereby lead to various health outcomes. It is of significant scientific interest to unravel the role of potentially high-dimensional intermediate phenotypes in th...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Tianzhong, Niu, Jingbo, Chen, Han, Wei, Peng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8381496/
https://www.ncbi.nlm.nih.gov/pubmed/34425752
http://dx.doi.org/10.1186/s12859-021-04322-1
_version_ 1783741380981424128
author Yang, Tianzhong
Niu, Jingbo
Chen, Han
Wei, Peng
author_facet Yang, Tianzhong
Niu, Jingbo
Chen, Han
Wei, Peng
author_sort Yang, Tianzhong
collection PubMed
description BACKGROUND: Environmental exposures can regulate intermediate molecular phenotypes, such as gene expression, by different mechanisms and thereby lead to various health outcomes. It is of significant scientific interest to unravel the role of potentially high-dimensional intermediate phenotypes in the relationship between environmental exposure and traits. Mediation analysis is an important tool for investigating such relationships. However, it has mainly focused on low-dimensional settings, and there is a lack of a good measure of the total mediation effect. Here, we extend an R-squared (R[Formula: see text] ) effect size measure, originally proposed in the single-mediator setting, to the moderate- and high-dimensional mediator settings in the mixed model framework. RESULTS: Based on extensive simulations, we compare our measure and estimation procedure with several frequently used mediation measures, including product, proportion, and ratio measures. Our R[Formula: see text] -based second-moment measure has small bias and variance under the correctly specified model. To mitigate potential bias induced by non-mediators, we examine two variable selection procedures, i.e., iterative sure independence screening and false discovery rate control, to exclude the non-mediators. We establish the consistency of the proposed estimation procedures and introduce a resampling-based confidence interval. By applying the proposed estimation procedure, we found that 38% of the age-related variations in systolic blood pressure can be explained by gene expression profiles in the Framingham Heart Study of 1711 individuals. An R package “RsqMed” is available on CRAN. CONCLUSION: R-squared (R[Formula: see text] ) is an effective and efficient measure for total mediation effect especially under high-dimensional setting. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04322-1.
format Online
Article
Text
id pubmed-8381496
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-83814962021-08-23 Estimation of total mediation effect for high-dimensional omics mediators Yang, Tianzhong Niu, Jingbo Chen, Han Wei, Peng BMC Bioinformatics Methodology Article BACKGROUND: Environmental exposures can regulate intermediate molecular phenotypes, such as gene expression, by different mechanisms and thereby lead to various health outcomes. It is of significant scientific interest to unravel the role of potentially high-dimensional intermediate phenotypes in the relationship between environmental exposure and traits. Mediation analysis is an important tool for investigating such relationships. However, it has mainly focused on low-dimensional settings, and there is a lack of a good measure of the total mediation effect. Here, we extend an R-squared (R[Formula: see text] ) effect size measure, originally proposed in the single-mediator setting, to the moderate- and high-dimensional mediator settings in the mixed model framework. RESULTS: Based on extensive simulations, we compare our measure and estimation procedure with several frequently used mediation measures, including product, proportion, and ratio measures. Our R[Formula: see text] -based second-moment measure has small bias and variance under the correctly specified model. To mitigate potential bias induced by non-mediators, we examine two variable selection procedures, i.e., iterative sure independence screening and false discovery rate control, to exclude the non-mediators. We establish the consistency of the proposed estimation procedures and introduce a resampling-based confidence interval. By applying the proposed estimation procedure, we found that 38% of the age-related variations in systolic blood pressure can be explained by gene expression profiles in the Framingham Heart Study of 1711 individuals. An R package “RsqMed” is available on CRAN. CONCLUSION: R-squared (R[Formula: see text] ) is an effective and efficient measure for total mediation effect especially under high-dimensional setting. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04322-1. BioMed Central 2021-08-23 /pmc/articles/PMC8381496/ /pubmed/34425752 http://dx.doi.org/10.1186/s12859-021-04322-1 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Methodology Article
Yang, Tianzhong
Niu, Jingbo
Chen, Han
Wei, Peng
Estimation of total mediation effect for high-dimensional omics mediators
title Estimation of total mediation effect for high-dimensional omics mediators
title_full Estimation of total mediation effect for high-dimensional omics mediators
title_fullStr Estimation of total mediation effect for high-dimensional omics mediators
title_full_unstemmed Estimation of total mediation effect for high-dimensional omics mediators
title_short Estimation of total mediation effect for high-dimensional omics mediators
title_sort estimation of total mediation effect for high-dimensional omics mediators
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8381496/
https://www.ncbi.nlm.nih.gov/pubmed/34425752
http://dx.doi.org/10.1186/s12859-021-04322-1
work_keys_str_mv AT yangtianzhong estimationoftotalmediationeffectforhighdimensionalomicsmediators
AT niujingbo estimationoftotalmediationeffectforhighdimensionalomicsmediators
AT chenhan estimationoftotalmediationeffectforhighdimensionalomicsmediators
AT weipeng estimationoftotalmediationeffectforhighdimensionalomicsmediators