Cargando…

A unified mediation analysis framework for integrative cancer proteogenomics with clinical outcomes

MOTIVATION: Multilevel molecular profiling of tumors and the integrative analysis with clinical outcomes have enabled a deeper characterization of cancer treatment. Mediation analysis has emerged as a promising statistical tool to identify and quantify the intermediate mechanisms by which a gene aff...

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Licai, Long, James P, Irajizad, Ehsan, Doecke, James D, Do, Kim-Anh, Ha, Min Jin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9879726/
https://www.ncbi.nlm.nih.gov/pubmed/36648331
http://dx.doi.org/10.1093/bioinformatics/btad023
_version_ 1784878754856697856
author Huang, Licai
Long, James P
Irajizad, Ehsan
Doecke, James D
Do, Kim-Anh
Ha, Min Jin
author_facet Huang, Licai
Long, James P
Irajizad, Ehsan
Doecke, James D
Do, Kim-Anh
Ha, Min Jin
author_sort Huang, Licai
collection PubMed
description MOTIVATION: Multilevel molecular profiling of tumors and the integrative analysis with clinical outcomes have enabled a deeper characterization of cancer treatment. Mediation analysis has emerged as a promising statistical tool to identify and quantify the intermediate mechanisms by which a gene affects an outcome. However, existing methods lack a unified approach to handle various types of outcome variables, making them unsuitable for high-throughput molecular profiling data with highly interconnected variables. RESULTS: We develop a general mediation analysis framework for proteogenomic data that include multiple exposures, multivariate mediators on various scales of effects as appropriate for continuous, binary and survival outcomes. Our estimation method avoids imposing constraints on model parameters such as the rare disease assumption, while accommodating multiple exposures and high-dimensional mediators. We compare our approach to other methods in extensive simulation studies at a range of sample sizes, disease prevalence and number of false mediators. Using kidney renal clear cell carcinoma proteogenomic data, we identify genes that are mediated by proteins and the underlying mechanisms on various survival outcomes that capture short- and long-term disease-specific clinical characteristics. AVAILABILITY AND IMPLEMENTATION: Software is made available in an R package (https://github.com/longjp/mediateR). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-9879726
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-98797262023-01-31 A unified mediation analysis framework for integrative cancer proteogenomics with clinical outcomes Huang, Licai Long, James P Irajizad, Ehsan Doecke, James D Do, Kim-Anh Ha, Min Jin Bioinformatics Original Paper MOTIVATION: Multilevel molecular profiling of tumors and the integrative analysis with clinical outcomes have enabled a deeper characterization of cancer treatment. Mediation analysis has emerged as a promising statistical tool to identify and quantify the intermediate mechanisms by which a gene affects an outcome. However, existing methods lack a unified approach to handle various types of outcome variables, making them unsuitable for high-throughput molecular profiling data with highly interconnected variables. RESULTS: We develop a general mediation analysis framework for proteogenomic data that include multiple exposures, multivariate mediators on various scales of effects as appropriate for continuous, binary and survival outcomes. Our estimation method avoids imposing constraints on model parameters such as the rare disease assumption, while accommodating multiple exposures and high-dimensional mediators. We compare our approach to other methods in extensive simulation studies at a range of sample sizes, disease prevalence and number of false mediators. Using kidney renal clear cell carcinoma proteogenomic data, we identify genes that are mediated by proteins and the underlying mechanisms on various survival outcomes that capture short- and long-term disease-specific clinical characteristics. AVAILABILITY AND IMPLEMENTATION: Software is made available in an R package (https://github.com/longjp/mediateR). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2023-01-17 /pmc/articles/PMC9879726/ /pubmed/36648331 http://dx.doi.org/10.1093/bioinformatics/btad023 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Huang, Licai
Long, James P
Irajizad, Ehsan
Doecke, James D
Do, Kim-Anh
Ha, Min Jin
A unified mediation analysis framework for integrative cancer proteogenomics with clinical outcomes
title A unified mediation analysis framework for integrative cancer proteogenomics with clinical outcomes
title_full A unified mediation analysis framework for integrative cancer proteogenomics with clinical outcomes
title_fullStr A unified mediation analysis framework for integrative cancer proteogenomics with clinical outcomes
title_full_unstemmed A unified mediation analysis framework for integrative cancer proteogenomics with clinical outcomes
title_short A unified mediation analysis framework for integrative cancer proteogenomics with clinical outcomes
title_sort unified mediation analysis framework for integrative cancer proteogenomics with clinical outcomes
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9879726/
https://www.ncbi.nlm.nih.gov/pubmed/36648331
http://dx.doi.org/10.1093/bioinformatics/btad023
work_keys_str_mv AT huanglicai aunifiedmediationanalysisframeworkforintegrativecancerproteogenomicswithclinicaloutcomes
AT longjamesp aunifiedmediationanalysisframeworkforintegrativecancerproteogenomicswithclinicaloutcomes
AT irajizadehsan aunifiedmediationanalysisframeworkforintegrativecancerproteogenomicswithclinicaloutcomes
AT doeckejamesd aunifiedmediationanalysisframeworkforintegrativecancerproteogenomicswithclinicaloutcomes
AT dokimanh aunifiedmediationanalysisframeworkforintegrativecancerproteogenomicswithclinicaloutcomes
AT haminjin aunifiedmediationanalysisframeworkforintegrativecancerproteogenomicswithclinicaloutcomes
AT huanglicai unifiedmediationanalysisframeworkforintegrativecancerproteogenomicswithclinicaloutcomes
AT longjamesp unifiedmediationanalysisframeworkforintegrativecancerproteogenomicswithclinicaloutcomes
AT irajizadehsan unifiedmediationanalysisframeworkforintegrativecancerproteogenomicswithclinicaloutcomes
AT doeckejamesd unifiedmediationanalysisframeworkforintegrativecancerproteogenomicswithclinicaloutcomes
AT dokimanh unifiedmediationanalysisframeworkforintegrativecancerproteogenomicswithclinicaloutcomes
AT haminjin unifiedmediationanalysisframeworkforintegrativecancerproteogenomicswithclinicaloutcomes