Cargando…

Using Summary Statistics to Model Multiplicative Combinations of Initially Analyzed Phenotypes With a Flexible Choice of Covariates

While the promise of electronic medical record and biobank data is large, major questions remain about patient privacy, computational hurdles, and data access. One promising area of recent development is pre-computing non-individually identifiable summary statistics to be made publicly available for...

Descripción completa

Detalles Bibliográficos
Autores principales: Wolf , Jack M., Westra, Jason, Tintle, Nathan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8546319/
https://www.ncbi.nlm.nih.gov/pubmed/34712269
http://dx.doi.org/10.3389/fgene.2021.745901
_version_ 1784590170107936768
author Wolf , Jack M.
Westra, Jason
Tintle, Nathan
author_facet Wolf , Jack M.
Westra, Jason
Tintle, Nathan
author_sort Wolf , Jack M.
collection PubMed
description While the promise of electronic medical record and biobank data is large, major questions remain about patient privacy, computational hurdles, and data access. One promising area of recent development is pre-computing non-individually identifiable summary statistics to be made publicly available for exploration and downstream analysis. In this manuscript we demonstrate how to utilize pre-computed linear association statistics between individual genetic variants and phenotypes to infer genetic relationships between products of phenotypes (e.g., ratios; logical combinations of binary phenotypes using “and” and “or”) with customized covariate choices. We propose a method to approximate covariate adjusted linear models for products and logical combinations of phenotypes using only pre-computed summary statistics. We evaluate our method’s accuracy through several simulation studies and an application modeling ratios of fatty acids using data from the Framingham Heart Study. These studies show consistent ability to recapitulate analysis results performed on individual level data including maintenance of the Type I error rate, power, and effect size estimates. An implementation of this proposed method is available in the publicly available R package pcsstools.
format Online
Article
Text
id pubmed-8546319
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-85463192021-10-27 Using Summary Statistics to Model Multiplicative Combinations of Initially Analyzed Phenotypes With a Flexible Choice of Covariates Wolf , Jack M. Westra, Jason Tintle, Nathan Front Genet Genetics While the promise of electronic medical record and biobank data is large, major questions remain about patient privacy, computational hurdles, and data access. One promising area of recent development is pre-computing non-individually identifiable summary statistics to be made publicly available for exploration and downstream analysis. In this manuscript we demonstrate how to utilize pre-computed linear association statistics between individual genetic variants and phenotypes to infer genetic relationships between products of phenotypes (e.g., ratios; logical combinations of binary phenotypes using “and” and “or”) with customized covariate choices. We propose a method to approximate covariate adjusted linear models for products and logical combinations of phenotypes using only pre-computed summary statistics. We evaluate our method’s accuracy through several simulation studies and an application modeling ratios of fatty acids using data from the Framingham Heart Study. These studies show consistent ability to recapitulate analysis results performed on individual level data including maintenance of the Type I error rate, power, and effect size estimates. An implementation of this proposed method is available in the publicly available R package pcsstools. Frontiers Media S.A. 2021-10-12 /pmc/articles/PMC8546319/ /pubmed/34712269 http://dx.doi.org/10.3389/fgene.2021.745901 Text en Copyright © 2021 Wolf , Westra and Tintle. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Wolf , Jack M.
Westra, Jason
Tintle, Nathan
Using Summary Statistics to Model Multiplicative Combinations of Initially Analyzed Phenotypes With a Flexible Choice of Covariates
title Using Summary Statistics to Model Multiplicative Combinations of Initially Analyzed Phenotypes With a Flexible Choice of Covariates
title_full Using Summary Statistics to Model Multiplicative Combinations of Initially Analyzed Phenotypes With a Flexible Choice of Covariates
title_fullStr Using Summary Statistics to Model Multiplicative Combinations of Initially Analyzed Phenotypes With a Flexible Choice of Covariates
title_full_unstemmed Using Summary Statistics to Model Multiplicative Combinations of Initially Analyzed Phenotypes With a Flexible Choice of Covariates
title_short Using Summary Statistics to Model Multiplicative Combinations of Initially Analyzed Phenotypes With a Flexible Choice of Covariates
title_sort using summary statistics to model multiplicative combinations of initially analyzed phenotypes with a flexible choice of covariates
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8546319/
https://www.ncbi.nlm.nih.gov/pubmed/34712269
http://dx.doi.org/10.3389/fgene.2021.745901
work_keys_str_mv AT wolfjackm usingsummarystatisticstomodelmultiplicativecombinationsofinitiallyanalyzedphenotypeswithaflexiblechoiceofcovariates
AT westrajason usingsummarystatisticstomodelmultiplicativecombinationsofinitiallyanalyzedphenotypeswithaflexiblechoiceofcovariates
AT tintlenathan usingsummarystatisticstomodelmultiplicativecombinationsofinitiallyanalyzedphenotypeswithaflexiblechoiceofcovariates