Cargando…

Fast effect size shrinkage software for beta-binomial models of allelic imbalance

Allelic imbalance occurs when the two alleles of a gene are differentially expressed within a diploid organism and can indicate important differences in cis-regulation and epigenetic state across the two chromosomes. Because of this, the ability to accurately quantify the proportion at which each al...

Descripción completa

Detalles Bibliográficos
Autores principales: Zitovsky, Joshua P., Love, Michael I.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: F1000 Research Limited 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7974632/
https://www.ncbi.nlm.nih.gov/pubmed/33796271
http://dx.doi.org/10.12688/f1000research.20916.2
_version_ 1783666934133293056
author Zitovsky, Joshua P.
Love, Michael I.
author_facet Zitovsky, Joshua P.
Love, Michael I.
author_sort Zitovsky, Joshua P.
collection PubMed
description Allelic imbalance occurs when the two alleles of a gene are differentially expressed within a diploid organism and can indicate important differences in cis-regulation and epigenetic state across the two chromosomes. Because of this, the ability to accurately quantify the proportion at which each allele of a gene is expressed is of great interest to researchers. This becomes challenging in the presence of small read counts and/or sample sizes, which can cause estimators for allelic expression proportions to have high variance. Investigators have traditionally dealt with this problem by filtering out genes with small counts and samples. However, this may inadvertently remove important genes that have truly large allelic imbalances. Another option is to use pseudocounts or Bayesian estimators to reduce the variance. To this end, we evaluated the accuracy of four different estimators, the latter two of which are Bayesian shrinkage estimators: maximum likelihood, adding a pseudocount to each allele, approximate posterior estimation of GLM coefficients (apeglm) and adaptive shrinkage (ash). We also wrote C++ code to quickly calculate ML and apeglm estimates and integrated it into the apeglm package. The four methods were evaluated on two simulations and one real data set. Apeglm consistently performed better than ML according to a variety of criteria, and generally outperformed use of pseudocounts as well. Ash also performed better than ML in one of the simulations, but in the other performance was more mixed. Finally, when compared to five other packages that also fit beta-binomial models, the apeglm package was substantially faster and more numerically reliable, making our package useful for quick and reliable analyses of allelic imbalance. Apeglm is available as an R/Bioconductor package at http://bioconductor.org/packages/apeglm.
format Online
Article
Text
id pubmed-7974632
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher F1000 Research Limited
record_format MEDLINE/PubMed
spelling pubmed-79746322021-03-31 Fast effect size shrinkage software for beta-binomial models of allelic imbalance Zitovsky, Joshua P. Love, Michael I. F1000Res Method Article Allelic imbalance occurs when the two alleles of a gene are differentially expressed within a diploid organism and can indicate important differences in cis-regulation and epigenetic state across the two chromosomes. Because of this, the ability to accurately quantify the proportion at which each allele of a gene is expressed is of great interest to researchers. This becomes challenging in the presence of small read counts and/or sample sizes, which can cause estimators for allelic expression proportions to have high variance. Investigators have traditionally dealt with this problem by filtering out genes with small counts and samples. However, this may inadvertently remove important genes that have truly large allelic imbalances. Another option is to use pseudocounts or Bayesian estimators to reduce the variance. To this end, we evaluated the accuracy of four different estimators, the latter two of which are Bayesian shrinkage estimators: maximum likelihood, adding a pseudocount to each allele, approximate posterior estimation of GLM coefficients (apeglm) and adaptive shrinkage (ash). We also wrote C++ code to quickly calculate ML and apeglm estimates and integrated it into the apeglm package. The four methods were evaluated on two simulations and one real data set. Apeglm consistently performed better than ML according to a variety of criteria, and generally outperformed use of pseudocounts as well. Ash also performed better than ML in one of the simulations, but in the other performance was more mixed. Finally, when compared to five other packages that also fit beta-binomial models, the apeglm package was substantially faster and more numerically reliable, making our package useful for quick and reliable analyses of allelic imbalance. Apeglm is available as an R/Bioconductor package at http://bioconductor.org/packages/apeglm. F1000 Research Limited 2020-12-14 /pmc/articles/PMC7974632/ /pubmed/33796271 http://dx.doi.org/10.12688/f1000research.20916.2 Text en Copyright: © 2020 Zitovsky JP and Love MI http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Method Article
Zitovsky, Joshua P.
Love, Michael I.
Fast effect size shrinkage software for beta-binomial models of allelic imbalance
title Fast effect size shrinkage software for beta-binomial models of allelic imbalance
title_full Fast effect size shrinkage software for beta-binomial models of allelic imbalance
title_fullStr Fast effect size shrinkage software for beta-binomial models of allelic imbalance
title_full_unstemmed Fast effect size shrinkage software for beta-binomial models of allelic imbalance
title_short Fast effect size shrinkage software for beta-binomial models of allelic imbalance
title_sort fast effect size shrinkage software for beta-binomial models of allelic imbalance
topic Method Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7974632/
https://www.ncbi.nlm.nih.gov/pubmed/33796271
http://dx.doi.org/10.12688/f1000research.20916.2
work_keys_str_mv AT zitovskyjoshuap fasteffectsizeshrinkagesoftwareforbetabinomialmodelsofallelicimbalance
AT lovemichaeli fasteffectsizeshrinkagesoftwareforbetabinomialmodelsofallelicimbalance