Cargando…

Linear models enable powerful differential activity analysis in massively parallel reporter assays

BACKGROUND: Massively parallel reporter assays (MPRAs) have emerged as a popular means for understanding noncoding variation in a variety of conditions. While a large number of experiments have been described in the literature, analysis typically uses ad-hoc methods. There has been little attention...

Descripción completa

Detalles Bibliográficos
Autores principales: Myint, Leslie, Avramopoulos, Dimitrios G., Goff, Loyal A., Hansen, Kasper D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6417258/
https://www.ncbi.nlm.nih.gov/pubmed/30866806
http://dx.doi.org/10.1186/s12864-019-5556-x
_version_ 1783403533007060992
author Myint, Leslie
Avramopoulos, Dimitrios G.
Goff, Loyal A.
Hansen, Kasper D.
author_facet Myint, Leslie
Avramopoulos, Dimitrios G.
Goff, Loyal A.
Hansen, Kasper D.
author_sort Myint, Leslie
collection PubMed
description BACKGROUND: Massively parallel reporter assays (MPRAs) have emerged as a popular means for understanding noncoding variation in a variety of conditions. While a large number of experiments have been described in the literature, analysis typically uses ad-hoc methods. There has been little attention to comparing performance of methods across datasets. RESULTS: We present the mpralm method which we show is calibrated and powerful, by analyzing its performance on multiple MPRA datasets. We show that it outperforms existing statistical methods for analysis of this data type, in the first comprehensive evaluation of statistical methods on several datasets. We investigate theoretical and real-data properties of barcode summarization methods and show an unappreciated impact of summarization method for some datasets. Finally, we use our model to conduct a power analysis for this assay and show substantial improvements in power by performing up to 6 replicates per condition, whereas sequencing depth has smaller impact; we recommend to always use at least 4 replicates. An R package is available from the Bioconductor project. CONCLUSIONS: Together, these results inform recommendations for differential analysis, general group comparisons, and power analysis and will help improve design and analysis of MPRA experiments.
format Online
Article
Text
id pubmed-6417258
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-64172582019-03-25 Linear models enable powerful differential activity analysis in massively parallel reporter assays Myint, Leslie Avramopoulos, Dimitrios G. Goff, Loyal A. Hansen, Kasper D. BMC Genomics Methodology Article BACKGROUND: Massively parallel reporter assays (MPRAs) have emerged as a popular means for understanding noncoding variation in a variety of conditions. While a large number of experiments have been described in the literature, analysis typically uses ad-hoc methods. There has been little attention to comparing performance of methods across datasets. RESULTS: We present the mpralm method which we show is calibrated and powerful, by analyzing its performance on multiple MPRA datasets. We show that it outperforms existing statistical methods for analysis of this data type, in the first comprehensive evaluation of statistical methods on several datasets. We investigate theoretical and real-data properties of barcode summarization methods and show an unappreciated impact of summarization method for some datasets. Finally, we use our model to conduct a power analysis for this assay and show substantial improvements in power by performing up to 6 replicates per condition, whereas sequencing depth has smaller impact; we recommend to always use at least 4 replicates. An R package is available from the Bioconductor project. CONCLUSIONS: Together, these results inform recommendations for differential analysis, general group comparisons, and power analysis and will help improve design and analysis of MPRA experiments. BioMed Central 2019-03-12 /pmc/articles/PMC6417258/ /pubmed/30866806 http://dx.doi.org/10.1186/s12864-019-5556-x Text en © The Author(s) 2019 Open Access This article is distributed under the terms of theCreative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Myint, Leslie
Avramopoulos, Dimitrios G.
Goff, Loyal A.
Hansen, Kasper D.
Linear models enable powerful differential activity analysis in massively parallel reporter assays
title Linear models enable powerful differential activity analysis in massively parallel reporter assays
title_full Linear models enable powerful differential activity analysis in massively parallel reporter assays
title_fullStr Linear models enable powerful differential activity analysis in massively parallel reporter assays
title_full_unstemmed Linear models enable powerful differential activity analysis in massively parallel reporter assays
title_short Linear models enable powerful differential activity analysis in massively parallel reporter assays
title_sort linear models enable powerful differential activity analysis in massively parallel reporter assays
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6417258/
https://www.ncbi.nlm.nih.gov/pubmed/30866806
http://dx.doi.org/10.1186/s12864-019-5556-x
work_keys_str_mv AT myintleslie linearmodelsenablepowerfuldifferentialactivityanalysisinmassivelyparallelreporterassays
AT avramopoulosdimitriosg linearmodelsenablepowerfuldifferentialactivityanalysisinmassivelyparallelreporterassays
AT goffloyala linearmodelsenablepowerfuldifferentialactivityanalysisinmassivelyparallelreporterassays
AT hansenkasperd linearmodelsenablepowerfuldifferentialactivityanalysisinmassivelyparallelreporterassays