Cargando…

Meta‐analysis of massively parallel reporter assays enables prediction of regulatory function across cell types

Deciphering the potential of noncoding loci to influence gene regulation has been the subject of intense research, with important implications in understanding genetic underpinnings of human diseases. Massively parallel reporter assays (MPRAs) can measure regulatory activity of thousands of DNA sequ...

Descripción completa

Detalles Bibliográficos
Autores principales: Kreimer, Anat, Yan, Zhongxia, Ahituv, Nadav, Yosef, Nir
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6771677/
https://www.ncbi.nlm.nih.gov/pubmed/31131957
http://dx.doi.org/10.1002/humu.23820
_version_ 1783455742593859584
author Kreimer, Anat
Yan, Zhongxia
Ahituv, Nadav
Yosef, Nir
author_facet Kreimer, Anat
Yan, Zhongxia
Ahituv, Nadav
Yosef, Nir
author_sort Kreimer, Anat
collection PubMed
description Deciphering the potential of noncoding loci to influence gene regulation has been the subject of intense research, with important implications in understanding genetic underpinnings of human diseases. Massively parallel reporter assays (MPRAs) can measure regulatory activity of thousands of DNA sequences and their variants in a single experiment. With increasing number of publically available MPRA data sets, one can now develop data‐driven models which, given a DNA sequence, predict its regulatory activity. Here, we performed a comprehensive meta‐analysis of several MPRA data sets in a variety of cellular contexts. We first applied an ensemble of methods to predict MPRA output in each context and observed that the most predictive features are consistent across data sets. We then demonstrate that predictive models trained in one cellular context can be used to predict MPRA output in another, with loss of accuracy attributed to cell‐type‐specific features. Finally, we show that our approach achieves top performance in the Fifth Critical Assessment of Genome Interpretation “Regulation Saturation” Challenge for predicting effects of single‐nucleotide variants. Overall, our analysis provides insights into how MPRA data can be leveraged to highlight functional regulatory regions throughout the genome and can guide effective design of future experiments by better prioritizing regions of interest.
format Online
Article
Text
id pubmed-6771677
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-67716772019-10-07 Meta‐analysis of massively parallel reporter assays enables prediction of regulatory function across cell types Kreimer, Anat Yan, Zhongxia Ahituv, Nadav Yosef, Nir Hum Mutat Special Articles Deciphering the potential of noncoding loci to influence gene regulation has been the subject of intense research, with important implications in understanding genetic underpinnings of human diseases. Massively parallel reporter assays (MPRAs) can measure regulatory activity of thousands of DNA sequences and their variants in a single experiment. With increasing number of publically available MPRA data sets, one can now develop data‐driven models which, given a DNA sequence, predict its regulatory activity. Here, we performed a comprehensive meta‐analysis of several MPRA data sets in a variety of cellular contexts. We first applied an ensemble of methods to predict MPRA output in each context and observed that the most predictive features are consistent across data sets. We then demonstrate that predictive models trained in one cellular context can be used to predict MPRA output in another, with loss of accuracy attributed to cell‐type‐specific features. Finally, we show that our approach achieves top performance in the Fifth Critical Assessment of Genome Interpretation “Regulation Saturation” Challenge for predicting effects of single‐nucleotide variants. Overall, our analysis provides insights into how MPRA data can be leveraged to highlight functional regulatory regions throughout the genome and can guide effective design of future experiments by better prioritizing regions of interest. John Wiley and Sons Inc. 2019-06-18 2019-09 /pmc/articles/PMC6771677/ /pubmed/31131957 http://dx.doi.org/10.1002/humu.23820 Text en © 2019 The Authors Human Mutation Published by Wiley Periodicals, Inc. This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Special Articles
Kreimer, Anat
Yan, Zhongxia
Ahituv, Nadav
Yosef, Nir
Meta‐analysis of massively parallel reporter assays enables prediction of regulatory function across cell types
title Meta‐analysis of massively parallel reporter assays enables prediction of regulatory function across cell types
title_full Meta‐analysis of massively parallel reporter assays enables prediction of regulatory function across cell types
title_fullStr Meta‐analysis of massively parallel reporter assays enables prediction of regulatory function across cell types
title_full_unstemmed Meta‐analysis of massively parallel reporter assays enables prediction of regulatory function across cell types
title_short Meta‐analysis of massively parallel reporter assays enables prediction of regulatory function across cell types
title_sort meta‐analysis of massively parallel reporter assays enables prediction of regulatory function across cell types
topic Special Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6771677/
https://www.ncbi.nlm.nih.gov/pubmed/31131957
http://dx.doi.org/10.1002/humu.23820
work_keys_str_mv AT kreimeranat metaanalysisofmassivelyparallelreporterassaysenablespredictionofregulatoryfunctionacrosscelltypes
AT yanzhongxia metaanalysisofmassivelyparallelreporterassaysenablespredictionofregulatoryfunctionacrosscelltypes
AT ahituvnadav metaanalysisofmassivelyparallelreporterassaysenablespredictionofregulatoryfunctionacrosscelltypes
AT yosefnir metaanalysisofmassivelyparallelreporterassaysenablespredictionofregulatoryfunctionacrosscelltypes