Cargando…
Massively parallel characterization of transcriptional regulatory elements in three diverse human cell types
The human genome contains millions of candidate cis-regulatory elements (CREs) with cell-type-specific activities that shape both health and myriad disease states. However, we lack a functional understanding of the sequence features that control the activity and cell-type-specific features of these...
Autores principales: | , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10028905/ https://www.ncbi.nlm.nih.gov/pubmed/36945371 http://dx.doi.org/10.1101/2023.03.05.531189 |
_version_ | 1784910041245024256 |
---|---|
author | Agarwal, Vikram Inoue, Fumitaka Schubach, Max Martin, Beth K. Dash, Pyaree Mohan Zhang, Zicong Sohota, Ajuni Noble, William Stafford Yardimci, Galip Gürkan Kircher, Martin Shendure, Jay Ahituv, Nadav |
author_facet | Agarwal, Vikram Inoue, Fumitaka Schubach, Max Martin, Beth K. Dash, Pyaree Mohan Zhang, Zicong Sohota, Ajuni Noble, William Stafford Yardimci, Galip Gürkan Kircher, Martin Shendure, Jay Ahituv, Nadav |
author_sort | Agarwal, Vikram |
collection | PubMed |
description | The human genome contains millions of candidate cis-regulatory elements (CREs) with cell-type-specific activities that shape both health and myriad disease states. However, we lack a functional understanding of the sequence features that control the activity and cell-type-specific features of these CREs. Here, we used lentivirus-based massively parallel reporter assays (lentiMPRAs) to test the regulatory activity of over 680,000 sequences, representing a nearly comprehensive set of all annotated CREs among three cell types (HepG2, K562, and WTC11), finding 41.7% to be functional. By testing sequences in both orientations, we find promoters to have significant strand orientation effects. We also observe that their 200 nucleotide cores function as non-cell-type-specific ‘on switches’ providing similar expression levels to their associated gene. In contrast, enhancers have weaker orientation effects, but increased tissue-specific characteristics. Utilizing our lentiMPRA data, we develop sequence-based models to predict CRE function with high accuracy and delineate regulatory motifs. Testing an additional lentiMPRA library encompassing 60,000 CREs in all three cell types, we further identified factors that determine cell-type specificity. Collectively, our work provides an exhaustive catalog of functional CREs in three widely used cell lines, and showcases how large-scale functional measurements can be used to dissect regulatory grammar. |
format | Online Article Text |
id | pubmed-10028905 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Cold Spring Harbor Laboratory |
record_format | MEDLINE/PubMed |
spelling | pubmed-100289052023-03-22 Massively parallel characterization of transcriptional regulatory elements in three diverse human cell types Agarwal, Vikram Inoue, Fumitaka Schubach, Max Martin, Beth K. Dash, Pyaree Mohan Zhang, Zicong Sohota, Ajuni Noble, William Stafford Yardimci, Galip Gürkan Kircher, Martin Shendure, Jay Ahituv, Nadav bioRxiv Article The human genome contains millions of candidate cis-regulatory elements (CREs) with cell-type-specific activities that shape both health and myriad disease states. However, we lack a functional understanding of the sequence features that control the activity and cell-type-specific features of these CREs. Here, we used lentivirus-based massively parallel reporter assays (lentiMPRAs) to test the regulatory activity of over 680,000 sequences, representing a nearly comprehensive set of all annotated CREs among three cell types (HepG2, K562, and WTC11), finding 41.7% to be functional. By testing sequences in both orientations, we find promoters to have significant strand orientation effects. We also observe that their 200 nucleotide cores function as non-cell-type-specific ‘on switches’ providing similar expression levels to their associated gene. In contrast, enhancers have weaker orientation effects, but increased tissue-specific characteristics. Utilizing our lentiMPRA data, we develop sequence-based models to predict CRE function with high accuracy and delineate regulatory motifs. Testing an additional lentiMPRA library encompassing 60,000 CREs in all three cell types, we further identified factors that determine cell-type specificity. Collectively, our work provides an exhaustive catalog of functional CREs in three widely used cell lines, and showcases how large-scale functional measurements can be used to dissect regulatory grammar. Cold Spring Harbor Laboratory 2023-03-06 /pmc/articles/PMC10028905/ /pubmed/36945371 http://dx.doi.org/10.1101/2023.03.05.531189 Text en https://creativecommons.org/licenses/by-nd/4.0/This work is licensed under a Creative Commons Attribution-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, and only so long as attribution is given to the creator. The license allows for commercial use. |
spellingShingle | Article Agarwal, Vikram Inoue, Fumitaka Schubach, Max Martin, Beth K. Dash, Pyaree Mohan Zhang, Zicong Sohota, Ajuni Noble, William Stafford Yardimci, Galip Gürkan Kircher, Martin Shendure, Jay Ahituv, Nadav Massively parallel characterization of transcriptional regulatory elements in three diverse human cell types |
title | Massively parallel characterization of transcriptional regulatory elements in three diverse human cell types |
title_full | Massively parallel characterization of transcriptional regulatory elements in three diverse human cell types |
title_fullStr | Massively parallel characterization of transcriptional regulatory elements in three diverse human cell types |
title_full_unstemmed | Massively parallel characterization of transcriptional regulatory elements in three diverse human cell types |
title_short | Massively parallel characterization of transcriptional regulatory elements in three diverse human cell types |
title_sort | massively parallel characterization of transcriptional regulatory elements in three diverse human cell types |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10028905/ https://www.ncbi.nlm.nih.gov/pubmed/36945371 http://dx.doi.org/10.1101/2023.03.05.531189 |
work_keys_str_mv | AT agarwalvikram massivelyparallelcharacterizationoftranscriptionalregulatoryelementsinthreediversehumancelltypes AT inouefumitaka massivelyparallelcharacterizationoftranscriptionalregulatoryelementsinthreediversehumancelltypes AT schubachmax massivelyparallelcharacterizationoftranscriptionalregulatoryelementsinthreediversehumancelltypes AT martinbethk massivelyparallelcharacterizationoftranscriptionalregulatoryelementsinthreediversehumancelltypes AT dashpyareemohan massivelyparallelcharacterizationoftranscriptionalregulatoryelementsinthreediversehumancelltypes AT zhangzicong massivelyparallelcharacterizationoftranscriptionalregulatoryelementsinthreediversehumancelltypes AT sohotaajuni massivelyparallelcharacterizationoftranscriptionalregulatoryelementsinthreediversehumancelltypes AT noblewilliamstafford massivelyparallelcharacterizationoftranscriptionalregulatoryelementsinthreediversehumancelltypes AT yardimcigalipgurkan massivelyparallelcharacterizationoftranscriptionalregulatoryelementsinthreediversehumancelltypes AT kirchermartin massivelyparallelcharacterizationoftranscriptionalregulatoryelementsinthreediversehumancelltypes AT shendurejay massivelyparallelcharacterizationoftranscriptionalregulatoryelementsinthreediversehumancelltypes AT ahituvnadav massivelyparallelcharacterizationoftranscriptionalregulatoryelementsinthreediversehumancelltypes |