Cargando…

Massively parallel characterization of transcriptional regulatory elements in three diverse human cell types

The human genome contains millions of candidate cis-regulatory elements (CREs) with cell-type-specific activities that shape both health and myriad disease states. However, we lack a functional understanding of the sequence features that control the activity and cell-type-specific features of these...

Descripción completa

Detalles Bibliográficos
Autores principales: Agarwal, Vikram, Inoue, Fumitaka, Schubach, Max, Martin, Beth K., Dash, Pyaree Mohan, Zhang, Zicong, Sohota, Ajuni, Noble, William Stafford, Yardimci, Galip Gürkan, Kircher, Martin, Shendure, Jay, Ahituv, Nadav
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10028905/
https://www.ncbi.nlm.nih.gov/pubmed/36945371
http://dx.doi.org/10.1101/2023.03.05.531189
_version_ 1784910041245024256
author Agarwal, Vikram
Inoue, Fumitaka
Schubach, Max
Martin, Beth K.
Dash, Pyaree Mohan
Zhang, Zicong
Sohota, Ajuni
Noble, William Stafford
Yardimci, Galip Gürkan
Kircher, Martin
Shendure, Jay
Ahituv, Nadav
author_facet Agarwal, Vikram
Inoue, Fumitaka
Schubach, Max
Martin, Beth K.
Dash, Pyaree Mohan
Zhang, Zicong
Sohota, Ajuni
Noble, William Stafford
Yardimci, Galip Gürkan
Kircher, Martin
Shendure, Jay
Ahituv, Nadav
author_sort Agarwal, Vikram
collection PubMed
description The human genome contains millions of candidate cis-regulatory elements (CREs) with cell-type-specific activities that shape both health and myriad disease states. However, we lack a functional understanding of the sequence features that control the activity and cell-type-specific features of these CREs. Here, we used lentivirus-based massively parallel reporter assays (lentiMPRAs) to test the regulatory activity of over 680,000 sequences, representing a nearly comprehensive set of all annotated CREs among three cell types (HepG2, K562, and WTC11), finding 41.7% to be functional. By testing sequences in both orientations, we find promoters to have significant strand orientation effects. We also observe that their 200 nucleotide cores function as non-cell-type-specific ‘on switches’ providing similar expression levels to their associated gene. In contrast, enhancers have weaker orientation effects, but increased tissue-specific characteristics. Utilizing our lentiMPRA data, we develop sequence-based models to predict CRE function with high accuracy and delineate regulatory motifs. Testing an additional lentiMPRA library encompassing 60,000 CREs in all three cell types, we further identified factors that determine cell-type specificity. Collectively, our work provides an exhaustive catalog of functional CREs in three widely used cell lines, and showcases how large-scale functional measurements can be used to dissect regulatory grammar.
format Online
Article
Text
id pubmed-10028905
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-100289052023-03-22 Massively parallel characterization of transcriptional regulatory elements in three diverse human cell types Agarwal, Vikram Inoue, Fumitaka Schubach, Max Martin, Beth K. Dash, Pyaree Mohan Zhang, Zicong Sohota, Ajuni Noble, William Stafford Yardimci, Galip Gürkan Kircher, Martin Shendure, Jay Ahituv, Nadav bioRxiv Article The human genome contains millions of candidate cis-regulatory elements (CREs) with cell-type-specific activities that shape both health and myriad disease states. However, we lack a functional understanding of the sequence features that control the activity and cell-type-specific features of these CREs. Here, we used lentivirus-based massively parallel reporter assays (lentiMPRAs) to test the regulatory activity of over 680,000 sequences, representing a nearly comprehensive set of all annotated CREs among three cell types (HepG2, K562, and WTC11), finding 41.7% to be functional. By testing sequences in both orientations, we find promoters to have significant strand orientation effects. We also observe that their 200 nucleotide cores function as non-cell-type-specific ‘on switches’ providing similar expression levels to their associated gene. In contrast, enhancers have weaker orientation effects, but increased tissue-specific characteristics. Utilizing our lentiMPRA data, we develop sequence-based models to predict CRE function with high accuracy and delineate regulatory motifs. Testing an additional lentiMPRA library encompassing 60,000 CREs in all three cell types, we further identified factors that determine cell-type specificity. Collectively, our work provides an exhaustive catalog of functional CREs in three widely used cell lines, and showcases how large-scale functional measurements can be used to dissect regulatory grammar. Cold Spring Harbor Laboratory 2023-03-06 /pmc/articles/PMC10028905/ /pubmed/36945371 http://dx.doi.org/10.1101/2023.03.05.531189 Text en https://creativecommons.org/licenses/by-nd/4.0/This work is licensed under a Creative Commons Attribution-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, and only so long as attribution is given to the creator. The license allows for commercial use.
spellingShingle Article
Agarwal, Vikram
Inoue, Fumitaka
Schubach, Max
Martin, Beth K.
Dash, Pyaree Mohan
Zhang, Zicong
Sohota, Ajuni
Noble, William Stafford
Yardimci, Galip Gürkan
Kircher, Martin
Shendure, Jay
Ahituv, Nadav
Massively parallel characterization of transcriptional regulatory elements in three diverse human cell types
title Massively parallel characterization of transcriptional regulatory elements in three diverse human cell types
title_full Massively parallel characterization of transcriptional regulatory elements in three diverse human cell types
title_fullStr Massively parallel characterization of transcriptional regulatory elements in three diverse human cell types
title_full_unstemmed Massively parallel characterization of transcriptional regulatory elements in three diverse human cell types
title_short Massively parallel characterization of transcriptional regulatory elements in three diverse human cell types
title_sort massively parallel characterization of transcriptional regulatory elements in three diverse human cell types
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10028905/
https://www.ncbi.nlm.nih.gov/pubmed/36945371
http://dx.doi.org/10.1101/2023.03.05.531189
work_keys_str_mv AT agarwalvikram massivelyparallelcharacterizationoftranscriptionalregulatoryelementsinthreediversehumancelltypes
AT inouefumitaka massivelyparallelcharacterizationoftranscriptionalregulatoryelementsinthreediversehumancelltypes
AT schubachmax massivelyparallelcharacterizationoftranscriptionalregulatoryelementsinthreediversehumancelltypes
AT martinbethk massivelyparallelcharacterizationoftranscriptionalregulatoryelementsinthreediversehumancelltypes
AT dashpyareemohan massivelyparallelcharacterizationoftranscriptionalregulatoryelementsinthreediversehumancelltypes
AT zhangzicong massivelyparallelcharacterizationoftranscriptionalregulatoryelementsinthreediversehumancelltypes
AT sohotaajuni massivelyparallelcharacterizationoftranscriptionalregulatoryelementsinthreediversehumancelltypes
AT noblewilliamstafford massivelyparallelcharacterizationoftranscriptionalregulatoryelementsinthreediversehumancelltypes
AT yardimcigalipgurkan massivelyparallelcharacterizationoftranscriptionalregulatoryelementsinthreediversehumancelltypes
AT kirchermartin massivelyparallelcharacterizationoftranscriptionalregulatoryelementsinthreediversehumancelltypes
AT shendurejay massivelyparallelcharacterizationoftranscriptionalregulatoryelementsinthreediversehumancelltypes
AT ahituvnadav massivelyparallelcharacterizationoftranscriptionalregulatoryelementsinthreediversehumancelltypes