Cargando…

STARRPeaker: uniform processing and accurate identification of STARR-seq active regions

STARR-seq technology has employed progressively more complex genomic libraries and increased sequencing depths. An issue with the increased complexity and depth is that the coverage in STARR-seq experiments is non-uniform, overdispersed, and often confounded by sequencing biases, such as GC content....

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Donghoon, Shi, Manman, Moran, Jennifer, Wall, Martha, Zhang, Jing, Liu, Jason, Fitzgerald, Dominic, Kyono, Yasuhiro, Ma, Lijia, White, Kevin P., Gerstein, Mark
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7722316/
https://www.ncbi.nlm.nih.gov/pubmed/33292397
http://dx.doi.org/10.1186/s13059-020-02194-x
_version_ 1783620127325945856
author Lee, Donghoon
Shi, Manman
Moran, Jennifer
Wall, Martha
Zhang, Jing
Liu, Jason
Fitzgerald, Dominic
Kyono, Yasuhiro
Ma, Lijia
White, Kevin P.
Gerstein, Mark
author_facet Lee, Donghoon
Shi, Manman
Moran, Jennifer
Wall, Martha
Zhang, Jing
Liu, Jason
Fitzgerald, Dominic
Kyono, Yasuhiro
Ma, Lijia
White, Kevin P.
Gerstein, Mark
author_sort Lee, Donghoon
collection PubMed
description STARR-seq technology has employed progressively more complex genomic libraries and increased sequencing depths. An issue with the increased complexity and depth is that the coverage in STARR-seq experiments is non-uniform, overdispersed, and often confounded by sequencing biases, such as GC content. Furthermore, STARR-seq readout is confounded by RNA secondary structure and thermodynamic stability. To address these potential confounders, we developed a negative binomial regression framework for uniformly processing STARR-seq data, called STARRPeaker. Moreover, to aid our effort, we generated whole-genome STARR-seq data from the HepG2 and K562 human cell lines and applied STARRPeaker to comprehensively and unbiasedly call enhancers in them. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-020-02194-x.
format Online
Article
Text
id pubmed-7722316
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-77223162020-12-08 STARRPeaker: uniform processing and accurate identification of STARR-seq active regions Lee, Donghoon Shi, Manman Moran, Jennifer Wall, Martha Zhang, Jing Liu, Jason Fitzgerald, Dominic Kyono, Yasuhiro Ma, Lijia White, Kevin P. Gerstein, Mark Genome Biol Method STARR-seq technology has employed progressively more complex genomic libraries and increased sequencing depths. An issue with the increased complexity and depth is that the coverage in STARR-seq experiments is non-uniform, overdispersed, and often confounded by sequencing biases, such as GC content. Furthermore, STARR-seq readout is confounded by RNA secondary structure and thermodynamic stability. To address these potential confounders, we developed a negative binomial regression framework for uniformly processing STARR-seq data, called STARRPeaker. Moreover, to aid our effort, we generated whole-genome STARR-seq data from the HepG2 and K562 human cell lines and applied STARRPeaker to comprehensively and unbiasedly call enhancers in them. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-020-02194-x. BioMed Central 2020-12-08 /pmc/articles/PMC7722316/ /pubmed/33292397 http://dx.doi.org/10.1186/s13059-020-02194-x Text en © The Author(s) 2020, corrected publication 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Method
Lee, Donghoon
Shi, Manman
Moran, Jennifer
Wall, Martha
Zhang, Jing
Liu, Jason
Fitzgerald, Dominic
Kyono, Yasuhiro
Ma, Lijia
White, Kevin P.
Gerstein, Mark
STARRPeaker: uniform processing and accurate identification of STARR-seq active regions
title STARRPeaker: uniform processing and accurate identification of STARR-seq active regions
title_full STARRPeaker: uniform processing and accurate identification of STARR-seq active regions
title_fullStr STARRPeaker: uniform processing and accurate identification of STARR-seq active regions
title_full_unstemmed STARRPeaker: uniform processing and accurate identification of STARR-seq active regions
title_short STARRPeaker: uniform processing and accurate identification of STARR-seq active regions
title_sort starrpeaker: uniform processing and accurate identification of starr-seq active regions
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7722316/
https://www.ncbi.nlm.nih.gov/pubmed/33292397
http://dx.doi.org/10.1186/s13059-020-02194-x
work_keys_str_mv AT leedonghoon starrpeakeruniformprocessingandaccurateidentificationofstarrseqactiveregions
AT shimanman starrpeakeruniformprocessingandaccurateidentificationofstarrseqactiveregions
AT moranjennifer starrpeakeruniformprocessingandaccurateidentificationofstarrseqactiveregions
AT wallmartha starrpeakeruniformprocessingandaccurateidentificationofstarrseqactiveregions
AT zhangjing starrpeakeruniformprocessingandaccurateidentificationofstarrseqactiveregions
AT liujason starrpeakeruniformprocessingandaccurateidentificationofstarrseqactiveregions
AT fitzgeralddominic starrpeakeruniformprocessingandaccurateidentificationofstarrseqactiveregions
AT kyonoyasuhiro starrpeakeruniformprocessingandaccurateidentificationofstarrseqactiveregions
AT malijia starrpeakeruniformprocessingandaccurateidentificationofstarrseqactiveregions
AT whitekevinp starrpeakeruniformprocessingandaccurateidentificationofstarrseqactiveregions
AT gersteinmark starrpeakeruniformprocessingandaccurateidentificationofstarrseqactiveregions