Cargando…
STARRPeaker: uniform processing and accurate identification of STARR-seq active regions
STARR-seq technology has employed progressively more complex genomic libraries and increased sequencing depths. An issue with the increased complexity and depth is that the coverage in STARR-seq experiments is non-uniform, overdispersed, and often confounded by sequencing biases, such as GC content....
Autores principales: | , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7722316/ https://www.ncbi.nlm.nih.gov/pubmed/33292397 http://dx.doi.org/10.1186/s13059-020-02194-x |
_version_ | 1783620127325945856 |
---|---|
author | Lee, Donghoon Shi, Manman Moran, Jennifer Wall, Martha Zhang, Jing Liu, Jason Fitzgerald, Dominic Kyono, Yasuhiro Ma, Lijia White, Kevin P. Gerstein, Mark |
author_facet | Lee, Donghoon Shi, Manman Moran, Jennifer Wall, Martha Zhang, Jing Liu, Jason Fitzgerald, Dominic Kyono, Yasuhiro Ma, Lijia White, Kevin P. Gerstein, Mark |
author_sort | Lee, Donghoon |
collection | PubMed |
description | STARR-seq technology has employed progressively more complex genomic libraries and increased sequencing depths. An issue with the increased complexity and depth is that the coverage in STARR-seq experiments is non-uniform, overdispersed, and often confounded by sequencing biases, such as GC content. Furthermore, STARR-seq readout is confounded by RNA secondary structure and thermodynamic stability. To address these potential confounders, we developed a negative binomial regression framework for uniformly processing STARR-seq data, called STARRPeaker. Moreover, to aid our effort, we generated whole-genome STARR-seq data from the HepG2 and K562 human cell lines and applied STARRPeaker to comprehensively and unbiasedly call enhancers in them. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-020-02194-x. |
format | Online Article Text |
id | pubmed-7722316 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-77223162020-12-08 STARRPeaker: uniform processing and accurate identification of STARR-seq active regions Lee, Donghoon Shi, Manman Moran, Jennifer Wall, Martha Zhang, Jing Liu, Jason Fitzgerald, Dominic Kyono, Yasuhiro Ma, Lijia White, Kevin P. Gerstein, Mark Genome Biol Method STARR-seq technology has employed progressively more complex genomic libraries and increased sequencing depths. An issue with the increased complexity and depth is that the coverage in STARR-seq experiments is non-uniform, overdispersed, and often confounded by sequencing biases, such as GC content. Furthermore, STARR-seq readout is confounded by RNA secondary structure and thermodynamic stability. To address these potential confounders, we developed a negative binomial regression framework for uniformly processing STARR-seq data, called STARRPeaker. Moreover, to aid our effort, we generated whole-genome STARR-seq data from the HepG2 and K562 human cell lines and applied STARRPeaker to comprehensively and unbiasedly call enhancers in them. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-020-02194-x. BioMed Central 2020-12-08 /pmc/articles/PMC7722316/ /pubmed/33292397 http://dx.doi.org/10.1186/s13059-020-02194-x Text en © The Author(s) 2020, corrected publication 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Method Lee, Donghoon Shi, Manman Moran, Jennifer Wall, Martha Zhang, Jing Liu, Jason Fitzgerald, Dominic Kyono, Yasuhiro Ma, Lijia White, Kevin P. Gerstein, Mark STARRPeaker: uniform processing and accurate identification of STARR-seq active regions |
title | STARRPeaker: uniform processing and accurate identification of STARR-seq active regions |
title_full | STARRPeaker: uniform processing and accurate identification of STARR-seq active regions |
title_fullStr | STARRPeaker: uniform processing and accurate identification of STARR-seq active regions |
title_full_unstemmed | STARRPeaker: uniform processing and accurate identification of STARR-seq active regions |
title_short | STARRPeaker: uniform processing and accurate identification of STARR-seq active regions |
title_sort | starrpeaker: uniform processing and accurate identification of starr-seq active regions |
topic | Method |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7722316/ https://www.ncbi.nlm.nih.gov/pubmed/33292397 http://dx.doi.org/10.1186/s13059-020-02194-x |
work_keys_str_mv | AT leedonghoon starrpeakeruniformprocessingandaccurateidentificationofstarrseqactiveregions AT shimanman starrpeakeruniformprocessingandaccurateidentificationofstarrseqactiveregions AT moranjennifer starrpeakeruniformprocessingandaccurateidentificationofstarrseqactiveregions AT wallmartha starrpeakeruniformprocessingandaccurateidentificationofstarrseqactiveregions AT zhangjing starrpeakeruniformprocessingandaccurateidentificationofstarrseqactiveregions AT liujason starrpeakeruniformprocessingandaccurateidentificationofstarrseqactiveregions AT fitzgeralddominic starrpeakeruniformprocessingandaccurateidentificationofstarrseqactiveregions AT kyonoyasuhiro starrpeakeruniformprocessingandaccurateidentificationofstarrseqactiveregions AT malijia starrpeakeruniformprocessingandaccurateidentificationofstarrseqactiveregions AT whitekevinp starrpeakeruniformprocessingandaccurateidentificationofstarrseqactiveregions AT gersteinmark starrpeakeruniformprocessingandaccurateidentificationofstarrseqactiveregions |