Cargando…
Comparative evaluation of DNase-seq footprint identification strategies
DNase I is an enzyme preferentially cleaving DNA in highly accessible regions. Recently, Next-Generation Sequencing has been applied to DNase I assays (DNase-seq) to obtain genome-wide maps of these accessible chromatin regions. With high-depth sequencing, DNase I cleavage sites can be identified wi...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4133688/ https://www.ncbi.nlm.nih.gov/pubmed/25177346 http://dx.doi.org/10.3389/fgene.2014.00278 |
_version_ | 1782330774215720960 |
---|---|
author | Barozzi, Iros Bora, Pranami Morelli, Marco J. |
author_facet | Barozzi, Iros Bora, Pranami Morelli, Marco J. |
author_sort | Barozzi, Iros |
collection | PubMed |
description | DNase I is an enzyme preferentially cleaving DNA in highly accessible regions. Recently, Next-Generation Sequencing has been applied to DNase I assays (DNase-seq) to obtain genome-wide maps of these accessible chromatin regions. With high-depth sequencing, DNase I cleavage sites can be identified with base-pair resolution, revealing the presence of protected regions (“footprints”), corresponding to bound molecules on the DNA. Integrating footprint positions close to transcription start sites with motif analysis can reveal the presence of regulatory interactions between specific transcription factors (TFs) and genes. However, this inference heavily relies on the accuracy of the footprint call and on the sequencing depth of the DNase-seq experiment. Using ENCODE data, we comprehensively evaluate the performances of two recent footprint callers (Wellington and DNaseR) and one metric (the Footprint Occupancy Score, or FOS), and assess the consequences of different footprint calls on the reconstruction of TF-TF regulatory networks. We rate Wellington as the method of choice among those tested: not only its predictions are the best in terms of accuracy, but also the properties of the inferred networks are robust against sequencing depth. |
format | Online Article Text |
id | pubmed-4133688 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-41336882014-08-29 Comparative evaluation of DNase-seq footprint identification strategies Barozzi, Iros Bora, Pranami Morelli, Marco J. Front Genet Genetics DNase I is an enzyme preferentially cleaving DNA in highly accessible regions. Recently, Next-Generation Sequencing has been applied to DNase I assays (DNase-seq) to obtain genome-wide maps of these accessible chromatin regions. With high-depth sequencing, DNase I cleavage sites can be identified with base-pair resolution, revealing the presence of protected regions (“footprints”), corresponding to bound molecules on the DNA. Integrating footprint positions close to transcription start sites with motif analysis can reveal the presence of regulatory interactions between specific transcription factors (TFs) and genes. However, this inference heavily relies on the accuracy of the footprint call and on the sequencing depth of the DNase-seq experiment. Using ENCODE data, we comprehensively evaluate the performances of two recent footprint callers (Wellington and DNaseR) and one metric (the Footprint Occupancy Score, or FOS), and assess the consequences of different footprint calls on the reconstruction of TF-TF regulatory networks. We rate Wellington as the method of choice among those tested: not only its predictions are the best in terms of accuracy, but also the properties of the inferred networks are robust against sequencing depth. Frontiers Media S.A. 2014-08-15 /pmc/articles/PMC4133688/ /pubmed/25177346 http://dx.doi.org/10.3389/fgene.2014.00278 Text en Copyright © 2014 Barozzi, Bora and Morelli. http://creativecommons.org/licenses/by/3.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Barozzi, Iros Bora, Pranami Morelli, Marco J. Comparative evaluation of DNase-seq footprint identification strategies |
title | Comparative evaluation of DNase-seq footprint identification strategies |
title_full | Comparative evaluation of DNase-seq footprint identification strategies |
title_fullStr | Comparative evaluation of DNase-seq footprint identification strategies |
title_full_unstemmed | Comparative evaluation of DNase-seq footprint identification strategies |
title_short | Comparative evaluation of DNase-seq footprint identification strategies |
title_sort | comparative evaluation of dnase-seq footprint identification strategies |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4133688/ https://www.ncbi.nlm.nih.gov/pubmed/25177346 http://dx.doi.org/10.3389/fgene.2014.00278 |
work_keys_str_mv | AT barozziiros comparativeevaluationofdnaseseqfootprintidentificationstrategies AT borapranami comparativeevaluationofdnaseseqfootprintidentificationstrategies AT morellimarcoj comparativeevaluationofdnaseseqfootprintidentificationstrategies |