Cargando…

Comparative evaluation of DNase-seq footprint identification strategies

DNase I is an enzyme preferentially cleaving DNA in highly accessible regions. Recently, Next-Generation Sequencing has been applied to DNase I assays (DNase-seq) to obtain genome-wide maps of these accessible chromatin regions. With high-depth sequencing, DNase I cleavage sites can be identified wi...

Descripción completa

Detalles Bibliográficos
Autores principales: Barozzi, Iros, Bora, Pranami, Morelli, Marco J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4133688/
https://www.ncbi.nlm.nih.gov/pubmed/25177346
http://dx.doi.org/10.3389/fgene.2014.00278
_version_ 1782330774215720960
author Barozzi, Iros
Bora, Pranami
Morelli, Marco J.
author_facet Barozzi, Iros
Bora, Pranami
Morelli, Marco J.
author_sort Barozzi, Iros
collection PubMed
description DNase I is an enzyme preferentially cleaving DNA in highly accessible regions. Recently, Next-Generation Sequencing has been applied to DNase I assays (DNase-seq) to obtain genome-wide maps of these accessible chromatin regions. With high-depth sequencing, DNase I cleavage sites can be identified with base-pair resolution, revealing the presence of protected regions (“footprints”), corresponding to bound molecules on the DNA. Integrating footprint positions close to transcription start sites with motif analysis can reveal the presence of regulatory interactions between specific transcription factors (TFs) and genes. However, this inference heavily relies on the accuracy of the footprint call and on the sequencing depth of the DNase-seq experiment. Using ENCODE data, we comprehensively evaluate the performances of two recent footprint callers (Wellington and DNaseR) and one metric (the Footprint Occupancy Score, or FOS), and assess the consequences of different footprint calls on the reconstruction of TF-TF regulatory networks. We rate Wellington as the method of choice among those tested: not only its predictions are the best in terms of accuracy, but also the properties of the inferred networks are robust against sequencing depth.
format Online
Article
Text
id pubmed-4133688
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-41336882014-08-29 Comparative evaluation of DNase-seq footprint identification strategies Barozzi, Iros Bora, Pranami Morelli, Marco J. Front Genet Genetics DNase I is an enzyme preferentially cleaving DNA in highly accessible regions. Recently, Next-Generation Sequencing has been applied to DNase I assays (DNase-seq) to obtain genome-wide maps of these accessible chromatin regions. With high-depth sequencing, DNase I cleavage sites can be identified with base-pair resolution, revealing the presence of protected regions (“footprints”), corresponding to bound molecules on the DNA. Integrating footprint positions close to transcription start sites with motif analysis can reveal the presence of regulatory interactions between specific transcription factors (TFs) and genes. However, this inference heavily relies on the accuracy of the footprint call and on the sequencing depth of the DNase-seq experiment. Using ENCODE data, we comprehensively evaluate the performances of two recent footprint callers (Wellington and DNaseR) and one metric (the Footprint Occupancy Score, or FOS), and assess the consequences of different footprint calls on the reconstruction of TF-TF regulatory networks. We rate Wellington as the method of choice among those tested: not only its predictions are the best in terms of accuracy, but also the properties of the inferred networks are robust against sequencing depth. Frontiers Media S.A. 2014-08-15 /pmc/articles/PMC4133688/ /pubmed/25177346 http://dx.doi.org/10.3389/fgene.2014.00278 Text en Copyright © 2014 Barozzi, Bora and Morelli. http://creativecommons.org/licenses/by/3.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Barozzi, Iros
Bora, Pranami
Morelli, Marco J.
Comparative evaluation of DNase-seq footprint identification strategies
title Comparative evaluation of DNase-seq footprint identification strategies
title_full Comparative evaluation of DNase-seq footprint identification strategies
title_fullStr Comparative evaluation of DNase-seq footprint identification strategies
title_full_unstemmed Comparative evaluation of DNase-seq footprint identification strategies
title_short Comparative evaluation of DNase-seq footprint identification strategies
title_sort comparative evaluation of dnase-seq footprint identification strategies
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4133688/
https://www.ncbi.nlm.nih.gov/pubmed/25177346
http://dx.doi.org/10.3389/fgene.2014.00278
work_keys_str_mv AT barozziiros comparativeevaluationofdnaseseqfootprintidentificationstrategies
AT borapranami comparativeevaluationofdnaseseqfootprintidentificationstrategies
AT morellimarcoj comparativeevaluationofdnaseseqfootprintidentificationstrategies