Cargando…

Wellington: a novel method for the accurate identification of digital genomic footprints from DNase-seq data

The expression of eukaryotic genes is regulated by cis-regulatory elements such as promoters and enhancers, which bind sequence-specific DNA-binding proteins. One of the great challenges in the gene regulation field is to characterise these elements. This involves the identification of transcription...

Descripción completa

Detalles Bibliográficos
Autores principales: Piper, Jason, Elze, Markus C., Cauchy, Pierre, Cockerill, Peter N., Bonifer, Constanze, Ott, Sascha
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3834841/
https://www.ncbi.nlm.nih.gov/pubmed/24071585
http://dx.doi.org/10.1093/nar/gkt850
_version_ 1782292054535045120
author Piper, Jason
Elze, Markus C.
Cauchy, Pierre
Cockerill, Peter N.
Bonifer, Constanze
Ott, Sascha
author_facet Piper, Jason
Elze, Markus C.
Cauchy, Pierre
Cockerill, Peter N.
Bonifer, Constanze
Ott, Sascha
author_sort Piper, Jason
collection PubMed
description The expression of eukaryotic genes is regulated by cis-regulatory elements such as promoters and enhancers, which bind sequence-specific DNA-binding proteins. One of the great challenges in the gene regulation field is to characterise these elements. This involves the identification of transcription factor (TF) binding sites within regulatory elements that are occupied in a defined regulatory context. Digestion with DNase and the subsequent analysis of regions protected from cleavage (DNase footprinting) has for many years been used to identify specific binding sites occupied by TFs at individual cis-elements with high resolution. This methodology has recently been adapted for high-throughput sequencing (DNase-seq). In this study, we describe an imbalance in the DNA strand-specific alignment information of DNase-seq data surrounding protein–DNA interactions that allows accurate prediction of occupied TF binding sites. Our study introduces a novel algorithm, Wellington, which considers the imbalance in this strand-specific information to efficiently identify DNA footprints. This algorithm significantly enhances specificity by reducing the proportion of false positives and requires significantly fewer predictions than previously reported methods to recapitulate an equal amount of ChIP-seq data. We also provide an open-source software package, pyDNase, which implements the Wellington algorithm to interface with DNase-seq data and expedite analyses.
format Online
Article
Text
id pubmed-3834841
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-38348412013-11-21 Wellington: a novel method for the accurate identification of digital genomic footprints from DNase-seq data Piper, Jason Elze, Markus C. Cauchy, Pierre Cockerill, Peter N. Bonifer, Constanze Ott, Sascha Nucleic Acids Res Methods Online The expression of eukaryotic genes is regulated by cis-regulatory elements such as promoters and enhancers, which bind sequence-specific DNA-binding proteins. One of the great challenges in the gene regulation field is to characterise these elements. This involves the identification of transcription factor (TF) binding sites within regulatory elements that are occupied in a defined regulatory context. Digestion with DNase and the subsequent analysis of regions protected from cleavage (DNase footprinting) has for many years been used to identify specific binding sites occupied by TFs at individual cis-elements with high resolution. This methodology has recently been adapted for high-throughput sequencing (DNase-seq). In this study, we describe an imbalance in the DNA strand-specific alignment information of DNase-seq data surrounding protein–DNA interactions that allows accurate prediction of occupied TF binding sites. Our study introduces a novel algorithm, Wellington, which considers the imbalance in this strand-specific information to efficiently identify DNA footprints. This algorithm significantly enhances specificity by reducing the proportion of false positives and requires significantly fewer predictions than previously reported methods to recapitulate an equal amount of ChIP-seq data. We also provide an open-source software package, pyDNase, which implements the Wellington algorithm to interface with DNase-seq data and expedite analyses. Oxford University Press 2013-11 2013-09-25 /pmc/articles/PMC3834841/ /pubmed/24071585 http://dx.doi.org/10.1093/nar/gkt850 Text en © The Author(s) 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Piper, Jason
Elze, Markus C.
Cauchy, Pierre
Cockerill, Peter N.
Bonifer, Constanze
Ott, Sascha
Wellington: a novel method for the accurate identification of digital genomic footprints from DNase-seq data
title Wellington: a novel method for the accurate identification of digital genomic footprints from DNase-seq data
title_full Wellington: a novel method for the accurate identification of digital genomic footprints from DNase-seq data
title_fullStr Wellington: a novel method for the accurate identification of digital genomic footprints from DNase-seq data
title_full_unstemmed Wellington: a novel method for the accurate identification of digital genomic footprints from DNase-seq data
title_short Wellington: a novel method for the accurate identification of digital genomic footprints from DNase-seq data
title_sort wellington: a novel method for the accurate identification of digital genomic footprints from dnase-seq data
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3834841/
https://www.ncbi.nlm.nih.gov/pubmed/24071585
http://dx.doi.org/10.1093/nar/gkt850
work_keys_str_mv AT piperjason wellingtonanovelmethodfortheaccurateidentificationofdigitalgenomicfootprintsfromdnaseseqdata
AT elzemarkusc wellingtonanovelmethodfortheaccurateidentificationofdigitalgenomicfootprintsfromdnaseseqdata
AT cauchypierre wellingtonanovelmethodfortheaccurateidentificationofdigitalgenomicfootprintsfromdnaseseqdata
AT cockerillpetern wellingtonanovelmethodfortheaccurateidentificationofdigitalgenomicfootprintsfromdnaseseqdata
AT boniferconstanze wellingtonanovelmethodfortheaccurateidentificationofdigitalgenomicfootprintsfromdnaseseqdata
AT ottsascha wellingtonanovelmethodfortheaccurateidentificationofdigitalgenomicfootprintsfromdnaseseqdata