Cargando…

TRACE: transcription factor footprinting using chromatin accessibility data and DNA sequence

Transcription is tightly regulated by cis-regulatory DNA elements where transcription factors (TFs) can bind. Thus, identification of TF binding sites (TFBSs) is key to understanding gene expression and whole regulatory networks within a cell. The standard approaches used for TFBS prediction, such a...

Descripción completa

Detalles Bibliográficos
Autores principales: Ouyang, Ningxin, Boyle, Alan P.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7397869/
https://www.ncbi.nlm.nih.gov/pubmed/32660981
http://dx.doi.org/10.1101/gr.258228.119
_version_ 1783565846093758464
author Ouyang, Ningxin
Boyle, Alan P.
author_facet Ouyang, Ningxin
Boyle, Alan P.
author_sort Ouyang, Ningxin
collection PubMed
description Transcription is tightly regulated by cis-regulatory DNA elements where transcription factors (TFs) can bind. Thus, identification of TF binding sites (TFBSs) is key to understanding gene expression and whole regulatory networks within a cell. The standard approaches used for TFBS prediction, such as position weight matrices (PWMs) and chromatin immunoprecipitation followed by sequencing (ChIP-seq), are widely used but have their drawbacks, including high false-positive rates and limited antibody availability, respectively. Several computational footprinting algorithms have been developed to detect TFBSs by investigating chromatin accessibility patterns; however, these also have limitations. We have developed a footprinting method to predict TF footprints in active chromatin elements (TRACE) to improve the prediction of TFBS footprints. TRACE incorporates DNase-seq data and PWMs within a multivariate hidden Markov model (HMM) to detect footprint-like regions with matching motifs. TRACE is an unsupervised method that accurately annotates binding sites for specific TFs automatically with no requirement for pregenerated candidate binding sites or ChIP-seq training data. Compared with published footprinting algorithms, TRACE has the best overall performance with the distinct advantage of targeting multiple motifs in a single model.
format Online
Article
Text
id pubmed-7397869
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-73978692021-01-01 TRACE: transcription factor footprinting using chromatin accessibility data and DNA sequence Ouyang, Ningxin Boyle, Alan P. Genome Res Method Transcription is tightly regulated by cis-regulatory DNA elements where transcription factors (TFs) can bind. Thus, identification of TF binding sites (TFBSs) is key to understanding gene expression and whole regulatory networks within a cell. The standard approaches used for TFBS prediction, such as position weight matrices (PWMs) and chromatin immunoprecipitation followed by sequencing (ChIP-seq), are widely used but have their drawbacks, including high false-positive rates and limited antibody availability, respectively. Several computational footprinting algorithms have been developed to detect TFBSs by investigating chromatin accessibility patterns; however, these also have limitations. We have developed a footprinting method to predict TF footprints in active chromatin elements (TRACE) to improve the prediction of TFBS footprints. TRACE incorporates DNase-seq data and PWMs within a multivariate hidden Markov model (HMM) to detect footprint-like regions with matching motifs. TRACE is an unsupervised method that accurately annotates binding sites for specific TFs automatically with no requirement for pregenerated candidate binding sites or ChIP-seq training data. Compared with published footprinting algorithms, TRACE has the best overall performance with the distinct advantage of targeting multiple motifs in a single model. Cold Spring Harbor Laboratory Press 2020-07 /pmc/articles/PMC7397869/ /pubmed/32660981 http://dx.doi.org/10.1101/gr.258228.119 Text en © 2020 Ouyang and Boyle; Published by Cold Spring Harbor Laboratory Press http://creativecommons.org/licenses/by-nc/4.0/ This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.
spellingShingle Method
Ouyang, Ningxin
Boyle, Alan P.
TRACE: transcription factor footprinting using chromatin accessibility data and DNA sequence
title TRACE: transcription factor footprinting using chromatin accessibility data and DNA sequence
title_full TRACE: transcription factor footprinting using chromatin accessibility data and DNA sequence
title_fullStr TRACE: transcription factor footprinting using chromatin accessibility data and DNA sequence
title_full_unstemmed TRACE: transcription factor footprinting using chromatin accessibility data and DNA sequence
title_short TRACE: transcription factor footprinting using chromatin accessibility data and DNA sequence
title_sort trace: transcription factor footprinting using chromatin accessibility data and dna sequence
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7397869/
https://www.ncbi.nlm.nih.gov/pubmed/32660981
http://dx.doi.org/10.1101/gr.258228.119
work_keys_str_mv AT ouyangningxin tracetranscriptionfactorfootprintingusingchromatinaccessibilitydataanddnasequence
AT boylealanp tracetranscriptionfactorfootprintingusingchromatinaccessibilitydataanddnasequence