Cargando…
FisherMP: fully parallel algorithm for detecting combinatorial motifs from large ChIP-seq datasets
Detecting binding motifs of combinatorial transcription factors (TFs) from chromatin immunoprecipitation sequencing (ChIP-seq) experiments is an important and challenging computational problem for understanding gene regulations. Although a number of motif-finding algorithms have been presented, most...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6589551/ https://www.ncbi.nlm.nih.gov/pubmed/30957858 http://dx.doi.org/10.1093/dnares/dsz004 |
_version_ | 1783429407941066752 |
---|---|
author | Zhang, Shaoqiang Liang, Ying Wang, Xiangyun Su, Zhengchang Chen, Yong |
author_facet | Zhang, Shaoqiang Liang, Ying Wang, Xiangyun Su, Zhengchang Chen, Yong |
author_sort | Zhang, Shaoqiang |
collection | PubMed |
description | Detecting binding motifs of combinatorial transcription factors (TFs) from chromatin immunoprecipitation sequencing (ChIP-seq) experiments is an important and challenging computational problem for understanding gene regulations. Although a number of motif-finding algorithms have been presented, most are either time consuming or have sub-optimal accuracy for processing large-scale datasets. In this article, we present a fully parallelized algorithm for detecting combinatorial motifs from ChIP-seq datasets by using Fisher combined method and OpenMP parallel design. Large scale validations on both synthetic data and 350 ChIP-seq datasets from the ENCODE database showed that FisherMP has not only super speeds on large datasets, but also has high accuracy when compared with multiple popular methods. By using FisherMP, we successfully detected combinatorial motifs of CTCF, YY1, MAZ, STAT3 and USF2 in chromosome X, suggesting that they are functional co-players in gene regulation and chromosomal organization. Integrative and statistical analysis of these TF-binding peaks clearly demonstrate that they are not only highly coordinated with each other, but that they are also correlated with histone modifications. FisherMP can be applied for integrative analysis of binding motifs and for predicting cis-regulatory modules from a large number of ChIP-seq datasets. |
format | Online Article Text |
id | pubmed-6589551 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-65895512019-06-27 FisherMP: fully parallel algorithm for detecting combinatorial motifs from large ChIP-seq datasets Zhang, Shaoqiang Liang, Ying Wang, Xiangyun Su, Zhengchang Chen, Yong DNA Res Full Papers Detecting binding motifs of combinatorial transcription factors (TFs) from chromatin immunoprecipitation sequencing (ChIP-seq) experiments is an important and challenging computational problem for understanding gene regulations. Although a number of motif-finding algorithms have been presented, most are either time consuming or have sub-optimal accuracy for processing large-scale datasets. In this article, we present a fully parallelized algorithm for detecting combinatorial motifs from ChIP-seq datasets by using Fisher combined method and OpenMP parallel design. Large scale validations on both synthetic data and 350 ChIP-seq datasets from the ENCODE database showed that FisherMP has not only super speeds on large datasets, but also has high accuracy when compared with multiple popular methods. By using FisherMP, we successfully detected combinatorial motifs of CTCF, YY1, MAZ, STAT3 and USF2 in chromosome X, suggesting that they are functional co-players in gene regulation and chromosomal organization. Integrative and statistical analysis of these TF-binding peaks clearly demonstrate that they are not only highly coordinated with each other, but that they are also correlated with histone modifications. FisherMP can be applied for integrative analysis of binding motifs and for predicting cis-regulatory modules from a large number of ChIP-seq datasets. Oxford University Press 2019-06 2019-04-08 /pmc/articles/PMC6589551/ /pubmed/30957858 http://dx.doi.org/10.1093/dnares/dsz004 Text en © The Author(s) 2019. Published by Oxford University Press on behalf of Kazusa DNA Research Institute. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Full Papers Zhang, Shaoqiang Liang, Ying Wang, Xiangyun Su, Zhengchang Chen, Yong FisherMP: fully parallel algorithm for detecting combinatorial motifs from large ChIP-seq datasets |
title | FisherMP: fully parallel algorithm for detecting combinatorial motifs from large ChIP-seq datasets |
title_full | FisherMP: fully parallel algorithm for detecting combinatorial motifs from large ChIP-seq datasets |
title_fullStr | FisherMP: fully parallel algorithm for detecting combinatorial motifs from large ChIP-seq datasets |
title_full_unstemmed | FisherMP: fully parallel algorithm for detecting combinatorial motifs from large ChIP-seq datasets |
title_short | FisherMP: fully parallel algorithm for detecting combinatorial motifs from large ChIP-seq datasets |
title_sort | fishermp: fully parallel algorithm for detecting combinatorial motifs from large chip-seq datasets |
topic | Full Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6589551/ https://www.ncbi.nlm.nih.gov/pubmed/30957858 http://dx.doi.org/10.1093/dnares/dsz004 |
work_keys_str_mv | AT zhangshaoqiang fishermpfullyparallelalgorithmfordetectingcombinatorialmotifsfromlargechipseqdatasets AT liangying fishermpfullyparallelalgorithmfordetectingcombinatorialmotifsfromlargechipseqdatasets AT wangxiangyun fishermpfullyparallelalgorithmfordetectingcombinatorialmotifsfromlargechipseqdatasets AT suzhengchang fishermpfullyparallelalgorithmfordetectingcombinatorialmotifsfromlargechipseqdatasets AT chenyong fishermpfullyparallelalgorithmfordetectingcombinatorialmotifsfromlargechipseqdatasets |