Cargando…

FisherMP: fully parallel algorithm for detecting combinatorial motifs from large ChIP-seq datasets

Detecting binding motifs of combinatorial transcription factors (TFs) from chromatin immunoprecipitation sequencing (ChIP-seq) experiments is an important and challenging computational problem for understanding gene regulations. Although a number of motif-finding algorithms have been presented, most...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Shaoqiang, Liang, Ying, Wang, Xiangyun, Su, Zhengchang, Chen, Yong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6589551/
https://www.ncbi.nlm.nih.gov/pubmed/30957858
http://dx.doi.org/10.1093/dnares/dsz004
_version_ 1783429407941066752
author Zhang, Shaoqiang
Liang, Ying
Wang, Xiangyun
Su, Zhengchang
Chen, Yong
author_facet Zhang, Shaoqiang
Liang, Ying
Wang, Xiangyun
Su, Zhengchang
Chen, Yong
author_sort Zhang, Shaoqiang
collection PubMed
description Detecting binding motifs of combinatorial transcription factors (TFs) from chromatin immunoprecipitation sequencing (ChIP-seq) experiments is an important and challenging computational problem for understanding gene regulations. Although a number of motif-finding algorithms have been presented, most are either time consuming or have sub-optimal accuracy for processing large-scale datasets. In this article, we present a fully parallelized algorithm for detecting combinatorial motifs from ChIP-seq datasets by using Fisher combined method and OpenMP parallel design. Large scale validations on both synthetic data and 350 ChIP-seq datasets from the ENCODE database showed that FisherMP has not only super speeds on large datasets, but also has high accuracy when compared with multiple popular methods. By using FisherMP, we successfully detected combinatorial motifs of CTCF, YY1, MAZ, STAT3 and USF2 in chromosome X, suggesting that they are functional co-players in gene regulation and chromosomal organization. Integrative and statistical analysis of these TF-binding peaks clearly demonstrate that they are not only highly coordinated with each other, but that they are also correlated with histone modifications. FisherMP can be applied for integrative analysis of binding motifs and for predicting cis-regulatory modules from a large number of ChIP-seq datasets.
format Online
Article
Text
id pubmed-6589551
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-65895512019-06-27 FisherMP: fully parallel algorithm for detecting combinatorial motifs from large ChIP-seq datasets Zhang, Shaoqiang Liang, Ying Wang, Xiangyun Su, Zhengchang Chen, Yong DNA Res Full Papers Detecting binding motifs of combinatorial transcription factors (TFs) from chromatin immunoprecipitation sequencing (ChIP-seq) experiments is an important and challenging computational problem for understanding gene regulations. Although a number of motif-finding algorithms have been presented, most are either time consuming or have sub-optimal accuracy for processing large-scale datasets. In this article, we present a fully parallelized algorithm for detecting combinatorial motifs from ChIP-seq datasets by using Fisher combined method and OpenMP parallel design. Large scale validations on both synthetic data and 350 ChIP-seq datasets from the ENCODE database showed that FisherMP has not only super speeds on large datasets, but also has high accuracy when compared with multiple popular methods. By using FisherMP, we successfully detected combinatorial motifs of CTCF, YY1, MAZ, STAT3 and USF2 in chromosome X, suggesting that they are functional co-players in gene regulation and chromosomal organization. Integrative and statistical analysis of these TF-binding peaks clearly demonstrate that they are not only highly coordinated with each other, but that they are also correlated with histone modifications. FisherMP can be applied for integrative analysis of binding motifs and for predicting cis-regulatory modules from a large number of ChIP-seq datasets. Oxford University Press 2019-06 2019-04-08 /pmc/articles/PMC6589551/ /pubmed/30957858 http://dx.doi.org/10.1093/dnares/dsz004 Text en © The Author(s) 2019. Published by Oxford University Press on behalf of Kazusa DNA Research Institute. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Full Papers
Zhang, Shaoqiang
Liang, Ying
Wang, Xiangyun
Su, Zhengchang
Chen, Yong
FisherMP: fully parallel algorithm for detecting combinatorial motifs from large ChIP-seq datasets
title FisherMP: fully parallel algorithm for detecting combinatorial motifs from large ChIP-seq datasets
title_full FisherMP: fully parallel algorithm for detecting combinatorial motifs from large ChIP-seq datasets
title_fullStr FisherMP: fully parallel algorithm for detecting combinatorial motifs from large ChIP-seq datasets
title_full_unstemmed FisherMP: fully parallel algorithm for detecting combinatorial motifs from large ChIP-seq datasets
title_short FisherMP: fully parallel algorithm for detecting combinatorial motifs from large ChIP-seq datasets
title_sort fishermp: fully parallel algorithm for detecting combinatorial motifs from large chip-seq datasets
topic Full Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6589551/
https://www.ncbi.nlm.nih.gov/pubmed/30957858
http://dx.doi.org/10.1093/dnares/dsz004
work_keys_str_mv AT zhangshaoqiang fishermpfullyparallelalgorithmfordetectingcombinatorialmotifsfromlargechipseqdatasets
AT liangying fishermpfullyparallelalgorithmfordetectingcombinatorialmotifsfromlargechipseqdatasets
AT wangxiangyun fishermpfullyparallelalgorithmfordetectingcombinatorialmotifsfromlargechipseqdatasets
AT suzhengchang fishermpfullyparallelalgorithmfordetectingcombinatorialmotifsfromlargechipseqdatasets
AT chenyong fishermpfullyparallelalgorithmfordetectingcombinatorialmotifsfromlargechipseqdatasets