Cargando…
WSMD: weakly-supervised motif discovery in transcription factor ChIP-seq data
Although discriminative motif discovery (DMD) methods are promising for eliciting motifs from high-throughput experimental data, due to consideration of computational expense, most of existing DMD methods have to choose approximate schemes that greatly restrict the search space, leading to significa...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5468353/ https://www.ncbi.nlm.nih.gov/pubmed/28607381 http://dx.doi.org/10.1038/s41598-017-03554-7 |
_version_ | 1783243420861464576 |
---|---|
author | Zhang, Hongbo Zhu, Lin Huang, De-Shuang |
author_facet | Zhang, Hongbo Zhu, Lin Huang, De-Shuang |
author_sort | Zhang, Hongbo |
collection | PubMed |
description | Although discriminative motif discovery (DMD) methods are promising for eliciting motifs from high-throughput experimental data, due to consideration of computational expense, most of existing DMD methods have to choose approximate schemes that greatly restrict the search space, leading to significant loss of predictive accuracy. In this paper, we propose Weakly-Supervised Motif Discovery (WSMD) to discover motifs from ChIP-seq datasets. In contrast to the learning strategies adopted by previous DMD methods, WSMD allows a “global” optimization scheme of the motif parameters in continuous space, thereby reducing the information loss of model representation and improving the quality of resultant motifs. Meanwhile, by exploiting the connection between DMD framework and existing weakly supervised learning (WSL) technologies, we also present highly scalable learning strategies for the proposed method. The experimental results on both real ChIP-seq datasets and synthetic datasets show that WSMD substantially outperforms former DMD methods (including DREME, HOMER, XXmotif, motifRG and DECOD) in terms of predictive accuracy, while also achieving a competitive computational speed. |
format | Online Article Text |
id | pubmed-5468353 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-54683532017-06-14 WSMD: weakly-supervised motif discovery in transcription factor ChIP-seq data Zhang, Hongbo Zhu, Lin Huang, De-Shuang Sci Rep Article Although discriminative motif discovery (DMD) methods are promising for eliciting motifs from high-throughput experimental data, due to consideration of computational expense, most of existing DMD methods have to choose approximate schemes that greatly restrict the search space, leading to significant loss of predictive accuracy. In this paper, we propose Weakly-Supervised Motif Discovery (WSMD) to discover motifs from ChIP-seq datasets. In contrast to the learning strategies adopted by previous DMD methods, WSMD allows a “global” optimization scheme of the motif parameters in continuous space, thereby reducing the information loss of model representation and improving the quality of resultant motifs. Meanwhile, by exploiting the connection between DMD framework and existing weakly supervised learning (WSL) technologies, we also present highly scalable learning strategies for the proposed method. The experimental results on both real ChIP-seq datasets and synthetic datasets show that WSMD substantially outperforms former DMD methods (including DREME, HOMER, XXmotif, motifRG and DECOD) in terms of predictive accuracy, while also achieving a competitive computational speed. Nature Publishing Group UK 2017-06-12 /pmc/articles/PMC5468353/ /pubmed/28607381 http://dx.doi.org/10.1038/s41598-017-03554-7 Text en © The Author(s) 2017 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Zhang, Hongbo Zhu, Lin Huang, De-Shuang WSMD: weakly-supervised motif discovery in transcription factor ChIP-seq data |
title | WSMD: weakly-supervised motif discovery in transcription factor ChIP-seq data |
title_full | WSMD: weakly-supervised motif discovery in transcription factor ChIP-seq data |
title_fullStr | WSMD: weakly-supervised motif discovery in transcription factor ChIP-seq data |
title_full_unstemmed | WSMD: weakly-supervised motif discovery in transcription factor ChIP-seq data |
title_short | WSMD: weakly-supervised motif discovery in transcription factor ChIP-seq data |
title_sort | wsmd: weakly-supervised motif discovery in transcription factor chip-seq data |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5468353/ https://www.ncbi.nlm.nih.gov/pubmed/28607381 http://dx.doi.org/10.1038/s41598-017-03554-7 |
work_keys_str_mv | AT zhanghongbo wsmdweaklysupervisedmotifdiscoveryintranscriptionfactorchipseqdata AT zhulin wsmdweaklysupervisedmotifdiscoveryintranscriptionfactorchipseqdata AT huangdeshuang wsmdweaklysupervisedmotifdiscoveryintranscriptionfactorchipseqdata |