Cargando…

WSMD: weakly-supervised motif discovery in transcription factor ChIP-seq data

Although discriminative motif discovery (DMD) methods are promising for eliciting motifs from high-throughput experimental data, due to consideration of computational expense, most of existing DMD methods have to choose approximate schemes that greatly restrict the search space, leading to significa...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Hongbo, Zhu, Lin, Huang, De-Shuang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5468353/
https://www.ncbi.nlm.nih.gov/pubmed/28607381
http://dx.doi.org/10.1038/s41598-017-03554-7
_version_ 1783243420861464576
author Zhang, Hongbo
Zhu, Lin
Huang, De-Shuang
author_facet Zhang, Hongbo
Zhu, Lin
Huang, De-Shuang
author_sort Zhang, Hongbo
collection PubMed
description Although discriminative motif discovery (DMD) methods are promising for eliciting motifs from high-throughput experimental data, due to consideration of computational expense, most of existing DMD methods have to choose approximate schemes that greatly restrict the search space, leading to significant loss of predictive accuracy. In this paper, we propose Weakly-Supervised Motif Discovery (WSMD) to discover motifs from ChIP-seq datasets. In contrast to the learning strategies adopted by previous DMD methods, WSMD allows a “global” optimization scheme of the motif parameters in continuous space, thereby reducing the information loss of model representation and improving the quality of resultant motifs. Meanwhile, by exploiting the connection between DMD framework and existing weakly supervised learning (WSL) technologies, we also present highly scalable learning strategies for the proposed method. The experimental results on both real ChIP-seq datasets and synthetic datasets show that WSMD substantially outperforms former DMD methods (including DREME, HOMER, XXmotif, motifRG and DECOD) in terms of predictive accuracy, while also achieving a competitive computational speed.
format Online
Article
Text
id pubmed-5468353
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-54683532017-06-14 WSMD: weakly-supervised motif discovery in transcription factor ChIP-seq data Zhang, Hongbo Zhu, Lin Huang, De-Shuang Sci Rep Article Although discriminative motif discovery (DMD) methods are promising for eliciting motifs from high-throughput experimental data, due to consideration of computational expense, most of existing DMD methods have to choose approximate schemes that greatly restrict the search space, leading to significant loss of predictive accuracy. In this paper, we propose Weakly-Supervised Motif Discovery (WSMD) to discover motifs from ChIP-seq datasets. In contrast to the learning strategies adopted by previous DMD methods, WSMD allows a “global” optimization scheme of the motif parameters in continuous space, thereby reducing the information loss of model representation and improving the quality of resultant motifs. Meanwhile, by exploiting the connection between DMD framework and existing weakly supervised learning (WSL) technologies, we also present highly scalable learning strategies for the proposed method. The experimental results on both real ChIP-seq datasets and synthetic datasets show that WSMD substantially outperforms former DMD methods (including DREME, HOMER, XXmotif, motifRG and DECOD) in terms of predictive accuracy, while also achieving a competitive computational speed. Nature Publishing Group UK 2017-06-12 /pmc/articles/PMC5468353/ /pubmed/28607381 http://dx.doi.org/10.1038/s41598-017-03554-7 Text en © The Author(s) 2017 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Zhang, Hongbo
Zhu, Lin
Huang, De-Shuang
WSMD: weakly-supervised motif discovery in transcription factor ChIP-seq data
title WSMD: weakly-supervised motif discovery in transcription factor ChIP-seq data
title_full WSMD: weakly-supervised motif discovery in transcription factor ChIP-seq data
title_fullStr WSMD: weakly-supervised motif discovery in transcription factor ChIP-seq data
title_full_unstemmed WSMD: weakly-supervised motif discovery in transcription factor ChIP-seq data
title_short WSMD: weakly-supervised motif discovery in transcription factor ChIP-seq data
title_sort wsmd: weakly-supervised motif discovery in transcription factor chip-seq data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5468353/
https://www.ncbi.nlm.nih.gov/pubmed/28607381
http://dx.doi.org/10.1038/s41598-017-03554-7
work_keys_str_mv AT zhanghongbo wsmdweaklysupervisedmotifdiscoveryintranscriptionfactorchipseqdata
AT zhulin wsmdweaklysupervisedmotifdiscoveryintranscriptionfactorchipseqdata
AT huangdeshuang wsmdweaklysupervisedmotifdiscoveryintranscriptionfactorchipseqdata