Cargando…
GimmeMotifs: a de novo motif prediction pipeline for ChIP-sequencing experiments
Summary: Accurate prediction of transcription factor binding motifs that are enriched in a collection of sequences remains a computational challenge. Here we report on GimmeMotifs, a pipeline that incorporates an ensemble of computational tools to predict motifs de novo from ChIP-sequencing (ChIP-se...
Autores principales: | , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3018809/ https://www.ncbi.nlm.nih.gov/pubmed/21081511 http://dx.doi.org/10.1093/bioinformatics/btq636 |
_version_ | 1782196121301417984 |
---|---|
author | van Heeringen, Simon J. Veenstra, Gert Jan C. |
author_facet | van Heeringen, Simon J. Veenstra, Gert Jan C. |
author_sort | van Heeringen, Simon J. |
collection | PubMed |
description | Summary: Accurate prediction of transcription factor binding motifs that are enriched in a collection of sequences remains a computational challenge. Here we report on GimmeMotifs, a pipeline that incorporates an ensemble of computational tools to predict motifs de novo from ChIP-sequencing (ChIP-seq) data. Similar redundant motifs are compared using the weighted information content (WIC) similarity score and clustered using an iterative procedure. A comprehensive output report is generated with several different evaluation metrics to compare and evaluate the results. Benchmarks show that the method performs well on human and mouse ChIP-seq datasets. GimmeMotifs consists of a suite of command-line scripts that can be easily implemented in a ChIP-seq analysis pipeline. Availability: GimmeMotifs is implemented in Python and runs on Linux. The source code is freely available for download at http://www.ncmls.eu/bioinfo/gimmemotifs/. Contact: s.vanheeringen@ncmls.ru.nl Supplementary Information: Supplementary data are available at Bioinformatics online. |
format | Text |
id | pubmed-3018809 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-30188092011-01-12 GimmeMotifs: a de novo motif prediction pipeline for ChIP-sequencing experiments van Heeringen, Simon J. Veenstra, Gert Jan C. Bioinformatics Applications Note Summary: Accurate prediction of transcription factor binding motifs that are enriched in a collection of sequences remains a computational challenge. Here we report on GimmeMotifs, a pipeline that incorporates an ensemble of computational tools to predict motifs de novo from ChIP-sequencing (ChIP-seq) data. Similar redundant motifs are compared using the weighted information content (WIC) similarity score and clustered using an iterative procedure. A comprehensive output report is generated with several different evaluation metrics to compare and evaluate the results. Benchmarks show that the method performs well on human and mouse ChIP-seq datasets. GimmeMotifs consists of a suite of command-line scripts that can be easily implemented in a ChIP-seq analysis pipeline. Availability: GimmeMotifs is implemented in Python and runs on Linux. The source code is freely available for download at http://www.ncmls.eu/bioinfo/gimmemotifs/. Contact: s.vanheeringen@ncmls.ru.nl Supplementary Information: Supplementary data are available at Bioinformatics online. Oxford University Press 2011-01-15 2010-11-15 /pmc/articles/PMC3018809/ /pubmed/21081511 http://dx.doi.org/10.1093/bioinformatics/btq636 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Applications Note van Heeringen, Simon J. Veenstra, Gert Jan C. GimmeMotifs: a de novo motif prediction pipeline for ChIP-sequencing experiments |
title | GimmeMotifs: a de novo motif prediction pipeline for ChIP-sequencing experiments |
title_full | GimmeMotifs: a de novo motif prediction pipeline for ChIP-sequencing experiments |
title_fullStr | GimmeMotifs: a de novo motif prediction pipeline for ChIP-sequencing experiments |
title_full_unstemmed | GimmeMotifs: a de novo motif prediction pipeline for ChIP-sequencing experiments |
title_short | GimmeMotifs: a de novo motif prediction pipeline for ChIP-sequencing experiments |
title_sort | gimmemotifs: a de novo motif prediction pipeline for chip-sequencing experiments |
topic | Applications Note |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3018809/ https://www.ncbi.nlm.nih.gov/pubmed/21081511 http://dx.doi.org/10.1093/bioinformatics/btq636 |
work_keys_str_mv | AT vanheeringensimonj gimmemotifsadenovomotifpredictionpipelineforchipsequencingexperiments AT veenstragertjanc gimmemotifsadenovomotifpredictionpipelineforchipsequencingexperiments |