Cargando…
CMT: A Constrained Multi-Level Thresholding Approach for ChIP-Seq Data Analysis
Genome-wide profiling of DNA-binding proteins using ChIP-Seq has emerged as an alternative to ChIP-chip methods. ChIP-Seq technology offers many advantages over ChIP-chip arrays, including but not limited to less noise, higher resolution, and more coverage. Several algorithms have been developed to...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3988018/ https://www.ncbi.nlm.nih.gov/pubmed/24736605 http://dx.doi.org/10.1371/journal.pone.0093873 |
_version_ | 1782311963265597440 |
---|---|
author | Rezaeian, Iman Rueda, Luis |
author_facet | Rezaeian, Iman Rueda, Luis |
author_sort | Rezaeian, Iman |
collection | PubMed |
description | Genome-wide profiling of DNA-binding proteins using ChIP-Seq has emerged as an alternative to ChIP-chip methods. ChIP-Seq technology offers many advantages over ChIP-chip arrays, including but not limited to less noise, higher resolution, and more coverage. Several algorithms have been developed to take advantage of these abilities and find enriched regions by analyzing ChIP-Seq data. However, the complexity of analyzing various patterns of ChIP-Seq signals still needs the development of new algorithms. Most current algorithms use various heuristics to detect regions accurately. However, despite how many formulations are available, it is still difficult to accurately determine individual peaks corresponding to each binding event. We developed Constrained Multi-level Thresholding (CMT), an algorithm used to detect enriched regions on ChIP-Seq data. CMT employs a constraint-based module that can target regions within a specific range. We show that CMT has higher accuracy in detecting enriched regions (peaks) by objectively assessing its performance relative to other previously proposed peak finders. This is shown by testing three algorithms on the well-known FoxA1 Data set, four transcription factors (with a total of six antibodies) for Drosophila melanogaster and the H3K4ac antibody dataset. |
format | Online Article Text |
id | pubmed-3988018 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-39880182014-04-21 CMT: A Constrained Multi-Level Thresholding Approach for ChIP-Seq Data Analysis Rezaeian, Iman Rueda, Luis PLoS One Research Article Genome-wide profiling of DNA-binding proteins using ChIP-Seq has emerged as an alternative to ChIP-chip methods. ChIP-Seq technology offers many advantages over ChIP-chip arrays, including but not limited to less noise, higher resolution, and more coverage. Several algorithms have been developed to take advantage of these abilities and find enriched regions by analyzing ChIP-Seq data. However, the complexity of analyzing various patterns of ChIP-Seq signals still needs the development of new algorithms. Most current algorithms use various heuristics to detect regions accurately. However, despite how many formulations are available, it is still difficult to accurately determine individual peaks corresponding to each binding event. We developed Constrained Multi-level Thresholding (CMT), an algorithm used to detect enriched regions on ChIP-Seq data. CMT employs a constraint-based module that can target regions within a specific range. We show that CMT has higher accuracy in detecting enriched regions (peaks) by objectively assessing its performance relative to other previously proposed peak finders. This is shown by testing three algorithms on the well-known FoxA1 Data set, four transcription factors (with a total of six antibodies) for Drosophila melanogaster and the H3K4ac antibody dataset. Public Library of Science 2014-04-15 /pmc/articles/PMC3988018/ /pubmed/24736605 http://dx.doi.org/10.1371/journal.pone.0093873 Text en © 2014 Rezaeian, Rueda http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Rezaeian, Iman Rueda, Luis CMT: A Constrained Multi-Level Thresholding Approach for ChIP-Seq Data Analysis |
title | CMT: A Constrained Multi-Level Thresholding Approach for ChIP-Seq Data Analysis |
title_full | CMT: A Constrained Multi-Level Thresholding Approach for ChIP-Seq Data Analysis |
title_fullStr | CMT: A Constrained Multi-Level Thresholding Approach for ChIP-Seq Data Analysis |
title_full_unstemmed | CMT: A Constrained Multi-Level Thresholding Approach for ChIP-Seq Data Analysis |
title_short | CMT: A Constrained Multi-Level Thresholding Approach for ChIP-Seq Data Analysis |
title_sort | cmt: a constrained multi-level thresholding approach for chip-seq data analysis |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3988018/ https://www.ncbi.nlm.nih.gov/pubmed/24736605 http://dx.doi.org/10.1371/journal.pone.0093873 |
work_keys_str_mv | AT rezaeianiman cmtaconstrainedmultilevelthresholdingapproachforchipseqdataanalysis AT ruedaluis cmtaconstrainedmultilevelthresholdingapproachforchipseqdataanalysis |