Cargando…

CMT: A Constrained Multi-Level Thresholding Approach for ChIP-Seq Data Analysis

Genome-wide profiling of DNA-binding proteins using ChIP-Seq has emerged as an alternative to ChIP-chip methods. ChIP-Seq technology offers many advantages over ChIP-chip arrays, including but not limited to less noise, higher resolution, and more coverage. Several algorithms have been developed to...

Descripción completa

Detalles Bibliográficos
Autores principales: Rezaeian, Iman, Rueda, Luis
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3988018/
https://www.ncbi.nlm.nih.gov/pubmed/24736605
http://dx.doi.org/10.1371/journal.pone.0093873
_version_ 1782311963265597440
author Rezaeian, Iman
Rueda, Luis
author_facet Rezaeian, Iman
Rueda, Luis
author_sort Rezaeian, Iman
collection PubMed
description Genome-wide profiling of DNA-binding proteins using ChIP-Seq has emerged as an alternative to ChIP-chip methods. ChIP-Seq technology offers many advantages over ChIP-chip arrays, including but not limited to less noise, higher resolution, and more coverage. Several algorithms have been developed to take advantage of these abilities and find enriched regions by analyzing ChIP-Seq data. However, the complexity of analyzing various patterns of ChIP-Seq signals still needs the development of new algorithms. Most current algorithms use various heuristics to detect regions accurately. However, despite how many formulations are available, it is still difficult to accurately determine individual peaks corresponding to each binding event. We developed Constrained Multi-level Thresholding (CMT), an algorithm used to detect enriched regions on ChIP-Seq data. CMT employs a constraint-based module that can target regions within a specific range. We show that CMT has higher accuracy in detecting enriched regions (peaks) by objectively assessing its performance relative to other previously proposed peak finders. This is shown by testing three algorithms on the well-known FoxA1 Data set, four transcription factors (with a total of six antibodies) for Drosophila melanogaster and the H3K4ac antibody dataset.
format Online
Article
Text
id pubmed-3988018
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-39880182014-04-21 CMT: A Constrained Multi-Level Thresholding Approach for ChIP-Seq Data Analysis Rezaeian, Iman Rueda, Luis PLoS One Research Article Genome-wide profiling of DNA-binding proteins using ChIP-Seq has emerged as an alternative to ChIP-chip methods. ChIP-Seq technology offers many advantages over ChIP-chip arrays, including but not limited to less noise, higher resolution, and more coverage. Several algorithms have been developed to take advantage of these abilities and find enriched regions by analyzing ChIP-Seq data. However, the complexity of analyzing various patterns of ChIP-Seq signals still needs the development of new algorithms. Most current algorithms use various heuristics to detect regions accurately. However, despite how many formulations are available, it is still difficult to accurately determine individual peaks corresponding to each binding event. We developed Constrained Multi-level Thresholding (CMT), an algorithm used to detect enriched regions on ChIP-Seq data. CMT employs a constraint-based module that can target regions within a specific range. We show that CMT has higher accuracy in detecting enriched regions (peaks) by objectively assessing its performance relative to other previously proposed peak finders. This is shown by testing three algorithms on the well-known FoxA1 Data set, four transcription factors (with a total of six antibodies) for Drosophila melanogaster and the H3K4ac antibody dataset. Public Library of Science 2014-04-15 /pmc/articles/PMC3988018/ /pubmed/24736605 http://dx.doi.org/10.1371/journal.pone.0093873 Text en © 2014 Rezaeian, Rueda http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Rezaeian, Iman
Rueda, Luis
CMT: A Constrained Multi-Level Thresholding Approach for ChIP-Seq Data Analysis
title CMT: A Constrained Multi-Level Thresholding Approach for ChIP-Seq Data Analysis
title_full CMT: A Constrained Multi-Level Thresholding Approach for ChIP-Seq Data Analysis
title_fullStr CMT: A Constrained Multi-Level Thresholding Approach for ChIP-Seq Data Analysis
title_full_unstemmed CMT: A Constrained Multi-Level Thresholding Approach for ChIP-Seq Data Analysis
title_short CMT: A Constrained Multi-Level Thresholding Approach for ChIP-Seq Data Analysis
title_sort cmt: a constrained multi-level thresholding approach for chip-seq data analysis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3988018/
https://www.ncbi.nlm.nih.gov/pubmed/24736605
http://dx.doi.org/10.1371/journal.pone.0093873
work_keys_str_mv AT rezaeianiman cmtaconstrainedmultilevelthresholdingapproachforchipseqdataanalysis
AT ruedaluis cmtaconstrainedmultilevelthresholdingapproachforchipseqdataanalysis