Cargando…

The Triform algorithm: improved sensitivity and specificity in ChIP-Seq peak finding

BACKGROUND: Chromatin immunoprecipitation combined with high-throughput sequencing (ChIP-Seq) is the most frequently used method to identify the binding sites of transcription factors. Active binding sites can be seen as peaks in enrichment profiles when the sequencing reads are mapped to a referenc...

Descripción completa

Detalles Bibliográficos
Autores principales: Kornacker, Karl, Rye, Morten Beck, Håndstad, Tony, Drabløs, Finn
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3480842/
https://www.ncbi.nlm.nih.gov/pubmed/22827163
http://dx.doi.org/10.1186/1471-2105-13-176
_version_ 1782247636573618176
author Kornacker, Karl
Rye, Morten Beck
Håndstad, Tony
Drabløs, Finn
author_facet Kornacker, Karl
Rye, Morten Beck
Håndstad, Tony
Drabløs, Finn
author_sort Kornacker, Karl
collection PubMed
description BACKGROUND: Chromatin immunoprecipitation combined with high-throughput sequencing (ChIP-Seq) is the most frequently used method to identify the binding sites of transcription factors. Active binding sites can be seen as peaks in enrichment profiles when the sequencing reads are mapped to a reference genome. However, the profiles are normally noisy, making it challenging to identify all significantly enriched regions in a reliable way and with an acceptable false discovery rate. RESULTS: We present the Triform algorithm, an improved approach to automatic peak finding in ChIP-Seq enrichment profiles for transcription factors. The method uses model-free statistics to identify peak-like distributions of sequencing reads, taking advantage of improved peak definition in combination with known characteristics of ChIP-Seq data. CONCLUSIONS: Triform outperforms several existing methods in the identification of representative peak profiles in curated benchmark data sets. We also show that Triform in many cases is able to identify peaks that are more consistent with biological function, compared with other methods. Finally, we show that Triform can be used to generate novel information on transcription factor binding in repeat regions, which represents a particular challenge in many ChIP-Seq experiments. The Triform algorithm has been implemented in R, and is available via http://tare.medisin.ntnu.no/triform.
format Online
Article
Text
id pubmed-3480842
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-34808422012-11-02 The Triform algorithm: improved sensitivity and specificity in ChIP-Seq peak finding Kornacker, Karl Rye, Morten Beck Håndstad, Tony Drabløs, Finn BMC Bioinformatics Methodology Article BACKGROUND: Chromatin immunoprecipitation combined with high-throughput sequencing (ChIP-Seq) is the most frequently used method to identify the binding sites of transcription factors. Active binding sites can be seen as peaks in enrichment profiles when the sequencing reads are mapped to a reference genome. However, the profiles are normally noisy, making it challenging to identify all significantly enriched regions in a reliable way and with an acceptable false discovery rate. RESULTS: We present the Triform algorithm, an improved approach to automatic peak finding in ChIP-Seq enrichment profiles for transcription factors. The method uses model-free statistics to identify peak-like distributions of sequencing reads, taking advantage of improved peak definition in combination with known characteristics of ChIP-Seq data. CONCLUSIONS: Triform outperforms several existing methods in the identification of representative peak profiles in curated benchmark data sets. We also show that Triform in many cases is able to identify peaks that are more consistent with biological function, compared with other methods. Finally, we show that Triform can be used to generate novel information on transcription factor binding in repeat regions, which represents a particular challenge in many ChIP-Seq experiments. The Triform algorithm has been implemented in R, and is available via http://tare.medisin.ntnu.no/triform. BioMed Central 2012-07-24 /pmc/articles/PMC3480842/ /pubmed/22827163 http://dx.doi.org/10.1186/1471-2105-13-176 Text en Copyright ©2012 Kornacker et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Kornacker, Karl
Rye, Morten Beck
Håndstad, Tony
Drabløs, Finn
The Triform algorithm: improved sensitivity and specificity in ChIP-Seq peak finding
title The Triform algorithm: improved sensitivity and specificity in ChIP-Seq peak finding
title_full The Triform algorithm: improved sensitivity and specificity in ChIP-Seq peak finding
title_fullStr The Triform algorithm: improved sensitivity and specificity in ChIP-Seq peak finding
title_full_unstemmed The Triform algorithm: improved sensitivity and specificity in ChIP-Seq peak finding
title_short The Triform algorithm: improved sensitivity and specificity in ChIP-Seq peak finding
title_sort triform algorithm: improved sensitivity and specificity in chip-seq peak finding
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3480842/
https://www.ncbi.nlm.nih.gov/pubmed/22827163
http://dx.doi.org/10.1186/1471-2105-13-176
work_keys_str_mv AT kornackerkarl thetriformalgorithmimprovedsensitivityandspecificityinchipseqpeakfinding
AT ryemortenbeck thetriformalgorithmimprovedsensitivityandspecificityinchipseqpeakfinding
AT handstadtony thetriformalgorithmimprovedsensitivityandspecificityinchipseqpeakfinding
AT drabløsfinn thetriformalgorithmimprovedsensitivityandspecificityinchipseqpeakfinding
AT kornackerkarl triformalgorithmimprovedsensitivityandspecificityinchipseqpeakfinding
AT ryemortenbeck triformalgorithmimprovedsensitivityandspecificityinchipseqpeakfinding
AT handstadtony triformalgorithmimprovedsensitivityandspecificityinchipseqpeakfinding
AT drabløsfinn triformalgorithmimprovedsensitivityandspecificityinchipseqpeakfinding