Cargando…

A manually curated ChIP-seq benchmark demonstrates room for improvement in current peak-finder programs

Chromatin immunoprecipitation (ChIP) followed by high throughput sequencing (ChIP-seq) is rapidly becoming the method of choice for discovering cell-specific transcription factor binding locations genome wide. By aligning sequenced tags to the genome, binding locations appear as peaks in the tag pro...

Descripción completa

Detalles Bibliográficos
Autores principales: Rye, Morten Beck, Sætrom, Pål, Drabløs, Finn
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3045577/
https://www.ncbi.nlm.nih.gov/pubmed/21113027
http://dx.doi.org/10.1093/nar/gkq1187
_version_ 1782198847884230656
author Rye, Morten Beck
Sætrom, Pål
Drabløs, Finn
author_facet Rye, Morten Beck
Sætrom, Pål
Drabløs, Finn
author_sort Rye, Morten Beck
collection PubMed
description Chromatin immunoprecipitation (ChIP) followed by high throughput sequencing (ChIP-seq) is rapidly becoming the method of choice for discovering cell-specific transcription factor binding locations genome wide. By aligning sequenced tags to the genome, binding locations appear as peaks in the tag profile. Several programs have been designed to identify such peaks, but program evaluation has been difficult due to the lack of benchmark data sets. We have created benchmark data sets for three transcription factors by manually evaluating a selection of potential binding regions that cover typical variation in peak size and appearance. Performance of five programs on this benchmark showed, first, that external control or background data was essential to limit the number of false positive peaks from the programs. However, >80% of these peaks could be manually filtered out by visual inspection alone, without using additional background data, showing that peak shape information is not fully exploited in the evaluated programs. Second, none of the programs returned peak-regions that corresponded to the actual resolution in ChIP-seq data. Our results showed that ChIP-seq peaks should be narrowed down to 100–400 bp, which is sufficient to identify unique peaks and binding sites. Based on these results, we propose a meta-approach that gives improved peak definitions.
format Text
id pubmed-3045577
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-30455772011-02-28 A manually curated ChIP-seq benchmark demonstrates room for improvement in current peak-finder programs Rye, Morten Beck Sætrom, Pål Drabløs, Finn Nucleic Acids Res Methods Online Chromatin immunoprecipitation (ChIP) followed by high throughput sequencing (ChIP-seq) is rapidly becoming the method of choice for discovering cell-specific transcription factor binding locations genome wide. By aligning sequenced tags to the genome, binding locations appear as peaks in the tag profile. Several programs have been designed to identify such peaks, but program evaluation has been difficult due to the lack of benchmark data sets. We have created benchmark data sets for three transcription factors by manually evaluating a selection of potential binding regions that cover typical variation in peak size and appearance. Performance of five programs on this benchmark showed, first, that external control or background data was essential to limit the number of false positive peaks from the programs. However, >80% of these peaks could be manually filtered out by visual inspection alone, without using additional background data, showing that peak shape information is not fully exploited in the evaluated programs. Second, none of the programs returned peak-regions that corresponded to the actual resolution in ChIP-seq data. Our results showed that ChIP-seq peaks should be narrowed down to 100–400 bp, which is sufficient to identify unique peaks and binding sites. Based on these results, we propose a meta-approach that gives improved peak definitions. Oxford University Press 2011-03 2010-11-26 /pmc/articles/PMC3045577/ /pubmed/21113027 http://dx.doi.org/10.1093/nar/gkq1187 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Rye, Morten Beck
Sætrom, Pål
Drabløs, Finn
A manually curated ChIP-seq benchmark demonstrates room for improvement in current peak-finder programs
title A manually curated ChIP-seq benchmark demonstrates room for improvement in current peak-finder programs
title_full A manually curated ChIP-seq benchmark demonstrates room for improvement in current peak-finder programs
title_fullStr A manually curated ChIP-seq benchmark demonstrates room for improvement in current peak-finder programs
title_full_unstemmed A manually curated ChIP-seq benchmark demonstrates room for improvement in current peak-finder programs
title_short A manually curated ChIP-seq benchmark demonstrates room for improvement in current peak-finder programs
title_sort manually curated chip-seq benchmark demonstrates room for improvement in current peak-finder programs
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3045577/
https://www.ncbi.nlm.nih.gov/pubmed/21113027
http://dx.doi.org/10.1093/nar/gkq1187
work_keys_str_mv AT ryemortenbeck amanuallycuratedchipseqbenchmarkdemonstratesroomforimprovementincurrentpeakfinderprograms
AT sætrompal amanuallycuratedchipseqbenchmarkdemonstratesroomforimprovementincurrentpeakfinderprograms
AT drabløsfinn amanuallycuratedchipseqbenchmarkdemonstratesroomforimprovementincurrentpeakfinderprograms
AT ryemortenbeck manuallycuratedchipseqbenchmarkdemonstratesroomforimprovementincurrentpeakfinderprograms
AT sætrompal manuallycuratedchipseqbenchmarkdemonstratesroomforimprovementincurrentpeakfinderprograms
AT drabløsfinn manuallycuratedchipseqbenchmarkdemonstratesroomforimprovementincurrentpeakfinderprograms