Cargando…

Rank-statistics based enrichment-site prediction algorithm developed for chromatin immunoprecipitation on chip experiments

BACKGROUND: High density oligonucleotide tiling arrays are an effective and powerful platform for conducting unbiased genome-wide studies. The ab initio probe selection method employed in tiling arrays is unbiased, and thus ensures consistent sampling across coding and non-coding regions of the geno...

Descripción completa

Detalles Bibliográficos
Autores principales: Ghosh, Srinka, Hirsch, Heather A, Sekinger, Edward, Struhl, Kevin, Gingeras, Thomas R
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1615882/
https://www.ncbi.nlm.nih.gov/pubmed/17022824
http://dx.doi.org/10.1186/1471-2105-7-434
_version_ 1782130492811771904
author Ghosh, Srinka
Hirsch, Heather A
Sekinger, Edward
Struhl, Kevin
Gingeras, Thomas R
author_facet Ghosh, Srinka
Hirsch, Heather A
Sekinger, Edward
Struhl, Kevin
Gingeras, Thomas R
author_sort Ghosh, Srinka
collection PubMed
description BACKGROUND: High density oligonucleotide tiling arrays are an effective and powerful platform for conducting unbiased genome-wide studies. The ab initio probe selection method employed in tiling arrays is unbiased, and thus ensures consistent sampling across coding and non-coding regions of the genome. Tiling arrays are increasingly used in chromatin immunoprecipitation (IP) experiments (ChIP on chip). ChIP on chip facilitates the generation of genome-wide maps of in-vivo interactions between DNA-associated proteins including transcription factors and DNA. Analysis of the hybridization of an immunoprecipitated sample to a tiling array facilitates the identification of ChIP-enriched segments of the genome. These enriched segments are putative targets of antibody assayable regulatory elements. The enrichment response is not ubiquitous across the genome. Typically 5 to 10% of tiled probes manifest some significant enrichment. Depending upon the factor being studied, this response can drop to less than 1%. The detection and assessment of significance for interactions that emanate from non-canonical and/or un-annotated regions of the genome is especially challenging. This is the motivation behind the proposed algorithm. RESULTS: We have proposed a novel rank and replicate statistics-based methodology for identifying and ascribing statistical confidence to regions of ChIP-enrichment. The algorithm is optimized for identification of sites that manifest low levels of enrichment but are true positives, as validated by alternative biochemical experiments. Although the method is described here in the context of ChIP on chip experiments, it can be generalized to any treatment-control experimental design. The results of the algorithm show a high degree of concordance with independent biochemical validation methods. The sensitivity and specificity of the algorithm have been characterized via quantitative PCR and independent computational approaches. CONCLUSION: The algorithm ranks all enrichment sites based on their intra-replicate ranks and inter-replicate rank consistency. Following the ranking, the method allows segmentation of sites based on a meta p-value, a composite array signal enrichment criterion, or a composite of these two measures. The sensitivities obtained subsequent to the segmentation of data using a meta p-value of 10(-5), an array signal enrichment of 0.2 and a composite of these two values are 88%, 87% and 95%, respectively.
format Text
id pubmed-1615882
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-16158822006-10-18 Rank-statistics based enrichment-site prediction algorithm developed for chromatin immunoprecipitation on chip experiments Ghosh, Srinka Hirsch, Heather A Sekinger, Edward Struhl, Kevin Gingeras, Thomas R BMC Bioinformatics Methodology Article BACKGROUND: High density oligonucleotide tiling arrays are an effective and powerful platform for conducting unbiased genome-wide studies. The ab initio probe selection method employed in tiling arrays is unbiased, and thus ensures consistent sampling across coding and non-coding regions of the genome. Tiling arrays are increasingly used in chromatin immunoprecipitation (IP) experiments (ChIP on chip). ChIP on chip facilitates the generation of genome-wide maps of in-vivo interactions between DNA-associated proteins including transcription factors and DNA. Analysis of the hybridization of an immunoprecipitated sample to a tiling array facilitates the identification of ChIP-enriched segments of the genome. These enriched segments are putative targets of antibody assayable regulatory elements. The enrichment response is not ubiquitous across the genome. Typically 5 to 10% of tiled probes manifest some significant enrichment. Depending upon the factor being studied, this response can drop to less than 1%. The detection and assessment of significance for interactions that emanate from non-canonical and/or un-annotated regions of the genome is especially challenging. This is the motivation behind the proposed algorithm. RESULTS: We have proposed a novel rank and replicate statistics-based methodology for identifying and ascribing statistical confidence to regions of ChIP-enrichment. The algorithm is optimized for identification of sites that manifest low levels of enrichment but are true positives, as validated by alternative biochemical experiments. Although the method is described here in the context of ChIP on chip experiments, it can be generalized to any treatment-control experimental design. The results of the algorithm show a high degree of concordance with independent biochemical validation methods. The sensitivity and specificity of the algorithm have been characterized via quantitative PCR and independent computational approaches. CONCLUSION: The algorithm ranks all enrichment sites based on their intra-replicate ranks and inter-replicate rank consistency. Following the ranking, the method allows segmentation of sites based on a meta p-value, a composite array signal enrichment criterion, or a composite of these two measures. The sensitivities obtained subsequent to the segmentation of data using a meta p-value of 10(-5), an array signal enrichment of 0.2 and a composite of these two values are 88%, 87% and 95%, respectively. BioMed Central 2006-10-05 /pmc/articles/PMC1615882/ /pubmed/17022824 http://dx.doi.org/10.1186/1471-2105-7-434 Text en Copyright © 2006 Ghosh et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Ghosh, Srinka
Hirsch, Heather A
Sekinger, Edward
Struhl, Kevin
Gingeras, Thomas R
Rank-statistics based enrichment-site prediction algorithm developed for chromatin immunoprecipitation on chip experiments
title Rank-statistics based enrichment-site prediction algorithm developed for chromatin immunoprecipitation on chip experiments
title_full Rank-statistics based enrichment-site prediction algorithm developed for chromatin immunoprecipitation on chip experiments
title_fullStr Rank-statistics based enrichment-site prediction algorithm developed for chromatin immunoprecipitation on chip experiments
title_full_unstemmed Rank-statistics based enrichment-site prediction algorithm developed for chromatin immunoprecipitation on chip experiments
title_short Rank-statistics based enrichment-site prediction algorithm developed for chromatin immunoprecipitation on chip experiments
title_sort rank-statistics based enrichment-site prediction algorithm developed for chromatin immunoprecipitation on chip experiments
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1615882/
https://www.ncbi.nlm.nih.gov/pubmed/17022824
http://dx.doi.org/10.1186/1471-2105-7-434
work_keys_str_mv AT ghoshsrinka rankstatisticsbasedenrichmentsitepredictionalgorithmdevelopedforchromatinimmunoprecipitationonchipexperiments
AT hirschheathera rankstatisticsbasedenrichmentsitepredictionalgorithmdevelopedforchromatinimmunoprecipitationonchipexperiments
AT sekingeredward rankstatisticsbasedenrichmentsitepredictionalgorithmdevelopedforchromatinimmunoprecipitationonchipexperiments
AT struhlkevin rankstatisticsbasedenrichmentsitepredictionalgorithmdevelopedforchromatinimmunoprecipitationonchipexperiments
AT gingerasthomasr rankstatisticsbasedenrichmentsitepredictionalgorithmdevelopedforchromatinimmunoprecipitationonchipexperiments