Cargando…

Strategies for analyzing highly enriched IP-chip datasets

BACKGROUND: Chromatin immunoprecipitation on tiling arrays (ChIP-chip) has been employed to examine features such as protein binding and histone modifications on a genome-wide scale in a variety of cell types. Array data from the latter studies typically have a high proportion of enriched probes who...

Descripción completa

Detalles Bibliográficos
Autores principales: Knott, Simon RV, Viggiani, Christopher J, Aparicio, Oscar M, Tavaré, Simon
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2759964/
https://www.ncbi.nlm.nih.gov/pubmed/19772646
http://dx.doi.org/10.1186/1471-2105-10-305
_version_ 1782172716222119936
author Knott, Simon RV
Viggiani, Christopher J
Aparicio, Oscar M
Tavaré, Simon
author_facet Knott, Simon RV
Viggiani, Christopher J
Aparicio, Oscar M
Tavaré, Simon
author_sort Knott, Simon RV
collection PubMed
description BACKGROUND: Chromatin immunoprecipitation on tiling arrays (ChIP-chip) has been employed to examine features such as protein binding and histone modifications on a genome-wide scale in a variety of cell types. Array data from the latter studies typically have a high proportion of enriched probes whose signals vary considerably (due to heterogeneity in the cell population), and this makes their normalization and downstream analysis difficult. RESULTS: Here we present strategies for analyzing such experiments, focusing our discussion on the analysis of Bromodeoxyruridine (BrdU) immunoprecipitation on tiling array (BrdU-IP-chip) datasets. BrdU-IP-chip experiments map large, recently replicated genomic regions and have similar characteristics to histone modification/location data. To prepare such data for downstream analysis we employ a dynamic programming algorithm that identifies a set of putative unenriched probes, which we use for both within-array and between-array normalization. We also introduce a second dynamic programming algorithm that incorporates a priori knowledge to identify and quantify positive signals in these datasets. CONCLUSION: Highly enriched IP-chip datasets are often difficult to analyze with traditional array normalization and analysis strategies. Here we present and test a set of analytical tools for their normalization and quantification that allows for accurate identification and analysis of enriched regions.
format Text
id pubmed-2759964
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-27599642009-10-11 Strategies for analyzing highly enriched IP-chip datasets Knott, Simon RV Viggiani, Christopher J Aparicio, Oscar M Tavaré, Simon BMC Bioinformatics Methodology Article BACKGROUND: Chromatin immunoprecipitation on tiling arrays (ChIP-chip) has been employed to examine features such as protein binding and histone modifications on a genome-wide scale in a variety of cell types. Array data from the latter studies typically have a high proportion of enriched probes whose signals vary considerably (due to heterogeneity in the cell population), and this makes their normalization and downstream analysis difficult. RESULTS: Here we present strategies for analyzing such experiments, focusing our discussion on the analysis of Bromodeoxyruridine (BrdU) immunoprecipitation on tiling array (BrdU-IP-chip) datasets. BrdU-IP-chip experiments map large, recently replicated genomic regions and have similar characteristics to histone modification/location data. To prepare such data for downstream analysis we employ a dynamic programming algorithm that identifies a set of putative unenriched probes, which we use for both within-array and between-array normalization. We also introduce a second dynamic programming algorithm that incorporates a priori knowledge to identify and quantify positive signals in these datasets. CONCLUSION: Highly enriched IP-chip datasets are often difficult to analyze with traditional array normalization and analysis strategies. Here we present and test a set of analytical tools for their normalization and quantification that allows for accurate identification and analysis of enriched regions. BioMed Central 2009-09-22 /pmc/articles/PMC2759964/ /pubmed/19772646 http://dx.doi.org/10.1186/1471-2105-10-305 Text en Copyright © 2009 Knott et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Knott, Simon RV
Viggiani, Christopher J
Aparicio, Oscar M
Tavaré, Simon
Strategies for analyzing highly enriched IP-chip datasets
title Strategies for analyzing highly enriched IP-chip datasets
title_full Strategies for analyzing highly enriched IP-chip datasets
title_fullStr Strategies for analyzing highly enriched IP-chip datasets
title_full_unstemmed Strategies for analyzing highly enriched IP-chip datasets
title_short Strategies for analyzing highly enriched IP-chip datasets
title_sort strategies for analyzing highly enriched ip-chip datasets
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2759964/
https://www.ncbi.nlm.nih.gov/pubmed/19772646
http://dx.doi.org/10.1186/1471-2105-10-305
work_keys_str_mv AT knottsimonrv strategiesforanalyzinghighlyenrichedipchipdatasets
AT viggianichristopherj strategiesforanalyzinghighlyenrichedipchipdatasets
AT apariciooscarm strategiesforanalyzinghighlyenrichedipchipdatasets
AT tavaresimon strategiesforanalyzinghighlyenrichedipchipdatasets