Cargando…
Strategies for analyzing highly enriched IP-chip datasets
BACKGROUND: Chromatin immunoprecipitation on tiling arrays (ChIP-chip) has been employed to examine features such as protein binding and histone modifications on a genome-wide scale in a variety of cell types. Array data from the latter studies typically have a high proportion of enriched probes who...
Autores principales: | , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2759964/ https://www.ncbi.nlm.nih.gov/pubmed/19772646 http://dx.doi.org/10.1186/1471-2105-10-305 |
_version_ | 1782172716222119936 |
---|---|
author | Knott, Simon RV Viggiani, Christopher J Aparicio, Oscar M Tavaré, Simon |
author_facet | Knott, Simon RV Viggiani, Christopher J Aparicio, Oscar M Tavaré, Simon |
author_sort | Knott, Simon RV |
collection | PubMed |
description | BACKGROUND: Chromatin immunoprecipitation on tiling arrays (ChIP-chip) has been employed to examine features such as protein binding and histone modifications on a genome-wide scale in a variety of cell types. Array data from the latter studies typically have a high proportion of enriched probes whose signals vary considerably (due to heterogeneity in the cell population), and this makes their normalization and downstream analysis difficult. RESULTS: Here we present strategies for analyzing such experiments, focusing our discussion on the analysis of Bromodeoxyruridine (BrdU) immunoprecipitation on tiling array (BrdU-IP-chip) datasets. BrdU-IP-chip experiments map large, recently replicated genomic regions and have similar characteristics to histone modification/location data. To prepare such data for downstream analysis we employ a dynamic programming algorithm that identifies a set of putative unenriched probes, which we use for both within-array and between-array normalization. We also introduce a second dynamic programming algorithm that incorporates a priori knowledge to identify and quantify positive signals in these datasets. CONCLUSION: Highly enriched IP-chip datasets are often difficult to analyze with traditional array normalization and analysis strategies. Here we present and test a set of analytical tools for their normalization and quantification that allows for accurate identification and analysis of enriched regions. |
format | Text |
id | pubmed-2759964 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-27599642009-10-11 Strategies for analyzing highly enriched IP-chip datasets Knott, Simon RV Viggiani, Christopher J Aparicio, Oscar M Tavaré, Simon BMC Bioinformatics Methodology Article BACKGROUND: Chromatin immunoprecipitation on tiling arrays (ChIP-chip) has been employed to examine features such as protein binding and histone modifications on a genome-wide scale in a variety of cell types. Array data from the latter studies typically have a high proportion of enriched probes whose signals vary considerably (due to heterogeneity in the cell population), and this makes their normalization and downstream analysis difficult. RESULTS: Here we present strategies for analyzing such experiments, focusing our discussion on the analysis of Bromodeoxyruridine (BrdU) immunoprecipitation on tiling array (BrdU-IP-chip) datasets. BrdU-IP-chip experiments map large, recently replicated genomic regions and have similar characteristics to histone modification/location data. To prepare such data for downstream analysis we employ a dynamic programming algorithm that identifies a set of putative unenriched probes, which we use for both within-array and between-array normalization. We also introduce a second dynamic programming algorithm that incorporates a priori knowledge to identify and quantify positive signals in these datasets. CONCLUSION: Highly enriched IP-chip datasets are often difficult to analyze with traditional array normalization and analysis strategies. Here we present and test a set of analytical tools for their normalization and quantification that allows for accurate identification and analysis of enriched regions. BioMed Central 2009-09-22 /pmc/articles/PMC2759964/ /pubmed/19772646 http://dx.doi.org/10.1186/1471-2105-10-305 Text en Copyright © 2009 Knott et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methodology Article Knott, Simon RV Viggiani, Christopher J Aparicio, Oscar M Tavaré, Simon Strategies for analyzing highly enriched IP-chip datasets |
title | Strategies for analyzing highly enriched IP-chip datasets |
title_full | Strategies for analyzing highly enriched IP-chip datasets |
title_fullStr | Strategies for analyzing highly enriched IP-chip datasets |
title_full_unstemmed | Strategies for analyzing highly enriched IP-chip datasets |
title_short | Strategies for analyzing highly enriched IP-chip datasets |
title_sort | strategies for analyzing highly enriched ip-chip datasets |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2759964/ https://www.ncbi.nlm.nih.gov/pubmed/19772646 http://dx.doi.org/10.1186/1471-2105-10-305 |
work_keys_str_mv | AT knottsimonrv strategiesforanalyzinghighlyenrichedipchipdatasets AT viggianichristopherj strategiesforanalyzinghighlyenrichedipchipdatasets AT apariciooscarm strategiesforanalyzinghighlyenrichedipchipdatasets AT tavaresimon strategiesforanalyzinghighlyenrichedipchipdatasets |