Cargando…

CGHScan: finding variable regions using high-density microarray comparative genomic hybridization data

BACKGROUND: Comparative genomic hybridization can rapidly identify chromosomal regions that vary between organisms and tissues. This technique has been applied to detecting differences between normal and cancerous tissues in eukaryotes as well as genomic variability in microbial strains and species....

Descripción completa

Detalles Bibliográficos
Autores principales: Anderson, Bradley D, Gilson, Michael C, Scott, Abigail A, Biehl, Bryan S, Glasner, Jeremy D, Rajashekara, Gireesh, Splitter, Gary A, Perna, Nicole T
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1464128/
https://www.ncbi.nlm.nih.gov/pubmed/16638145
http://dx.doi.org/10.1186/1471-2164-7-91
_version_ 1782127539133612032
author Anderson, Bradley D
Gilson, Michael C
Scott, Abigail A
Biehl, Bryan S
Glasner, Jeremy D
Rajashekara, Gireesh
Splitter, Gary A
Perna, Nicole T
author_facet Anderson, Bradley D
Gilson, Michael C
Scott, Abigail A
Biehl, Bryan S
Glasner, Jeremy D
Rajashekara, Gireesh
Splitter, Gary A
Perna, Nicole T
author_sort Anderson, Bradley D
collection PubMed
description BACKGROUND: Comparative genomic hybridization can rapidly identify chromosomal regions that vary between organisms and tissues. This technique has been applied to detecting differences between normal and cancerous tissues in eukaryotes as well as genomic variability in microbial strains and species. The density of oligonucleotide probes available on current microarray platforms is particularly well-suited for comparisons of organisms with smaller genomes like bacteria and yeast where an entire genome can be assayed on a single microarray with high resolution. Available methods for analyzing these experiments typically confine analyses to data from pre-defined annotated genome features, such as entire genes. Many of these methods are ill suited for datasets with the number of measurements typical of high-density microarrays. RESULTS: We present an algorithm for analyzing microarray hybridization data to aid identification of regions that vary between an unsequenced genome and a sequenced reference genome. The program, CGHScan, uses an iterative random walk approach integrating multi-layered significance testing to detect these regions from comparative genomic hybridization data. The algorithm tolerates a high level of noise in measurements of individual probe intensities and is relatively insensitive to the choice of method for normalizing probe intensity values and identifying probes that differ between samples. When applied to comparative genomic hybridization data from a published experiment, CGHScan identified eight of nine known deletions in a Brucella ovis strain as compared to Brucella melitensis. The same result was obtained using two different normalization methods and two different scores to classify data for individual probes as representing conserved or variable genomic regions. The undetected region is a small (58 base pair) deletion that is below the resolution of CGHScan given the array design employed in the study. CONCLUSION: CGHScan is an effective tool for analyzing comparative genomic hybridization data from high-density microarrays. The algorithm is capable of accurately identifying known variable regions and is tolerant of high noise and varying methods of data preprocessing. Statistical analysis is used to define each variable region providing a robust and reliable method for rapid identification of genomic differences independent of annotated gene boundaries.
format Text
id pubmed-1464128
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-14641282006-05-23 CGHScan: finding variable regions using high-density microarray comparative genomic hybridization data Anderson, Bradley D Gilson, Michael C Scott, Abigail A Biehl, Bryan S Glasner, Jeremy D Rajashekara, Gireesh Splitter, Gary A Perna, Nicole T BMC Genomics Software BACKGROUND: Comparative genomic hybridization can rapidly identify chromosomal regions that vary between organisms and tissues. This technique has been applied to detecting differences between normal and cancerous tissues in eukaryotes as well as genomic variability in microbial strains and species. The density of oligonucleotide probes available on current microarray platforms is particularly well-suited for comparisons of organisms with smaller genomes like bacteria and yeast where an entire genome can be assayed on a single microarray with high resolution. Available methods for analyzing these experiments typically confine analyses to data from pre-defined annotated genome features, such as entire genes. Many of these methods are ill suited for datasets with the number of measurements typical of high-density microarrays. RESULTS: We present an algorithm for analyzing microarray hybridization data to aid identification of regions that vary between an unsequenced genome and a sequenced reference genome. The program, CGHScan, uses an iterative random walk approach integrating multi-layered significance testing to detect these regions from comparative genomic hybridization data. The algorithm tolerates a high level of noise in measurements of individual probe intensities and is relatively insensitive to the choice of method for normalizing probe intensity values and identifying probes that differ between samples. When applied to comparative genomic hybridization data from a published experiment, CGHScan identified eight of nine known deletions in a Brucella ovis strain as compared to Brucella melitensis. The same result was obtained using two different normalization methods and two different scores to classify data for individual probes as representing conserved or variable genomic regions. The undetected region is a small (58 base pair) deletion that is below the resolution of CGHScan given the array design employed in the study. CONCLUSION: CGHScan is an effective tool for analyzing comparative genomic hybridization data from high-density microarrays. The algorithm is capable of accurately identifying known variable regions and is tolerant of high noise and varying methods of data preprocessing. Statistical analysis is used to define each variable region providing a robust and reliable method for rapid identification of genomic differences independent of annotated gene boundaries. BioMed Central 2006-04-25 /pmc/articles/PMC1464128/ /pubmed/16638145 http://dx.doi.org/10.1186/1471-2164-7-91 Text en Copyright © 2006 Anderson et al; licensee BioMed Central Ltd.
spellingShingle Software
Anderson, Bradley D
Gilson, Michael C
Scott, Abigail A
Biehl, Bryan S
Glasner, Jeremy D
Rajashekara, Gireesh
Splitter, Gary A
Perna, Nicole T
CGHScan: finding variable regions using high-density microarray comparative genomic hybridization data
title CGHScan: finding variable regions using high-density microarray comparative genomic hybridization data
title_full CGHScan: finding variable regions using high-density microarray comparative genomic hybridization data
title_fullStr CGHScan: finding variable regions using high-density microarray comparative genomic hybridization data
title_full_unstemmed CGHScan: finding variable regions using high-density microarray comparative genomic hybridization data
title_short CGHScan: finding variable regions using high-density microarray comparative genomic hybridization data
title_sort cghscan: finding variable regions using high-density microarray comparative genomic hybridization data
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1464128/
https://www.ncbi.nlm.nih.gov/pubmed/16638145
http://dx.doi.org/10.1186/1471-2164-7-91
work_keys_str_mv AT andersonbradleyd cghscanfindingvariableregionsusinghighdensitymicroarraycomparativegenomichybridizationdata
AT gilsonmichaelc cghscanfindingvariableregionsusinghighdensitymicroarraycomparativegenomichybridizationdata
AT scottabigaila cghscanfindingvariableregionsusinghighdensitymicroarraycomparativegenomichybridizationdata
AT biehlbryans cghscanfindingvariableregionsusinghighdensitymicroarraycomparativegenomichybridizationdata
AT glasnerjeremyd cghscanfindingvariableregionsusinghighdensitymicroarraycomparativegenomichybridizationdata
AT rajashekaragireesh cghscanfindingvariableregionsusinghighdensitymicroarraycomparativegenomichybridizationdata
AT splittergarya cghscanfindingvariableregionsusinghighdensitymicroarraycomparativegenomichybridizationdata
AT pernanicolet cghscanfindingvariableregionsusinghighdensitymicroarraycomparativegenomichybridizationdata