Cargando…

A new approach for the analysis of bacterial microarray-based Comparative Genomic Hybridization: insights from an empirical study

BACKGROUND: Microarray-based Comparative Genomic Hybridization (M-CGH) has been used to characterize the extensive intraspecies genetic diversity found in bacteria at the whole-genome level. Although conventional microarray analytical procedures have proved adequate in handling M-CGH data, data inte...

Descripción completa

Detalles Bibliográficos
Autores principales: Taboada, Eduardo N, Acedillo, Rey R, Luebbert, Christian C, Findlay, Wendy A, Nash, John HE
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2005
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1168892/
https://www.ncbi.nlm.nih.gov/pubmed/15918914
http://dx.doi.org/10.1186/1471-2164-6-78
_version_ 1782124437813854208
author Taboada, Eduardo N
Acedillo, Rey R
Luebbert, Christian C
Findlay, Wendy A
Nash, John HE
author_facet Taboada, Eduardo N
Acedillo, Rey R
Luebbert, Christian C
Findlay, Wendy A
Nash, John HE
author_sort Taboada, Eduardo N
collection PubMed
description BACKGROUND: Microarray-based Comparative Genomic Hybridization (M-CGH) has been used to characterize the extensive intraspecies genetic diversity found in bacteria at the whole-genome level. Although conventional microarray analytical procedures have proved adequate in handling M-CGH data, data interpretation using these methods is based on a continuous character model in which gene divergence and gene absence form a spectrum of decreasing gene conservation levels. However, whereas gene divergence may yet be accompanied by retention in gene function, gene absence invariably leads to loss of function. This distinction, if ignored, leads to a loss in the information to be gained from M-CGH data. We present here results from experiments in which two genome-sequenced strains of C. jejuni were compared against each other using M-CGH. Because the gene content of both strains was known a priori, we were able to closely examine the effects of sequence divergence and gene absence on M-CGH data in order to define analytical parameters for M-CGH data interpretation. This would facilitate the examination of the relative effects of sequence divergence or gene absence in comparative genomics analyses of multiple strains of any species for which genome sequence data and a DNA microarray are available. RESULTS: As a first step towards improving the analysis of M-CGH data, we estimated the degree of experimental error in a series of experiments in which identical samples were compared against each other by M-CGH. This variance estimate was used to validate a Log Ratio-based methodology for identification of outliers in M-CGH data. We compared two genome strains by M-CGH to examine the effect of probe/target identity on the Log Ratios of signal intensities using prior knowledge of gene divergence and gene absence to establish Log Ratio thresholds for the identification of absent and conserved genes. CONCLUSION: The results from this empirical study validate the Log Ratio thresholds that have been used in other studies to establish gene divergence/absence. Moreover, the analytical framework presented here enhances the information content derived from M-CGH data by shifting the focus from divergent/absent gene detection to accurate detection of conserved and absent genes. This approach closely aligns the technical limitations of M-CGH analysis with practical limitations on the biological interpretation of comparative genomics data.
format Text
id pubmed-1168892
institution National Center for Biotechnology Information
language English
publishDate 2005
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-11688922005-07-02 A new approach for the analysis of bacterial microarray-based Comparative Genomic Hybridization: insights from an empirical study Taboada, Eduardo N Acedillo, Rey R Luebbert, Christian C Findlay, Wendy A Nash, John HE BMC Genomics Research Article BACKGROUND: Microarray-based Comparative Genomic Hybridization (M-CGH) has been used to characterize the extensive intraspecies genetic diversity found in bacteria at the whole-genome level. Although conventional microarray analytical procedures have proved adequate in handling M-CGH data, data interpretation using these methods is based on a continuous character model in which gene divergence and gene absence form a spectrum of decreasing gene conservation levels. However, whereas gene divergence may yet be accompanied by retention in gene function, gene absence invariably leads to loss of function. This distinction, if ignored, leads to a loss in the information to be gained from M-CGH data. We present here results from experiments in which two genome-sequenced strains of C. jejuni were compared against each other using M-CGH. Because the gene content of both strains was known a priori, we were able to closely examine the effects of sequence divergence and gene absence on M-CGH data in order to define analytical parameters for M-CGH data interpretation. This would facilitate the examination of the relative effects of sequence divergence or gene absence in comparative genomics analyses of multiple strains of any species for which genome sequence data and a DNA microarray are available. RESULTS: As a first step towards improving the analysis of M-CGH data, we estimated the degree of experimental error in a series of experiments in which identical samples were compared against each other by M-CGH. This variance estimate was used to validate a Log Ratio-based methodology for identification of outliers in M-CGH data. We compared two genome strains by M-CGH to examine the effect of probe/target identity on the Log Ratios of signal intensities using prior knowledge of gene divergence and gene absence to establish Log Ratio thresholds for the identification of absent and conserved genes. CONCLUSION: The results from this empirical study validate the Log Ratio thresholds that have been used in other studies to establish gene divergence/absence. Moreover, the analytical framework presented here enhances the information content derived from M-CGH data by shifting the focus from divergent/absent gene detection to accurate detection of conserved and absent genes. This approach closely aligns the technical limitations of M-CGH analysis with practical limitations on the biological interpretation of comparative genomics data. BioMed Central 2005-05-27 /pmc/articles/PMC1168892/ /pubmed/15918914 http://dx.doi.org/10.1186/1471-2164-6-78 Text en Copyright © 2005 Taboada et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Taboada, Eduardo N
Acedillo, Rey R
Luebbert, Christian C
Findlay, Wendy A
Nash, John HE
A new approach for the analysis of bacterial microarray-based Comparative Genomic Hybridization: insights from an empirical study
title A new approach for the analysis of bacterial microarray-based Comparative Genomic Hybridization: insights from an empirical study
title_full A new approach for the analysis of bacterial microarray-based Comparative Genomic Hybridization: insights from an empirical study
title_fullStr A new approach for the analysis of bacterial microarray-based Comparative Genomic Hybridization: insights from an empirical study
title_full_unstemmed A new approach for the analysis of bacterial microarray-based Comparative Genomic Hybridization: insights from an empirical study
title_short A new approach for the analysis of bacterial microarray-based Comparative Genomic Hybridization: insights from an empirical study
title_sort new approach for the analysis of bacterial microarray-based comparative genomic hybridization: insights from an empirical study
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1168892/
https://www.ncbi.nlm.nih.gov/pubmed/15918914
http://dx.doi.org/10.1186/1471-2164-6-78
work_keys_str_mv AT taboadaeduardon anewapproachfortheanalysisofbacterialmicroarraybasedcomparativegenomichybridizationinsightsfromanempiricalstudy
AT acedilloreyr anewapproachfortheanalysisofbacterialmicroarraybasedcomparativegenomichybridizationinsightsfromanempiricalstudy
AT luebbertchristianc anewapproachfortheanalysisofbacterialmicroarraybasedcomparativegenomichybridizationinsightsfromanempiricalstudy
AT findlaywendya anewapproachfortheanalysisofbacterialmicroarraybasedcomparativegenomichybridizationinsightsfromanempiricalstudy
AT nashjohnhe anewapproachfortheanalysisofbacterialmicroarraybasedcomparativegenomichybridizationinsightsfromanempiricalstudy
AT taboadaeduardon newapproachfortheanalysisofbacterialmicroarraybasedcomparativegenomichybridizationinsightsfromanempiricalstudy
AT acedilloreyr newapproachfortheanalysisofbacterialmicroarraybasedcomparativegenomichybridizationinsightsfromanempiricalstudy
AT luebbertchristianc newapproachfortheanalysisofbacterialmicroarraybasedcomparativegenomichybridizationinsightsfromanempiricalstudy
AT findlaywendya newapproachfortheanalysisofbacterialmicroarraybasedcomparativegenomichybridizationinsightsfromanempiricalstudy
AT nashjohnhe newapproachfortheanalysisofbacterialmicroarraybasedcomparativegenomichybridizationinsightsfromanempiricalstudy