Cargando…

Normalization and centering of array-based heterologous genome hybridization based on divergent control probes

BACKGROUND: Hybridization of heterologous (non-specific) nucleic acids onto arrays designed for model-organisms has been proposed as a viable genomic resource for estimating sequence variation and gene expression in non-model organisms. However, conventional methods of normalization that assume equi...

Descripción completa

Detalles Bibliográficos
Autores principales: Darby, Brian J, Jones, Kenneth L, Wheeler, David, Herman, Michael A
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3125262/
https://www.ncbi.nlm.nih.gov/pubmed/21600029
http://dx.doi.org/10.1186/1471-2105-12-183
_version_ 1782207193237422080
author Darby, Brian J
Jones, Kenneth L
Wheeler, David
Herman, Michael A
author_facet Darby, Brian J
Jones, Kenneth L
Wheeler, David
Herman, Michael A
author_sort Darby, Brian J
collection PubMed
description BACKGROUND: Hybridization of heterologous (non-specific) nucleic acids onto arrays designed for model-organisms has been proposed as a viable genomic resource for estimating sequence variation and gene expression in non-model organisms. However, conventional methods of normalization that assume equivalent distributions (such as quantile normalization) are inappropriate when applied to non-specific (heterologous) hybridization. We propose an algorithm for normalizing and centering intensity data from heterologous hybridization that makes no prior assumptions of distribution, reduces the false appearance of homology, and provides a way for researchers to confirm whether heterologous hybridization is suitable. RESULTS: Data are normalized by adjusting for Gibbs free energy binding, and centered by adjusting for the median of a common set of control probes assumed to be equivalently dissimilar for all species. This procedure was compared to existing approaches and found to be as successful as Loess normalization at detecting sequence variations (deletions) and even more successful than quantile normalization at reducing the accumulation of false positive probe matches between two related nematode species, Caenorhabditis elegans and C. briggsae. Despite the improvements, we still found that probe fluorescence intensity was too poorly correlated with sequence similarity to result in reliable detection of matching probe sequence. CONCLUSIONS: Cross-species hybridizations can be a way to adapt genome-enabled tools for closely related non-model organisms, but data must be appropriately normalized and centered in a way that accommodates hybridization of nucleic acids with diverged sequence. For short, 25-mer probes, hybridization intensity alone may be insufficiently correlated with sequence similarity to allow reliable inference of homology at the probe level.
format Online
Article
Text
id pubmed-3125262
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-31252622011-06-29 Normalization and centering of array-based heterologous genome hybridization based on divergent control probes Darby, Brian J Jones, Kenneth L Wheeler, David Herman, Michael A BMC Bioinformatics Research Article BACKGROUND: Hybridization of heterologous (non-specific) nucleic acids onto arrays designed for model-organisms has been proposed as a viable genomic resource for estimating sequence variation and gene expression in non-model organisms. However, conventional methods of normalization that assume equivalent distributions (such as quantile normalization) are inappropriate when applied to non-specific (heterologous) hybridization. We propose an algorithm for normalizing and centering intensity data from heterologous hybridization that makes no prior assumptions of distribution, reduces the false appearance of homology, and provides a way for researchers to confirm whether heterologous hybridization is suitable. RESULTS: Data are normalized by adjusting for Gibbs free energy binding, and centered by adjusting for the median of a common set of control probes assumed to be equivalently dissimilar for all species. This procedure was compared to existing approaches and found to be as successful as Loess normalization at detecting sequence variations (deletions) and even more successful than quantile normalization at reducing the accumulation of false positive probe matches between two related nematode species, Caenorhabditis elegans and C. briggsae. Despite the improvements, we still found that probe fluorescence intensity was too poorly correlated with sequence similarity to result in reliable detection of matching probe sequence. CONCLUSIONS: Cross-species hybridizations can be a way to adapt genome-enabled tools for closely related non-model organisms, but data must be appropriately normalized and centered in a way that accommodates hybridization of nucleic acids with diverged sequence. For short, 25-mer probes, hybridization intensity alone may be insufficiently correlated with sequence similarity to allow reliable inference of homology at the probe level. BioMed Central 2011-05-21 /pmc/articles/PMC3125262/ /pubmed/21600029 http://dx.doi.org/10.1186/1471-2105-12-183 Text en Copyright ©2011 Darby et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Darby, Brian J
Jones, Kenneth L
Wheeler, David
Herman, Michael A
Normalization and centering of array-based heterologous genome hybridization based on divergent control probes
title Normalization and centering of array-based heterologous genome hybridization based on divergent control probes
title_full Normalization and centering of array-based heterologous genome hybridization based on divergent control probes
title_fullStr Normalization and centering of array-based heterologous genome hybridization based on divergent control probes
title_full_unstemmed Normalization and centering of array-based heterologous genome hybridization based on divergent control probes
title_short Normalization and centering of array-based heterologous genome hybridization based on divergent control probes
title_sort normalization and centering of array-based heterologous genome hybridization based on divergent control probes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3125262/
https://www.ncbi.nlm.nih.gov/pubmed/21600029
http://dx.doi.org/10.1186/1471-2105-12-183
work_keys_str_mv AT darbybrianj normalizationandcenteringofarraybasedheterologousgenomehybridizationbasedondivergentcontrolprobes
AT joneskennethl normalizationandcenteringofarraybasedheterologousgenomehybridizationbasedondivergentcontrolprobes
AT wheelerdavid normalizationandcenteringofarraybasedheterologousgenomehybridizationbasedondivergentcontrolprobes
AT hermanmichaela normalizationandcenteringofarraybasedheterologousgenomehybridizationbasedondivergentcontrolprobes