Cargando…

Optimization of the BLASTN substitution matrix for prediction of non-specific DNA microarray hybridization

DNA microarray measurements are susceptible to error caused by non-specific hybridization between a probe and a target (cross-hybridization), or between two targets (bulk-hybridization). Search algorithms such as BLASTN can quickly identify potentially hybridizing sequences. We set out to improve BL...

Descripción completa

Detalles Bibliográficos
Autores principales: Eklund, Aron C., Friis, Pia, Wernersson, Rasmus, Szallasi, Zoltan
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2831327/
https://www.ncbi.nlm.nih.gov/pubmed/19969549
http://dx.doi.org/10.1093/nar/gkp1116
_version_ 1782178239044648960
author Eklund, Aron C.
Friis, Pia
Wernersson, Rasmus
Szallasi, Zoltan
author_facet Eklund, Aron C.
Friis, Pia
Wernersson, Rasmus
Szallasi, Zoltan
author_sort Eklund, Aron C.
collection PubMed
description DNA microarray measurements are susceptible to error caused by non-specific hybridization between a probe and a target (cross-hybridization), or between two targets (bulk-hybridization). Search algorithms such as BLASTN can quickly identify potentially hybridizing sequences. We set out to improve BLASTN accuracy by modifying the substitution matrix and gap penalties. We generated gene expression microarray data for samples in which 1 or 10% of the target mass was an exogenous spike of known sequence. We found that the 10% spike induced 2-fold intensity changes in 3% of the probes, two-third of which were decreases in intensity likely caused by bulk-hybridization. These changes were correlated with similarity between the spike and probe sequences. Interestingly, even very weak similarities tended to induce a change in probe intensity with the 10% spike. Using this data, we optimized the BLASTN substitution matrix to more accurately identify probes susceptible to non-specific hybridization with the spike. Relative to the default substitution matrix, the optimized matrix features a decreased score for A–T base pairs relative to G–C base pairs, resulting in a 5–15% increase in area under the ROC curve for identifying affected probes. This optimized matrix may be useful in the design of microarray probes, and in other BLASTN-based searches for hybridization partners.
format Text
id pubmed-2831327
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-28313272010-03-03 Optimization of the BLASTN substitution matrix for prediction of non-specific DNA microarray hybridization Eklund, Aron C. Friis, Pia Wernersson, Rasmus Szallasi, Zoltan Nucleic Acids Res Methods Online DNA microarray measurements are susceptible to error caused by non-specific hybridization between a probe and a target (cross-hybridization), or between two targets (bulk-hybridization). Search algorithms such as BLASTN can quickly identify potentially hybridizing sequences. We set out to improve BLASTN accuracy by modifying the substitution matrix and gap penalties. We generated gene expression microarray data for samples in which 1 or 10% of the target mass was an exogenous spike of known sequence. We found that the 10% spike induced 2-fold intensity changes in 3% of the probes, two-third of which were decreases in intensity likely caused by bulk-hybridization. These changes were correlated with similarity between the spike and probe sequences. Interestingly, even very weak similarities tended to induce a change in probe intensity with the 10% spike. Using this data, we optimized the BLASTN substitution matrix to more accurately identify probes susceptible to non-specific hybridization with the spike. Relative to the default substitution matrix, the optimized matrix features a decreased score for A–T base pairs relative to G–C base pairs, resulting in a 5–15% increase in area under the ROC curve for identifying affected probes. This optimized matrix may be useful in the design of microarray probes, and in other BLASTN-based searches for hybridization partners. Oxford University Press 2010-03 2009-12-06 /pmc/articles/PMC2831327/ /pubmed/19969549 http://dx.doi.org/10.1093/nar/gkp1116 Text en © The Author(s) 2009. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Eklund, Aron C.
Friis, Pia
Wernersson, Rasmus
Szallasi, Zoltan
Optimization of the BLASTN substitution matrix for prediction of non-specific DNA microarray hybridization
title Optimization of the BLASTN substitution matrix for prediction of non-specific DNA microarray hybridization
title_full Optimization of the BLASTN substitution matrix for prediction of non-specific DNA microarray hybridization
title_fullStr Optimization of the BLASTN substitution matrix for prediction of non-specific DNA microarray hybridization
title_full_unstemmed Optimization of the BLASTN substitution matrix for prediction of non-specific DNA microarray hybridization
title_short Optimization of the BLASTN substitution matrix for prediction of non-specific DNA microarray hybridization
title_sort optimization of the blastn substitution matrix for prediction of non-specific dna microarray hybridization
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2831327/
https://www.ncbi.nlm.nih.gov/pubmed/19969549
http://dx.doi.org/10.1093/nar/gkp1116
work_keys_str_mv AT eklundaronc optimizationoftheblastnsubstitutionmatrixforpredictionofnonspecificdnamicroarrayhybridization
AT friispia optimizationoftheblastnsubstitutionmatrixforpredictionofnonspecificdnamicroarrayhybridization
AT wernerssonrasmus optimizationoftheblastnsubstitutionmatrixforpredictionofnonspecificdnamicroarrayhybridization
AT szallasizoltan optimizationoftheblastnsubstitutionmatrixforpredictionofnonspecificdnamicroarrayhybridization