Cargando…
Selection of long oligonucleotides for gene expression microarrays using weighted rank-sum strategy
BACKGROUND: The design of long oligonucleotides for spotted DNA microarrays requires detailed attention to ensure their optimal performance in the hybridization process. The main challenge is to select an optimal oligonucleotide element that represents each genetic locus/gene in the genome and is un...
Autores principales: | , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2007
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2099447/ https://www.ncbi.nlm.nih.gov/pubmed/17880708 http://dx.doi.org/10.1186/1471-2105-8-350 |
_version_ | 1782138312012595200 |
---|---|
author | Hu, Guangan Llinás, Manuel Li, Jingguang Preiser, Peter Rainer Bozdech, Zbynek |
author_facet | Hu, Guangan Llinás, Manuel Li, Jingguang Preiser, Peter Rainer Bozdech, Zbynek |
author_sort | Hu, Guangan |
collection | PubMed |
description | BACKGROUND: The design of long oligonucleotides for spotted DNA microarrays requires detailed attention to ensure their optimal performance in the hybridization process. The main challenge is to select an optimal oligonucleotide element that represents each genetic locus/gene in the genome and is unique, devoid of internal structures and repetitive sequences and its Tm is uniform with all other elements on the microarray. Currently, all of the publicly available programs for DNA long oligonucleotide microarray selection utilize various combinations of cutoffs in which each parameter (uniqueness, Tm, and secondary structure) is evaluated and filtered individually. The use of the cutoffs can, however, lead to information loss and to selection of suboptimal oligonucleotides, especially for genomes with extreme distribution of the GC content, a large proportion of repetitive sequences or the presence of large gene families with highly homologous members. RESULTS: Here we present the program OligoRankPick which is using a weighted rank-based strategy to select microarray oligonucleotide elements via an integer weighted linear function. This approach optimizes the selection criteria (weight score) for each gene individually, accommodating variable properties of the DNA sequence along the genome. The designed algorithm was tested using three microbial genomes Escherichia coli, Saccharomyces cerevisiae and the human malaria parasite species Plasmodium falciparum. In comparison to other published algorithms OligoRankPick provides significant improvements in oligonucleotide design for all three genomes with the most significant improvements observed in the microarray design for P. falciparum whose genome is characterized by large fluctuations of GC content, and abundant gene duplications. CONCLUSION: OligoRankPick is an efficient tool for the design of long oligonucleotide DNA microarrays which does not rely on direct oligonucleotide exclusion by parameter cutoffs but instead optimizes all parameters in context of each other. The weighted rank-sum strategy utilized by this algorithm provides high flexibility of oligonucleotide selection which accommodates extreme variability of DNA sequence properties along genomes of many organisms. |
format | Text |
id | pubmed-2099447 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2007 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-20994472007-12-03 Selection of long oligonucleotides for gene expression microarrays using weighted rank-sum strategy Hu, Guangan Llinás, Manuel Li, Jingguang Preiser, Peter Rainer Bozdech, Zbynek BMC Bioinformatics Methodology Article BACKGROUND: The design of long oligonucleotides for spotted DNA microarrays requires detailed attention to ensure their optimal performance in the hybridization process. The main challenge is to select an optimal oligonucleotide element that represents each genetic locus/gene in the genome and is unique, devoid of internal structures and repetitive sequences and its Tm is uniform with all other elements on the microarray. Currently, all of the publicly available programs for DNA long oligonucleotide microarray selection utilize various combinations of cutoffs in which each parameter (uniqueness, Tm, and secondary structure) is evaluated and filtered individually. The use of the cutoffs can, however, lead to information loss and to selection of suboptimal oligonucleotides, especially for genomes with extreme distribution of the GC content, a large proportion of repetitive sequences or the presence of large gene families with highly homologous members. RESULTS: Here we present the program OligoRankPick which is using a weighted rank-based strategy to select microarray oligonucleotide elements via an integer weighted linear function. This approach optimizes the selection criteria (weight score) for each gene individually, accommodating variable properties of the DNA sequence along the genome. The designed algorithm was tested using three microbial genomes Escherichia coli, Saccharomyces cerevisiae and the human malaria parasite species Plasmodium falciparum. In comparison to other published algorithms OligoRankPick provides significant improvements in oligonucleotide design for all three genomes with the most significant improvements observed in the microarray design for P. falciparum whose genome is characterized by large fluctuations of GC content, and abundant gene duplications. CONCLUSION: OligoRankPick is an efficient tool for the design of long oligonucleotide DNA microarrays which does not rely on direct oligonucleotide exclusion by parameter cutoffs but instead optimizes all parameters in context of each other. The weighted rank-sum strategy utilized by this algorithm provides high flexibility of oligonucleotide selection which accommodates extreme variability of DNA sequence properties along genomes of many organisms. BioMed Central 2007-09-19 /pmc/articles/PMC2099447/ /pubmed/17880708 http://dx.doi.org/10.1186/1471-2105-8-350 Text en Copyright © 2007 Hu et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methodology Article Hu, Guangan Llinás, Manuel Li, Jingguang Preiser, Peter Rainer Bozdech, Zbynek Selection of long oligonucleotides for gene expression microarrays using weighted rank-sum strategy |
title | Selection of long oligonucleotides for gene expression microarrays using weighted rank-sum strategy |
title_full | Selection of long oligonucleotides for gene expression microarrays using weighted rank-sum strategy |
title_fullStr | Selection of long oligonucleotides for gene expression microarrays using weighted rank-sum strategy |
title_full_unstemmed | Selection of long oligonucleotides for gene expression microarrays using weighted rank-sum strategy |
title_short | Selection of long oligonucleotides for gene expression microarrays using weighted rank-sum strategy |
title_sort | selection of long oligonucleotides for gene expression microarrays using weighted rank-sum strategy |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2099447/ https://www.ncbi.nlm.nih.gov/pubmed/17880708 http://dx.doi.org/10.1186/1471-2105-8-350 |
work_keys_str_mv | AT huguangan selectionoflongoligonucleotidesforgeneexpressionmicroarraysusingweightedranksumstrategy AT llinasmanuel selectionoflongoligonucleotidesforgeneexpressionmicroarraysusingweightedranksumstrategy AT lijingguang selectionoflongoligonucleotidesforgeneexpressionmicroarraysusingweightedranksumstrategy AT preiserpeterrainer selectionoflongoligonucleotidesforgeneexpressionmicroarraysusingweightedranksumstrategy AT bozdechzbynek selectionoflongoligonucleotidesforgeneexpressionmicroarraysusingweightedranksumstrategy |