Cargando…

OligoRAP – an Oligo Re-Annotation Pipeline to improve annotation and estimate target specificity

BACKGROUND: High throughput gene expression studies using oligonucleotide microarrays depend on the specificity of each oligonucleotide (oligo or probe) for its target gene. However, target specific probes can only be designed when a reference genome of the species at hand were completely sequenced,...

Descripción completa

Detalles Bibliográficos
Autores principales: Neerincx, Pieter BT, Rauwerda, Han, Nie, Haisheng, Groenen, Martien AM, Breit, Timo M, Leunissen, Jack AM
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2712747/
https://www.ncbi.nlm.nih.gov/pubmed/19615117
http://dx.doi.org/10.1186/1753-6561-3-S4-S4
_version_ 1782169524954464256
author Neerincx, Pieter BT
Rauwerda, Han
Nie, Haisheng
Groenen, Martien AM
Breit, Timo M
Leunissen, Jack AM
author_facet Neerincx, Pieter BT
Rauwerda, Han
Nie, Haisheng
Groenen, Martien AM
Breit, Timo M
Leunissen, Jack AM
author_sort Neerincx, Pieter BT
collection PubMed
description BACKGROUND: High throughput gene expression studies using oligonucleotide microarrays depend on the specificity of each oligonucleotide (oligo or probe) for its target gene. However, target specific probes can only be designed when a reference genome of the species at hand were completely sequenced, when this genome were completely annotated and when the genetic variation of the sampled individuals were completely known. Unfortunately there is not a single species for which such a complete data set is available. Therefore, it is important that probe annotation can be updated frequently for optimal interpretation of microarray experiments. RESULTS: In this paper we present OligoRAP, a pipeline to automatically update the annotation of oligo libraries and estimate oligo target specificity. OligoRAP uses a reference genome assembly with Ensembl and Entrez Gene annotation supplemented with a set of unmapped transcripts derived from RefSeq and UniGene to handle assembly gaps. OligoRAP produces alignments of each oligo with the reference assembly as well as with unmapped transcripts. These alignments are re-mapped to the annotation sources, which results in a concise, as complete as possible and up-to-date annotation of the oligo library. The building blocks of this pipeline are BioMoby web services creating a highly modular and distributed system with a robust, remote programmatic interface. OligoRAP was used to update the annotation for a subset of 791 oligos from the ARK-Genomics 20 K chicken array, which were selected as starting material for the oligo annotation session of the EADGENE/SABRE Post-analysis workshop. Based on the updated annotation about one third of these oligos is problematic with regard to target specificity. In addition, the accession numbers or ids the oligos were originally designed for no longer exist in the updated annotation for almost half of the oligos. CONCLUSION: As microarrays are designed on incomplete data, it is important to update probe annotation and check target specificity regularly. OligoRAP provides both and due to its design based on BioMoby web services it can easily be embedded as an oligo annotation engine in customised applications for microarray data analysis. The dramatic difference in updated annotation and target specificity for the ARK-Genomics 20 K chicken array as compared to the original data emphasises the need for regular updates.
format Text
id pubmed-2712747
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-27127472009-07-20 OligoRAP – an Oligo Re-Annotation Pipeline to improve annotation and estimate target specificity Neerincx, Pieter BT Rauwerda, Han Nie, Haisheng Groenen, Martien AM Breit, Timo M Leunissen, Jack AM BMC Proc Research BACKGROUND: High throughput gene expression studies using oligonucleotide microarrays depend on the specificity of each oligonucleotide (oligo or probe) for its target gene. However, target specific probes can only be designed when a reference genome of the species at hand were completely sequenced, when this genome were completely annotated and when the genetic variation of the sampled individuals were completely known. Unfortunately there is not a single species for which such a complete data set is available. Therefore, it is important that probe annotation can be updated frequently for optimal interpretation of microarray experiments. RESULTS: In this paper we present OligoRAP, a pipeline to automatically update the annotation of oligo libraries and estimate oligo target specificity. OligoRAP uses a reference genome assembly with Ensembl and Entrez Gene annotation supplemented with a set of unmapped transcripts derived from RefSeq and UniGene to handle assembly gaps. OligoRAP produces alignments of each oligo with the reference assembly as well as with unmapped transcripts. These alignments are re-mapped to the annotation sources, which results in a concise, as complete as possible and up-to-date annotation of the oligo library. The building blocks of this pipeline are BioMoby web services creating a highly modular and distributed system with a robust, remote programmatic interface. OligoRAP was used to update the annotation for a subset of 791 oligos from the ARK-Genomics 20 K chicken array, which were selected as starting material for the oligo annotation session of the EADGENE/SABRE Post-analysis workshop. Based on the updated annotation about one third of these oligos is problematic with regard to target specificity. In addition, the accession numbers or ids the oligos were originally designed for no longer exist in the updated annotation for almost half of the oligos. CONCLUSION: As microarrays are designed on incomplete data, it is important to update probe annotation and check target specificity regularly. OligoRAP provides both and due to its design based on BioMoby web services it can easily be embedded as an oligo annotation engine in customised applications for microarray data analysis. The dramatic difference in updated annotation and target specificity for the ARK-Genomics 20 K chicken array as compared to the original data emphasises the need for regular updates. BioMed Central 2009-07-16 /pmc/articles/PMC2712747/ /pubmed/19615117 http://dx.doi.org/10.1186/1753-6561-3-S4-S4 Text en Copyright © 2009 Neerincx et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Neerincx, Pieter BT
Rauwerda, Han
Nie, Haisheng
Groenen, Martien AM
Breit, Timo M
Leunissen, Jack AM
OligoRAP – an Oligo Re-Annotation Pipeline to improve annotation and estimate target specificity
title OligoRAP – an Oligo Re-Annotation Pipeline to improve annotation and estimate target specificity
title_full OligoRAP – an Oligo Re-Annotation Pipeline to improve annotation and estimate target specificity
title_fullStr OligoRAP – an Oligo Re-Annotation Pipeline to improve annotation and estimate target specificity
title_full_unstemmed OligoRAP – an Oligo Re-Annotation Pipeline to improve annotation and estimate target specificity
title_short OligoRAP – an Oligo Re-Annotation Pipeline to improve annotation and estimate target specificity
title_sort oligorap – an oligo re-annotation pipeline to improve annotation and estimate target specificity
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2712747/
https://www.ncbi.nlm.nih.gov/pubmed/19615117
http://dx.doi.org/10.1186/1753-6561-3-S4-S4
work_keys_str_mv AT neerincxpieterbt oligorapanoligoreannotationpipelinetoimproveannotationandestimatetargetspecificity
AT rauwerdahan oligorapanoligoreannotationpipelinetoimproveannotationandestimatetargetspecificity
AT niehaisheng oligorapanoligoreannotationpipelinetoimproveannotationandestimatetargetspecificity
AT groenenmartienam oligorapanoligoreannotationpipelinetoimproveannotationandestimatetargetspecificity
AT breittimom oligorapanoligoreannotationpipelinetoimproveannotationandestimatetargetspecificity
AT leunissenjackam oligorapanoligoreannotationpipelinetoimproveannotationandestimatetargetspecificity