Cargando…
A re-annotation pipeline for Illumina BeadArrays: improving the interpretation of gene expression data
Illumina BeadArrays are among the most popular and reliable platforms for gene expression profiling. However, little external scrutiny has been given to the design, selection and annotation of BeadArray probes, which is a fundamental issue in data quality and interpretation. Here we present a pipeli...
Autores principales: | , , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2817484/ https://www.ncbi.nlm.nih.gov/pubmed/19923232 http://dx.doi.org/10.1093/nar/gkp942 |
_version_ | 1782177206780297216 |
---|---|
author | Barbosa-Morais, Nuno L. Dunning, Mark J. Samarajiwa, Shamith A. Darot, Jeremy F. J. Ritchie, Matthew E. Lynch, Andy G. Tavaré, Simon |
author_facet | Barbosa-Morais, Nuno L. Dunning, Mark J. Samarajiwa, Shamith A. Darot, Jeremy F. J. Ritchie, Matthew E. Lynch, Andy G. Tavaré, Simon |
author_sort | Barbosa-Morais, Nuno L. |
collection | PubMed |
description | Illumina BeadArrays are among the most popular and reliable platforms for gene expression profiling. However, little external scrutiny has been given to the design, selection and annotation of BeadArray probes, which is a fundamental issue in data quality and interpretation. Here we present a pipeline for the complete genomic and transcriptomic re-annotation of Illumina probe sequences, also applicable to other platforms, with its output available through a Web interface and incorporated into Bioconductor packages. We have identified several problems with the design of individual probes and we show the benefits of probe re-annotation on the analysis of BeadArray gene expression data sets. We discuss the importance of aspects such as probe coverage of individual transcripts, alternative messenger RNA splicing, single-nucleotide polymorphisms, repeat sequences, RNA degradation biases and probes targeting genomic regions with no known transcription. We conclude that many of the Illumina probes have unreliable original annotation and that our re-annotation allows analyses to focus on the good quality probes, which form the majority, and also to expand the scope of biological information that can be extracted. |
format | Text |
id | pubmed-2817484 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-28174842010-02-08 A re-annotation pipeline for Illumina BeadArrays: improving the interpretation of gene expression data Barbosa-Morais, Nuno L. Dunning, Mark J. Samarajiwa, Shamith A. Darot, Jeremy F. J. Ritchie, Matthew E. Lynch, Andy G. Tavaré, Simon Nucleic Acids Res Methods Online Illumina BeadArrays are among the most popular and reliable platforms for gene expression profiling. However, little external scrutiny has been given to the design, selection and annotation of BeadArray probes, which is a fundamental issue in data quality and interpretation. Here we present a pipeline for the complete genomic and transcriptomic re-annotation of Illumina probe sequences, also applicable to other platforms, with its output available through a Web interface and incorporated into Bioconductor packages. We have identified several problems with the design of individual probes and we show the benefits of probe re-annotation on the analysis of BeadArray gene expression data sets. We discuss the importance of aspects such as probe coverage of individual transcripts, alternative messenger RNA splicing, single-nucleotide polymorphisms, repeat sequences, RNA degradation biases and probes targeting genomic regions with no known transcription. We conclude that many of the Illumina probes have unreliable original annotation and that our re-annotation allows analyses to focus on the good quality probes, which form the majority, and also to expand the scope of biological information that can be extracted. Oxford University Press 2010-01 2009-11-18 /pmc/articles/PMC2817484/ /pubmed/19923232 http://dx.doi.org/10.1093/nar/gkp942 Text en © The Author(s) 2009. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methods Online Barbosa-Morais, Nuno L. Dunning, Mark J. Samarajiwa, Shamith A. Darot, Jeremy F. J. Ritchie, Matthew E. Lynch, Andy G. Tavaré, Simon A re-annotation pipeline for Illumina BeadArrays: improving the interpretation of gene expression data |
title | A re-annotation pipeline for Illumina BeadArrays: improving the interpretation of gene expression data |
title_full | A re-annotation pipeline for Illumina BeadArrays: improving the interpretation of gene expression data |
title_fullStr | A re-annotation pipeline for Illumina BeadArrays: improving the interpretation of gene expression data |
title_full_unstemmed | A re-annotation pipeline for Illumina BeadArrays: improving the interpretation of gene expression data |
title_short | A re-annotation pipeline for Illumina BeadArrays: improving the interpretation of gene expression data |
title_sort | re-annotation pipeline for illumina beadarrays: improving the interpretation of gene expression data |
topic | Methods Online |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2817484/ https://www.ncbi.nlm.nih.gov/pubmed/19923232 http://dx.doi.org/10.1093/nar/gkp942 |
work_keys_str_mv | AT barbosamoraisnunol areannotationpipelineforilluminabeadarraysimprovingtheinterpretationofgeneexpressiondata AT dunningmarkj areannotationpipelineforilluminabeadarraysimprovingtheinterpretationofgeneexpressiondata AT samarajiwashamitha areannotationpipelineforilluminabeadarraysimprovingtheinterpretationofgeneexpressiondata AT darotjeremyfj areannotationpipelineforilluminabeadarraysimprovingtheinterpretationofgeneexpressiondata AT ritchiematthewe areannotationpipelineforilluminabeadarraysimprovingtheinterpretationofgeneexpressiondata AT lynchandyg areannotationpipelineforilluminabeadarraysimprovingtheinterpretationofgeneexpressiondata AT tavaresimon areannotationpipelineforilluminabeadarraysimprovingtheinterpretationofgeneexpressiondata AT barbosamoraisnunol reannotationpipelineforilluminabeadarraysimprovingtheinterpretationofgeneexpressiondata AT dunningmarkj reannotationpipelineforilluminabeadarraysimprovingtheinterpretationofgeneexpressiondata AT samarajiwashamitha reannotationpipelineforilluminabeadarraysimprovingtheinterpretationofgeneexpressiondata AT darotjeremyfj reannotationpipelineforilluminabeadarraysimprovingtheinterpretationofgeneexpressiondata AT ritchiematthewe reannotationpipelineforilluminabeadarraysimprovingtheinterpretationofgeneexpressiondata AT lynchandyg reannotationpipelineforilluminabeadarraysimprovingtheinterpretationofgeneexpressiondata AT tavaresimon reannotationpipelineforilluminabeadarraysimprovingtheinterpretationofgeneexpressiondata |