Cargando…

A re-annotation pipeline for Illumina BeadArrays: improving the interpretation of gene expression data

Illumina BeadArrays are among the most popular and reliable platforms for gene expression profiling. However, little external scrutiny has been given to the design, selection and annotation of BeadArray probes, which is a fundamental issue in data quality and interpretation. Here we present a pipeli...

Descripción completa

Detalles Bibliográficos
Autores principales: Barbosa-Morais, Nuno L., Dunning, Mark J., Samarajiwa, Shamith A., Darot, Jeremy F. J., Ritchie, Matthew E., Lynch, Andy G., Tavaré, Simon
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2817484/
https://www.ncbi.nlm.nih.gov/pubmed/19923232
http://dx.doi.org/10.1093/nar/gkp942
_version_ 1782177206780297216
author Barbosa-Morais, Nuno L.
Dunning, Mark J.
Samarajiwa, Shamith A.
Darot, Jeremy F. J.
Ritchie, Matthew E.
Lynch, Andy G.
Tavaré, Simon
author_facet Barbosa-Morais, Nuno L.
Dunning, Mark J.
Samarajiwa, Shamith A.
Darot, Jeremy F. J.
Ritchie, Matthew E.
Lynch, Andy G.
Tavaré, Simon
author_sort Barbosa-Morais, Nuno L.
collection PubMed
description Illumina BeadArrays are among the most popular and reliable platforms for gene expression profiling. However, little external scrutiny has been given to the design, selection and annotation of BeadArray probes, which is a fundamental issue in data quality and interpretation. Here we present a pipeline for the complete genomic and transcriptomic re-annotation of Illumina probe sequences, also applicable to other platforms, with its output available through a Web interface and incorporated into Bioconductor packages. We have identified several problems with the design of individual probes and we show the benefits of probe re-annotation on the analysis of BeadArray gene expression data sets. We discuss the importance of aspects such as probe coverage of individual transcripts, alternative messenger RNA splicing, single-nucleotide polymorphisms, repeat sequences, RNA degradation biases and probes targeting genomic regions with no known transcription. We conclude that many of the Illumina probes have unreliable original annotation and that our re-annotation allows analyses to focus on the good quality probes, which form the majority, and also to expand the scope of biological information that can be extracted.
format Text
id pubmed-2817484
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-28174842010-02-08 A re-annotation pipeline for Illumina BeadArrays: improving the interpretation of gene expression data Barbosa-Morais, Nuno L. Dunning, Mark J. Samarajiwa, Shamith A. Darot, Jeremy F. J. Ritchie, Matthew E. Lynch, Andy G. Tavaré, Simon Nucleic Acids Res Methods Online Illumina BeadArrays are among the most popular and reliable platforms for gene expression profiling. However, little external scrutiny has been given to the design, selection and annotation of BeadArray probes, which is a fundamental issue in data quality and interpretation. Here we present a pipeline for the complete genomic and transcriptomic re-annotation of Illumina probe sequences, also applicable to other platforms, with its output available through a Web interface and incorporated into Bioconductor packages. We have identified several problems with the design of individual probes and we show the benefits of probe re-annotation on the analysis of BeadArray gene expression data sets. We discuss the importance of aspects such as probe coverage of individual transcripts, alternative messenger RNA splicing, single-nucleotide polymorphisms, repeat sequences, RNA degradation biases and probes targeting genomic regions with no known transcription. We conclude that many of the Illumina probes have unreliable original annotation and that our re-annotation allows analyses to focus on the good quality probes, which form the majority, and also to expand the scope of biological information that can be extracted. Oxford University Press 2010-01 2009-11-18 /pmc/articles/PMC2817484/ /pubmed/19923232 http://dx.doi.org/10.1093/nar/gkp942 Text en © The Author(s) 2009. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Barbosa-Morais, Nuno L.
Dunning, Mark J.
Samarajiwa, Shamith A.
Darot, Jeremy F. J.
Ritchie, Matthew E.
Lynch, Andy G.
Tavaré, Simon
A re-annotation pipeline for Illumina BeadArrays: improving the interpretation of gene expression data
title A re-annotation pipeline for Illumina BeadArrays: improving the interpretation of gene expression data
title_full A re-annotation pipeline for Illumina BeadArrays: improving the interpretation of gene expression data
title_fullStr A re-annotation pipeline for Illumina BeadArrays: improving the interpretation of gene expression data
title_full_unstemmed A re-annotation pipeline for Illumina BeadArrays: improving the interpretation of gene expression data
title_short A re-annotation pipeline for Illumina BeadArrays: improving the interpretation of gene expression data
title_sort re-annotation pipeline for illumina beadarrays: improving the interpretation of gene expression data
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2817484/
https://www.ncbi.nlm.nih.gov/pubmed/19923232
http://dx.doi.org/10.1093/nar/gkp942
work_keys_str_mv AT barbosamoraisnunol areannotationpipelineforilluminabeadarraysimprovingtheinterpretationofgeneexpressiondata
AT dunningmarkj areannotationpipelineforilluminabeadarraysimprovingtheinterpretationofgeneexpressiondata
AT samarajiwashamitha areannotationpipelineforilluminabeadarraysimprovingtheinterpretationofgeneexpressiondata
AT darotjeremyfj areannotationpipelineforilluminabeadarraysimprovingtheinterpretationofgeneexpressiondata
AT ritchiematthewe areannotationpipelineforilluminabeadarraysimprovingtheinterpretationofgeneexpressiondata
AT lynchandyg areannotationpipelineforilluminabeadarraysimprovingtheinterpretationofgeneexpressiondata
AT tavaresimon areannotationpipelineforilluminabeadarraysimprovingtheinterpretationofgeneexpressiondata
AT barbosamoraisnunol reannotationpipelineforilluminabeadarraysimprovingtheinterpretationofgeneexpressiondata
AT dunningmarkj reannotationpipelineforilluminabeadarraysimprovingtheinterpretationofgeneexpressiondata
AT samarajiwashamitha reannotationpipelineforilluminabeadarraysimprovingtheinterpretationofgeneexpressiondata
AT darotjeremyfj reannotationpipelineforilluminabeadarraysimprovingtheinterpretationofgeneexpressiondata
AT ritchiematthewe reannotationpipelineforilluminabeadarraysimprovingtheinterpretationofgeneexpressiondata
AT lynchandyg reannotationpipelineforilluminabeadarraysimprovingtheinterpretationofgeneexpressiondata
AT tavaresimon reannotationpipelineforilluminabeadarraysimprovingtheinterpretationofgeneexpressiondata