Cargando…

Data analysis issues for allele-specific expression using Illumina's GoldenGate assay

BACKGROUND: High-throughput measurement of allele-specific expression (ASE) is a relatively new and exciting application area for array-based technologies. In this paper, we explore several data sets which make use of Illumina's GoldenGate BeadArray technology to measure ASE. This platform expl...

Descripción completa

Detalles Bibliográficos
Autores principales: Ritchie, Matthew E, Forrest, Matthew S, Dimas, Antigone S, Daelemans, Caroline, Dermitzakis, Emmanouil T, Deloukas, Panagiotis, Tavaré, Simon
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2887809/
https://www.ncbi.nlm.nih.gov/pubmed/20504309
http://dx.doi.org/10.1186/1471-2105-11-280
_version_ 1782182588242198528
author Ritchie, Matthew E
Forrest, Matthew S
Dimas, Antigone S
Daelemans, Caroline
Dermitzakis, Emmanouil T
Deloukas, Panagiotis
Tavaré, Simon
author_facet Ritchie, Matthew E
Forrest, Matthew S
Dimas, Antigone S
Daelemans, Caroline
Dermitzakis, Emmanouil T
Deloukas, Panagiotis
Tavaré, Simon
author_sort Ritchie, Matthew E
collection PubMed
description BACKGROUND: High-throughput measurement of allele-specific expression (ASE) is a relatively new and exciting application area for array-based technologies. In this paper, we explore several data sets which make use of Illumina's GoldenGate BeadArray technology to measure ASE. This platform exploits coding SNPs to obtain relative expression measurements for alleles at approximately 1500 positions in the genome. RESULTS: We analyze data from a mixture experiment where genomic DNA samples from pairs of individuals of known genotypes are pooled to create allelic imbalances at varying levels for the majority of SNPs on the array. We observe that GoldenGate has less sensitivity at detecting subtle allelic imbalances (around 1.3 fold) compared to extreme imbalances, and note the benefit of applying local background correction to the data. Analysis of data from a dye-swap control experiment allowed us to quantify dye-bias, which can be reduced considerably by careful normalization. The need to filter the data before carrying out further downstream analysis to remove non-responding probes, which show either weak, or non-specific signal for each allele, was also demonstrated. Throughout this paper, we find that a linear model analysis of the data from each SNP is a flexible modelling strategy that allows for testing of allelic imbalances in each sample when replicate hybridizations are available. CONCLUSIONS: Our analysis shows that local background correction carried out by Illumina's software, together with quantile normalization of the red and green channels within each array, provides optimal performance in terms of false positive rates. In addition, we strongly encourage intensity-based filtering to remove SNPs which only measure non-specific signal. We anticipate that a similar analysis strategy will prove useful when quantifying ASE on Illumina's higher density Infinium BeadChips.
format Text
id pubmed-2887809
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28878092010-06-19 Data analysis issues for allele-specific expression using Illumina's GoldenGate assay Ritchie, Matthew E Forrest, Matthew S Dimas, Antigone S Daelemans, Caroline Dermitzakis, Emmanouil T Deloukas, Panagiotis Tavaré, Simon BMC Bioinformatics Methodology article BACKGROUND: High-throughput measurement of allele-specific expression (ASE) is a relatively new and exciting application area for array-based technologies. In this paper, we explore several data sets which make use of Illumina's GoldenGate BeadArray technology to measure ASE. This platform exploits coding SNPs to obtain relative expression measurements for alleles at approximately 1500 positions in the genome. RESULTS: We analyze data from a mixture experiment where genomic DNA samples from pairs of individuals of known genotypes are pooled to create allelic imbalances at varying levels for the majority of SNPs on the array. We observe that GoldenGate has less sensitivity at detecting subtle allelic imbalances (around 1.3 fold) compared to extreme imbalances, and note the benefit of applying local background correction to the data. Analysis of data from a dye-swap control experiment allowed us to quantify dye-bias, which can be reduced considerably by careful normalization. The need to filter the data before carrying out further downstream analysis to remove non-responding probes, which show either weak, or non-specific signal for each allele, was also demonstrated. Throughout this paper, we find that a linear model analysis of the data from each SNP is a flexible modelling strategy that allows for testing of allelic imbalances in each sample when replicate hybridizations are available. CONCLUSIONS: Our analysis shows that local background correction carried out by Illumina's software, together with quantile normalization of the red and green channels within each array, provides optimal performance in terms of false positive rates. In addition, we strongly encourage intensity-based filtering to remove SNPs which only measure non-specific signal. We anticipate that a similar analysis strategy will prove useful when quantifying ASE on Illumina's higher density Infinium BeadChips. BioMed Central 2010-05-26 /pmc/articles/PMC2887809/ /pubmed/20504309 http://dx.doi.org/10.1186/1471-2105-11-280 Text en Copyright ©2010 Ritchie et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology article
Ritchie, Matthew E
Forrest, Matthew S
Dimas, Antigone S
Daelemans, Caroline
Dermitzakis, Emmanouil T
Deloukas, Panagiotis
Tavaré, Simon
Data analysis issues for allele-specific expression using Illumina's GoldenGate assay
title Data analysis issues for allele-specific expression using Illumina's GoldenGate assay
title_full Data analysis issues for allele-specific expression using Illumina's GoldenGate assay
title_fullStr Data analysis issues for allele-specific expression using Illumina's GoldenGate assay
title_full_unstemmed Data analysis issues for allele-specific expression using Illumina's GoldenGate assay
title_short Data analysis issues for allele-specific expression using Illumina's GoldenGate assay
title_sort data analysis issues for allele-specific expression using illumina's goldengate assay
topic Methodology article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2887809/
https://www.ncbi.nlm.nih.gov/pubmed/20504309
http://dx.doi.org/10.1186/1471-2105-11-280
work_keys_str_mv AT ritchiematthewe dataanalysisissuesforallelespecificexpressionusingilluminasgoldengateassay
AT forrestmatthews dataanalysisissuesforallelespecificexpressionusingilluminasgoldengateassay
AT dimasantigones dataanalysisissuesforallelespecificexpressionusingilluminasgoldengateassay
AT daelemanscaroline dataanalysisissuesforallelespecificexpressionusingilluminasgoldengateassay
AT dermitzakisemmanouilt dataanalysisissuesforallelespecificexpressionusingilluminasgoldengateassay
AT deloukaspanagiotis dataanalysisissuesforallelespecificexpressionusingilluminasgoldengateassay
AT tavaresimon dataanalysisissuesforallelespecificexpressionusingilluminasgoldengateassay