Cargando…
Gene Expression and Isoform Variation Analysis using Affymetrix Exon Arrays
BACKGROUND: Alternative splicing and isoform level expression profiling is an emerging field of interest within genomics. Splicing sensitive microarrays, with probes targeted to individual exons or exon-junctions, are becoming increasingly popular as a tool capable of both expression profiling and f...
Autores principales: | , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2008
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2585104/ https://www.ncbi.nlm.nih.gov/pubmed/18990248 http://dx.doi.org/10.1186/1471-2164-9-529 |
_version_ | 1782160839559610368 |
---|---|
author | Bemmo, Amandine Benovoy, David Kwan, Tony Gaffney, Daniel J Jensen, Roderick V Majewski, Jacek |
author_facet | Bemmo, Amandine Benovoy, David Kwan, Tony Gaffney, Daniel J Jensen, Roderick V Majewski, Jacek |
author_sort | Bemmo, Amandine |
collection | PubMed |
description | BACKGROUND: Alternative splicing and isoform level expression profiling is an emerging field of interest within genomics. Splicing sensitive microarrays, with probes targeted to individual exons or exon-junctions, are becoming increasingly popular as a tool capable of both expression profiling and finer scale isoform detection. Despite their intuitive appeal, relatively little is known about the performance of such tools, particularly in comparison with more traditional 3' targeted microarrays. Here, we use the well studied Microarray Quality Control (MAQC) dataset to benchmark the Affymetrix Exon Array, and compare it to two other popular platforms: Illumina, and Affymetrix U133. RESULTS: We show that at the gene expression level, the Exon Array performs comparably with the two 3' targeted platforms. However, the interplatform correlation of the results is slightly lower than between the two 3' arrays. We show that some of the discrepancies stem from the RNA amplification protocols, e.g. the Exon Array is able to detect expression of non-polyadenylated transcripts. More importantly, we show that many other differences result from the ability of the Exon Array to monitor more detailed isoform-level changes; several examples illustrate that changes detected by the 3' platforms are actually isoform variations, and that the nature of these variations can be resolved using Exon Array data. Finally, we show how the Exon Array can be used to detect alternative isoform differences, such as alternative splicing, transcript termination, and alternative promoter usage. We discuss the possible pitfalls and false positives resulting from isoform-level analysis. CONCLUSION: The Exon Array is a valuable tool that can be used to profile gene expression while providing important additional information regarding the types of gene isoforms that are expressed and variable. However, analysis of alternative splicing requires much more hands on effort and visualization of results in order to correctly interpret the data, and generally results in considerably higher false positive rates than expression analysis. One of the main sources of error in the MAQC dataset is variation in amplification efficiency across transcripts, most likely caused by joint effects of elevated GC content in the 5' ends of genes and reduced likelihood of random-primed first strand synthesis in the 3' ends of genes. These effects are currently not adequately corrected using existing statistical methods. We outline approaches to reduce such errors by filtering out potentially problematic data. |
format | Text |
id | pubmed-2585104 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2008 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-25851042008-11-20 Gene Expression and Isoform Variation Analysis using Affymetrix Exon Arrays Bemmo, Amandine Benovoy, David Kwan, Tony Gaffney, Daniel J Jensen, Roderick V Majewski, Jacek BMC Genomics Methodology Article BACKGROUND: Alternative splicing and isoform level expression profiling is an emerging field of interest within genomics. Splicing sensitive microarrays, with probes targeted to individual exons or exon-junctions, are becoming increasingly popular as a tool capable of both expression profiling and finer scale isoform detection. Despite their intuitive appeal, relatively little is known about the performance of such tools, particularly in comparison with more traditional 3' targeted microarrays. Here, we use the well studied Microarray Quality Control (MAQC) dataset to benchmark the Affymetrix Exon Array, and compare it to two other popular platforms: Illumina, and Affymetrix U133. RESULTS: We show that at the gene expression level, the Exon Array performs comparably with the two 3' targeted platforms. However, the interplatform correlation of the results is slightly lower than between the two 3' arrays. We show that some of the discrepancies stem from the RNA amplification protocols, e.g. the Exon Array is able to detect expression of non-polyadenylated transcripts. More importantly, we show that many other differences result from the ability of the Exon Array to monitor more detailed isoform-level changes; several examples illustrate that changes detected by the 3' platforms are actually isoform variations, and that the nature of these variations can be resolved using Exon Array data. Finally, we show how the Exon Array can be used to detect alternative isoform differences, such as alternative splicing, transcript termination, and alternative promoter usage. We discuss the possible pitfalls and false positives resulting from isoform-level analysis. CONCLUSION: The Exon Array is a valuable tool that can be used to profile gene expression while providing important additional information regarding the types of gene isoforms that are expressed and variable. However, analysis of alternative splicing requires much more hands on effort and visualization of results in order to correctly interpret the data, and generally results in considerably higher false positive rates than expression analysis. One of the main sources of error in the MAQC dataset is variation in amplification efficiency across transcripts, most likely caused by joint effects of elevated GC content in the 5' ends of genes and reduced likelihood of random-primed first strand synthesis in the 3' ends of genes. These effects are currently not adequately corrected using existing statistical methods. We outline approaches to reduce such errors by filtering out potentially problematic data. BioMed Central 2008-11-07 /pmc/articles/PMC2585104/ /pubmed/18990248 http://dx.doi.org/10.1186/1471-2164-9-529 Text en Copyright © 2008 Bemmo et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methodology Article Bemmo, Amandine Benovoy, David Kwan, Tony Gaffney, Daniel J Jensen, Roderick V Majewski, Jacek Gene Expression and Isoform Variation Analysis using Affymetrix Exon Arrays |
title | Gene Expression and Isoform Variation Analysis using Affymetrix Exon Arrays |
title_full | Gene Expression and Isoform Variation Analysis using Affymetrix Exon Arrays |
title_fullStr | Gene Expression and Isoform Variation Analysis using Affymetrix Exon Arrays |
title_full_unstemmed | Gene Expression and Isoform Variation Analysis using Affymetrix Exon Arrays |
title_short | Gene Expression and Isoform Variation Analysis using Affymetrix Exon Arrays |
title_sort | gene expression and isoform variation analysis using affymetrix exon arrays |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2585104/ https://www.ncbi.nlm.nih.gov/pubmed/18990248 http://dx.doi.org/10.1186/1471-2164-9-529 |
work_keys_str_mv | AT bemmoamandine geneexpressionandisoformvariationanalysisusingaffymetrixexonarrays AT benovoydavid geneexpressionandisoformvariationanalysisusingaffymetrixexonarrays AT kwantony geneexpressionandisoformvariationanalysisusingaffymetrixexonarrays AT gaffneydanielj geneexpressionandisoformvariationanalysisusingaffymetrixexonarrays AT jensenroderickv geneexpressionandisoformvariationanalysisusingaffymetrixexonarrays AT majewskijacek geneexpressionandisoformvariationanalysisusingaffymetrixexonarrays |