Analysis of differential gene expression and alternative splicing is significantly influenced by choice of reference genome
RNA-seq analysis has enabled the evaluation of transcriptional changes in many species including nonmodel organisms. However, in most species only a single reference genome is available and RNA-seq reads from highly divergent varieties are typically aligned to this reference. Here, we quantify the i...
Main Authors: | Slabaugh, Erin; Desai, Jigar S.; Sartor, Ryan C.; Lawas, Lovely Mae F.; Jagadish, S.V. Krishna; Doherty, Colleen J. |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Cold Spring Harbor Laboratory Press, 2019 |
Subjects: | Bioinformatics |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6521602/ https://www.ncbi.nlm.nih.gov/pubmed/30872414 http://dx.doi.org/10.1261/rna.070227.118 |
_version_ | 1783418996230455296 |
---|---|
author | Slabaugh, Erin Desai, Jigar S. Sartor, Ryan C. Lawas, Lovely Mae F. Jagadish, S.V. Krishna Doherty, Colleen J. |
author_facet | Slabaugh, Erin Desai, Jigar S. Sartor, Ryan C. Lawas, Lovely Mae F. Jagadish, S.V. Krishna Doherty, Colleen J. |
author_sort | Slabaugh, Erin |
collection | PubMed |
description | RNA-seq analysis has enabled the evaluation of transcriptional changes in many species including nonmodel organisms. However, in most species only a single reference genome is available and RNA-seq reads from highly divergent varieties are typically aligned to this reference. Here, we quantify the impacts of the choice of mapping genome in rice where three high-quality reference genomes are available. We aligned RNA-seq data from a popular productive rice variety to three different reference genomes and found that the identification of differentially expressed genes differed depending on which reference genome was used for mapping. Furthermore, the ability to detect differentially used transcript isoforms was profoundly affected by the choice of reference genome: Only 30% of the differentially used splicing features were detected when reads were mapped to the more commonly used, but more distantly related reference genome. This demonstrated that gene expression and splicing analysis varies considerably depending on the mapping reference genome, and that analysis of individuals that are distantly related to an available reference genome may be improved by acquisition of new genomic reference material. We observed that these differences in transcriptome analysis are, in part, due to the presence of single nucleotide polymorphisms between the sequenced individual and each respective reference genome, as well as annotation differences between the reference genomes that exist even between syntenic orthologs. We conclude that even between two closely related genomes of similar quality, using the reference genome that is most closely related to the species being sampled significantly improves transcriptome analysis. |
format | Online Article Text |
id | pubmed-6521602 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Cold Spring Harbor Laboratory Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-6521602 2020-06-01 Analysis of differential gene expression and alternative splicing is significantly influenced by choice of reference genome Slabaugh, Erin Desai, Jigar S. Sartor, Ryan C. Lawas, Lovely Mae F. Jagadish, S.V. Krishna Doherty, Colleen J. RNA Bioinformatics RNA-seq analysis has enabled the evaluation of transcriptional changes in many species including nonmodel organisms. However, in most species only a single reference genome is available and RNA-seq reads from highly divergent varieties are typically aligned to this reference. Here, we quantify the impacts of the choice of mapping genome in rice where three high-quality reference genomes are available. We aligned RNA-seq data from a popular productive rice variety to three different reference genomes and found that the identification of differentially expressed genes differed depending on which reference genome was used for mapping. Furthermore, the ability to detect differentially used transcript isoforms was profoundly affected by the choice of reference genome: Only 30% of the differentially used splicing features were detected when reads were mapped to the more commonly used, but more distantly related reference genome. This demonstrated that gene expression and splicing analysis varies considerably depending on the mapping reference genome, and that analysis of individuals that are distantly related to an available reference genome may be improved by acquisition of new genomic reference material. We observed that these differences in transcriptome analysis are, in part, due to the presence of single nucleotide polymorphisms between the sequenced individual and each respective reference genome, as well as annotation differences between the reference genomes that exist even between syntenic orthologs. We conclude that even between two closely related genomes of similar quality, using the reference genome that is most closely related to the species being sampled significantly improves transcriptome analysis. Cold Spring Harbor Laboratory Press 2019-06 /pmc/articles/PMC6521602/ /pubmed/30872414 http://dx.doi.org/10.1261/rna.070227.118 Text en © 2019 Slabaugh et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society http://creativecommons.org/licenses/by-nc/4.0/ This article is distributed exclusively by the RNA Society for the first 12 months after the full-issue publication date (see http://rnajournal.cshlp.org/site/misc/terms.xhtml). After 12 months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/. |
spellingShingle | Bioinformatics Slabaugh, Erin Desai, Jigar S. Sartor, Ryan C. Lawas, Lovely Mae F. Jagadish, S.V. Krishna Doherty, Colleen J. Analysis of differential gene expression and alternative splicing is significantly influenced by choice of reference genome |
title | Analysis of differential gene expression and alternative splicing is significantly influenced by choice of reference genome |
title_full | Analysis of differential gene expression and alternative splicing is significantly influenced by choice of reference genome |
title_fullStr | Analysis of differential gene expression and alternative splicing is significantly influenced by choice of reference genome |
title_full_unstemmed | Analysis of differential gene expression and alternative splicing is significantly influenced by choice of reference genome |
title_short | Analysis of differential gene expression and alternative splicing is significantly influenced by choice of reference genome |
title_sort | analysis of differential gene expression and alternative splicing is significantly influenced by choice of reference genome |
topic | Bioinformatics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6521602/ https://www.ncbi.nlm.nih.gov/pubmed/30872414 http://dx.doi.org/10.1261/rna.070227.118 |