Cargando…

Recommendations for Accurate Resolution of Gene and Isoform Allele-Specific Expression in RNA-Seq Data

Genetic variation modulates gene expression transcriptionally or post-transcriptionally, and can profoundly alter an individual’s phenotype. Measuring allelic differential expression at heterozygous loci within an individual, a phenomenon called allele-specific expression (ASE), can assist in identi...

Descripción completa

Detalles Bibliográficos
Autores principales: Wood, David L. A., Nones, Katia, Steptoe, Anita, Christ, Angelika, Harliwong, Ivon, Newell, Felicity, Bruxner, Timothy J. C., Miller, David, Cloonan, Nicole, Grimmond, Sean M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4428808/
https://www.ncbi.nlm.nih.gov/pubmed/25965996
http://dx.doi.org/10.1371/journal.pone.0126911
_version_ 1782370942085758976
author Wood, David L. A.
Nones, Katia
Steptoe, Anita
Christ, Angelika
Harliwong, Ivon
Newell, Felicity
Bruxner, Timothy J. C.
Miller, David
Cloonan, Nicole
Grimmond, Sean M.
author_facet Wood, David L. A.
Nones, Katia
Steptoe, Anita
Christ, Angelika
Harliwong, Ivon
Newell, Felicity
Bruxner, Timothy J. C.
Miller, David
Cloonan, Nicole
Grimmond, Sean M.
author_sort Wood, David L. A.
collection PubMed
description Genetic variation modulates gene expression transcriptionally or post-transcriptionally, and can profoundly alter an individual’s phenotype. Measuring allelic differential expression at heterozygous loci within an individual, a phenomenon called allele-specific expression (ASE), can assist in identifying such factors. Massively parallel DNA and RNA sequencing and advances in bioinformatic methodologies provide an outstanding opportunity to measure ASE genome-wide. In this study, matched DNA and RNA sequencing, genotyping arrays and computationally phased haplotypes were integrated to comprehensively and conservatively quantify ASE in a single human brain and liver tissue sample. We describe a methodological evaluation and assessment of common bioinformatic steps for ASE quantification, and recommend a robust approach to accurately measure SNP, gene and isoform ASE through the use of personalized haplotype genome alignment, strict alignment quality control and intragenic SNP aggregation. Our results indicate that accurate ASE quantification requires careful bioinformatic analyses and is adversely affected by sample specific alignment confounders and random sampling even at moderate sequence depths. We identified multiple known and several novel ASE genes in liver, including WDR72, DSP and UBD, as well as genes that contained ASE SNPs with imbalance direction discordant with haplotype phase, explainable by annotated transcript structure, suggesting isoform derived ASE. The methods evaluated in this study will be of use to researchers performing highly conservative quantification of ASE, and the genes and isoforms identified as ASE of interest to researchers studying those loci.
format Online
Article
Text
id pubmed-4428808
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-44288082015-05-21 Recommendations for Accurate Resolution of Gene and Isoform Allele-Specific Expression in RNA-Seq Data Wood, David L. A. Nones, Katia Steptoe, Anita Christ, Angelika Harliwong, Ivon Newell, Felicity Bruxner, Timothy J. C. Miller, David Cloonan, Nicole Grimmond, Sean M. PLoS One Research Article Genetic variation modulates gene expression transcriptionally or post-transcriptionally, and can profoundly alter an individual’s phenotype. Measuring allelic differential expression at heterozygous loci within an individual, a phenomenon called allele-specific expression (ASE), can assist in identifying such factors. Massively parallel DNA and RNA sequencing and advances in bioinformatic methodologies provide an outstanding opportunity to measure ASE genome-wide. In this study, matched DNA and RNA sequencing, genotyping arrays and computationally phased haplotypes were integrated to comprehensively and conservatively quantify ASE in a single human brain and liver tissue sample. We describe a methodological evaluation and assessment of common bioinformatic steps for ASE quantification, and recommend a robust approach to accurately measure SNP, gene and isoform ASE through the use of personalized haplotype genome alignment, strict alignment quality control and intragenic SNP aggregation. Our results indicate that accurate ASE quantification requires careful bioinformatic analyses and is adversely affected by sample specific alignment confounders and random sampling even at moderate sequence depths. We identified multiple known and several novel ASE genes in liver, including WDR72, DSP and UBD, as well as genes that contained ASE SNPs with imbalance direction discordant with haplotype phase, explainable by annotated transcript structure, suggesting isoform derived ASE. The methods evaluated in this study will be of use to researchers performing highly conservative quantification of ASE, and the genes and isoforms identified as ASE of interest to researchers studying those loci. Public Library of Science 2015-05-12 /pmc/articles/PMC4428808/ /pubmed/25965996 http://dx.doi.org/10.1371/journal.pone.0126911 Text en © 2015 Wood et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Wood, David L. A.
Nones, Katia
Steptoe, Anita
Christ, Angelika
Harliwong, Ivon
Newell, Felicity
Bruxner, Timothy J. C.
Miller, David
Cloonan, Nicole
Grimmond, Sean M.
Recommendations for Accurate Resolution of Gene and Isoform Allele-Specific Expression in RNA-Seq Data
title Recommendations for Accurate Resolution of Gene and Isoform Allele-Specific Expression in RNA-Seq Data
title_full Recommendations for Accurate Resolution of Gene and Isoform Allele-Specific Expression in RNA-Seq Data
title_fullStr Recommendations for Accurate Resolution of Gene and Isoform Allele-Specific Expression in RNA-Seq Data
title_full_unstemmed Recommendations for Accurate Resolution of Gene and Isoform Allele-Specific Expression in RNA-Seq Data
title_short Recommendations for Accurate Resolution of Gene and Isoform Allele-Specific Expression in RNA-Seq Data
title_sort recommendations for accurate resolution of gene and isoform allele-specific expression in rna-seq data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4428808/
https://www.ncbi.nlm.nih.gov/pubmed/25965996
http://dx.doi.org/10.1371/journal.pone.0126911
work_keys_str_mv AT wooddavidla recommendationsforaccurateresolutionofgeneandisoformallelespecificexpressioninrnaseqdata
AT noneskatia recommendationsforaccurateresolutionofgeneandisoformallelespecificexpressioninrnaseqdata
AT steptoeanita recommendationsforaccurateresolutionofgeneandisoformallelespecificexpressioninrnaseqdata
AT christangelika recommendationsforaccurateresolutionofgeneandisoformallelespecificexpressioninrnaseqdata
AT harliwongivon recommendationsforaccurateresolutionofgeneandisoformallelespecificexpressioninrnaseqdata
AT newellfelicity recommendationsforaccurateresolutionofgeneandisoformallelespecificexpressioninrnaseqdata
AT bruxnertimothyjc recommendationsforaccurateresolutionofgeneandisoformallelespecificexpressioninrnaseqdata
AT millerdavid recommendationsforaccurateresolutionofgeneandisoformallelespecificexpressioninrnaseqdata
AT cloonannicole recommendationsforaccurateresolutionofgeneandisoformallelespecificexpressioninrnaseqdata
AT grimmondseanm recommendationsforaccurateresolutionofgeneandisoformallelespecificexpressioninrnaseqdata