Cargando…

IDP-ASE: haplotyping and quantifying allele-specific expression at the gene and gene isoform level by hybrid sequencing

Allele-specific expression (ASE) is a fundamental problem in studying gene regulation and diploid transcriptome profiles, with two key challenges: (i) haplotyping and (ii) estimation of ASE at the gene isoform level. Existing ASE analysis methods are limited by a dependence on haplotyping from labor...

Descripción completa

Detalles Bibliográficos
Autores principales: Deonovic, Benjamin, Wang, Yunhao, Weirather, Jason, Wang, Xiu-Jie, Au, Kin Fai
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5952581/
https://www.ncbi.nlm.nih.gov/pubmed/27899656
http://dx.doi.org/10.1093/nar/gkw1076
_version_ 1783323216784130048
author Deonovic, Benjamin
Wang, Yunhao
Weirather, Jason
Wang, Xiu-Jie
Au, Kin Fai
author_facet Deonovic, Benjamin
Wang, Yunhao
Weirather, Jason
Wang, Xiu-Jie
Au, Kin Fai
author_sort Deonovic, Benjamin
collection PubMed
description Allele-specific expression (ASE) is a fundamental problem in studying gene regulation and diploid transcriptome profiles, with two key challenges: (i) haplotyping and (ii) estimation of ASE at the gene isoform level. Existing ASE analysis methods are limited by a dependence on haplotyping from laborious experiments or extra genome/family trio data. In addition, there is a lack of methods for gene isoform level ASE analysis. We developed a tool, IDP-ASE, for full ASE analysis. By innovative integration of Third Generation Sequencing (TGS) long reads with Second Generation Sequencing (SGS) short reads, the accuracy of haplotyping and ASE quantification at the gene and gene isoform level was greatly improved as demonstrated by the gold standard data GM12878 data and semi-simulation data. In addition to methodology development, applications of IDP-ASE to human embryonic stem cells and breast cancer cells indicate that the imbalance of ASE and non-uniformity of gene isoform ASE is widespread, including tumorigenesis relevant genes and pluripotency markers. These results show that gene isoform expression and allele-specific expression cooperate to provide high diversity and complexity of gene regulation and expression, highlighting the importance of studying ASE at the gene isoform level. Our study provides a robust bioinformatics solution to understand ASE using RNA sequencing data only.
format Online
Article
Text
id pubmed-5952581
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-59525812018-05-18 IDP-ASE: haplotyping and quantifying allele-specific expression at the gene and gene isoform level by hybrid sequencing Deonovic, Benjamin Wang, Yunhao Weirather, Jason Wang, Xiu-Jie Au, Kin Fai Nucleic Acids Res Methods Online Allele-specific expression (ASE) is a fundamental problem in studying gene regulation and diploid transcriptome profiles, with two key challenges: (i) haplotyping and (ii) estimation of ASE at the gene isoform level. Existing ASE analysis methods are limited by a dependence on haplotyping from laborious experiments or extra genome/family trio data. In addition, there is a lack of methods for gene isoform level ASE analysis. We developed a tool, IDP-ASE, for full ASE analysis. By innovative integration of Third Generation Sequencing (TGS) long reads with Second Generation Sequencing (SGS) short reads, the accuracy of haplotyping and ASE quantification at the gene and gene isoform level was greatly improved as demonstrated by the gold standard data GM12878 data and semi-simulation data. In addition to methodology development, applications of IDP-ASE to human embryonic stem cells and breast cancer cells indicate that the imbalance of ASE and non-uniformity of gene isoform ASE is widespread, including tumorigenesis relevant genes and pluripotency markers. These results show that gene isoform expression and allele-specific expression cooperate to provide high diversity and complexity of gene regulation and expression, highlighting the importance of studying ASE at the gene isoform level. Our study provides a robust bioinformatics solution to understand ASE using RNA sequencing data only. Oxford University Press 2017-03-17 2016-11-29 /pmc/articles/PMC5952581/ /pubmed/27899656 http://dx.doi.org/10.1093/nar/gkw1076 Text en © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methods Online
Deonovic, Benjamin
Wang, Yunhao
Weirather, Jason
Wang, Xiu-Jie
Au, Kin Fai
IDP-ASE: haplotyping and quantifying allele-specific expression at the gene and gene isoform level by hybrid sequencing
title IDP-ASE: haplotyping and quantifying allele-specific expression at the gene and gene isoform level by hybrid sequencing
title_full IDP-ASE: haplotyping and quantifying allele-specific expression at the gene and gene isoform level by hybrid sequencing
title_fullStr IDP-ASE: haplotyping and quantifying allele-specific expression at the gene and gene isoform level by hybrid sequencing
title_full_unstemmed IDP-ASE: haplotyping and quantifying allele-specific expression at the gene and gene isoform level by hybrid sequencing
title_short IDP-ASE: haplotyping and quantifying allele-specific expression at the gene and gene isoform level by hybrid sequencing
title_sort idp-ase: haplotyping and quantifying allele-specific expression at the gene and gene isoform level by hybrid sequencing
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5952581/
https://www.ncbi.nlm.nih.gov/pubmed/27899656
http://dx.doi.org/10.1093/nar/gkw1076
work_keys_str_mv AT deonovicbenjamin idpasehaplotypingandquantifyingallelespecificexpressionatthegeneandgeneisoformlevelbyhybridsequencing
AT wangyunhao idpasehaplotypingandquantifyingallelespecificexpressionatthegeneandgeneisoformlevelbyhybridsequencing
AT weiratherjason idpasehaplotypingandquantifyingallelespecificexpressionatthegeneandgeneisoformlevelbyhybridsequencing
AT wangxiujie idpasehaplotypingandquantifyingallelespecificexpressionatthegeneandgeneisoformlevelbyhybridsequencing
AT aukinfai idpasehaplotypingandquantifyingallelespecificexpressionatthegeneandgeneisoformlevelbyhybridsequencing