Cargando…
IDP-ASE: haplotyping and quantifying allele-specific expression at the gene and gene isoform level by hybrid sequencing
Allele-specific expression (ASE) is a fundamental problem in studying gene regulation and diploid transcriptome profiles, with two key challenges: (i) haplotyping and (ii) estimation of ASE at the gene isoform level. Existing ASE analysis methods are limited by a dependence on haplotyping from labor...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5952581/ https://www.ncbi.nlm.nih.gov/pubmed/27899656 http://dx.doi.org/10.1093/nar/gkw1076 |
_version_ | 1783323216784130048 |
---|---|
author | Deonovic, Benjamin Wang, Yunhao Weirather, Jason Wang, Xiu-Jie Au, Kin Fai |
author_facet | Deonovic, Benjamin Wang, Yunhao Weirather, Jason Wang, Xiu-Jie Au, Kin Fai |
author_sort | Deonovic, Benjamin |
collection | PubMed |
description | Allele-specific expression (ASE) is a fundamental problem in studying gene regulation and diploid transcriptome profiles, with two key challenges: (i) haplotyping and (ii) estimation of ASE at the gene isoform level. Existing ASE analysis methods are limited by a dependence on haplotyping from laborious experiments or extra genome/family trio data. In addition, there is a lack of methods for gene isoform level ASE analysis. We developed a tool, IDP-ASE, for full ASE analysis. By innovative integration of Third Generation Sequencing (TGS) long reads with Second Generation Sequencing (SGS) short reads, the accuracy of haplotyping and ASE quantification at the gene and gene isoform level was greatly improved as demonstrated by the gold standard data GM12878 data and semi-simulation data. In addition to methodology development, applications of IDP-ASE to human embryonic stem cells and breast cancer cells indicate that the imbalance of ASE and non-uniformity of gene isoform ASE is widespread, including tumorigenesis relevant genes and pluripotency markers. These results show that gene isoform expression and allele-specific expression cooperate to provide high diversity and complexity of gene regulation and expression, highlighting the importance of studying ASE at the gene isoform level. Our study provides a robust bioinformatics solution to understand ASE using RNA sequencing data only. |
format | Online Article Text |
id | pubmed-5952581 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-59525812018-05-18 IDP-ASE: haplotyping and quantifying allele-specific expression at the gene and gene isoform level by hybrid sequencing Deonovic, Benjamin Wang, Yunhao Weirather, Jason Wang, Xiu-Jie Au, Kin Fai Nucleic Acids Res Methods Online Allele-specific expression (ASE) is a fundamental problem in studying gene regulation and diploid transcriptome profiles, with two key challenges: (i) haplotyping and (ii) estimation of ASE at the gene isoform level. Existing ASE analysis methods are limited by a dependence on haplotyping from laborious experiments or extra genome/family trio data. In addition, there is a lack of methods for gene isoform level ASE analysis. We developed a tool, IDP-ASE, for full ASE analysis. By innovative integration of Third Generation Sequencing (TGS) long reads with Second Generation Sequencing (SGS) short reads, the accuracy of haplotyping and ASE quantification at the gene and gene isoform level was greatly improved as demonstrated by the gold standard data GM12878 data and semi-simulation data. In addition to methodology development, applications of IDP-ASE to human embryonic stem cells and breast cancer cells indicate that the imbalance of ASE and non-uniformity of gene isoform ASE is widespread, including tumorigenesis relevant genes and pluripotency markers. These results show that gene isoform expression and allele-specific expression cooperate to provide high diversity and complexity of gene regulation and expression, highlighting the importance of studying ASE at the gene isoform level. Our study provides a robust bioinformatics solution to understand ASE using RNA sequencing data only. Oxford University Press 2017-03-17 2016-11-29 /pmc/articles/PMC5952581/ /pubmed/27899656 http://dx.doi.org/10.1093/nar/gkw1076 Text en © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Methods Online Deonovic, Benjamin Wang, Yunhao Weirather, Jason Wang, Xiu-Jie Au, Kin Fai IDP-ASE: haplotyping and quantifying allele-specific expression at the gene and gene isoform level by hybrid sequencing |
title | IDP-ASE: haplotyping and quantifying allele-specific expression at the gene and gene isoform level by hybrid sequencing |
title_full | IDP-ASE: haplotyping and quantifying allele-specific expression at the gene and gene isoform level by hybrid sequencing |
title_fullStr | IDP-ASE: haplotyping and quantifying allele-specific expression at the gene and gene isoform level by hybrid sequencing |
title_full_unstemmed | IDP-ASE: haplotyping and quantifying allele-specific expression at the gene and gene isoform level by hybrid sequencing |
title_short | IDP-ASE: haplotyping and quantifying allele-specific expression at the gene and gene isoform level by hybrid sequencing |
title_sort | idp-ase: haplotyping and quantifying allele-specific expression at the gene and gene isoform level by hybrid sequencing |
topic | Methods Online |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5952581/ https://www.ncbi.nlm.nih.gov/pubmed/27899656 http://dx.doi.org/10.1093/nar/gkw1076 |
work_keys_str_mv | AT deonovicbenjamin idpasehaplotypingandquantifyingallelespecificexpressionatthegeneandgeneisoformlevelbyhybridsequencing AT wangyunhao idpasehaplotypingandquantifyingallelespecificexpressionatthegeneandgeneisoformlevelbyhybridsequencing AT weiratherjason idpasehaplotypingandquantifyingallelespecificexpressionatthegeneandgeneisoformlevelbyhybridsequencing AT wangxiujie idpasehaplotypingandquantifyingallelespecificexpressionatthegeneandgeneisoformlevelbyhybridsequencing AT aukinfai idpasehaplotypingandquantifyingallelespecificexpressionatthegeneandgeneisoformlevelbyhybridsequencing |