Cargando…

A haplotype-based normalization technique for the analysis and detection of allele specific expression

BACKGROUND: Allele specific expression (ASE) has become an important phenotype, being utilized for the detection of cis-regulatory variation, nonsense mediated decay and imprinting in the personal genome, and has been used to both identify disease loci and consider the penetrance of damaging alleles...

Descripción completa

Detalles Bibliográficos
Autores principales: Hodgkinson, Alan, Grenier, Jean-Christophe, Gbeha, Elias, Awadalla, Philip
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5020486/
https://www.ncbi.nlm.nih.gov/pubmed/27618913
http://dx.doi.org/10.1186/s12859-016-1238-8
_version_ 1782453211917975552
author Hodgkinson, Alan
Grenier, Jean-Christophe
Gbeha, Elias
Awadalla, Philip
author_facet Hodgkinson, Alan
Grenier, Jean-Christophe
Gbeha, Elias
Awadalla, Philip
author_sort Hodgkinson, Alan
collection PubMed
description BACKGROUND: Allele specific expression (ASE) has become an important phenotype, being utilized for the detection of cis-regulatory variation, nonsense mediated decay and imprinting in the personal genome, and has been used to both identify disease loci and consider the penetrance of damaging alleles. The detection of ASE using high throughput technologies relies on aligning short-read sequencing data, a process that has inherent biases, and there is still a need to develop fast and accurate methods to detect ASE given the unprecedented growth of sequencing information in big data projects. RESULTS: Here, we present a new approach to normalize RNA sequencing data in order to call ASE events with high precision in a short time-frame. Using simulated datasets we find that our approach dramatically improves reference allele quantification at heterozygous sites versus default mapping methods and also performs well compared to existing techniques for ASE detection, such as filtering methods and mapping to parental genomes, without the need for complex and time consuming manipulation. Finally, by sequencing the exomes and transcriptomes of 96 well-phenotyped individuals of the CARTaGENE cohort, we characterise the levels of ASE across individuals and find a significant association between the proportion of sites undergoing ASE within the genome and smoking. CONCLUSIONS: The correct treatment and analysis of RNA sequencing data is vital to control for mapping biases and detect genuine ASE signals. By normalising RNA sequencing information after mapping, we show that this approach can be used to identify biologically relevant signals in personal genomes. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1238-8) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5020486
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-50204862016-09-20 A haplotype-based normalization technique for the analysis and detection of allele specific expression Hodgkinson, Alan Grenier, Jean-Christophe Gbeha, Elias Awadalla, Philip BMC Bioinformatics Methodology Article BACKGROUND: Allele specific expression (ASE) has become an important phenotype, being utilized for the detection of cis-regulatory variation, nonsense mediated decay and imprinting in the personal genome, and has been used to both identify disease loci and consider the penetrance of damaging alleles. The detection of ASE using high throughput technologies relies on aligning short-read sequencing data, a process that has inherent biases, and there is still a need to develop fast and accurate methods to detect ASE given the unprecedented growth of sequencing information in big data projects. RESULTS: Here, we present a new approach to normalize RNA sequencing data in order to call ASE events with high precision in a short time-frame. Using simulated datasets we find that our approach dramatically improves reference allele quantification at heterozygous sites versus default mapping methods and also performs well compared to existing techniques for ASE detection, such as filtering methods and mapping to parental genomes, without the need for complex and time consuming manipulation. Finally, by sequencing the exomes and transcriptomes of 96 well-phenotyped individuals of the CARTaGENE cohort, we characterise the levels of ASE across individuals and find a significant association between the proportion of sites undergoing ASE within the genome and smoking. CONCLUSIONS: The correct treatment and analysis of RNA sequencing data is vital to control for mapping biases and detect genuine ASE signals. By normalising RNA sequencing information after mapping, we show that this approach can be used to identify biologically relevant signals in personal genomes. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1238-8) contains supplementary material, which is available to authorized users. BioMed Central 2016-09-13 /pmc/articles/PMC5020486/ /pubmed/27618913 http://dx.doi.org/10.1186/s12859-016-1238-8 Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Hodgkinson, Alan
Grenier, Jean-Christophe
Gbeha, Elias
Awadalla, Philip
A haplotype-based normalization technique for the analysis and detection of allele specific expression
title A haplotype-based normalization technique for the analysis and detection of allele specific expression
title_full A haplotype-based normalization technique for the analysis and detection of allele specific expression
title_fullStr A haplotype-based normalization technique for the analysis and detection of allele specific expression
title_full_unstemmed A haplotype-based normalization technique for the analysis and detection of allele specific expression
title_short A haplotype-based normalization technique for the analysis and detection of allele specific expression
title_sort haplotype-based normalization technique for the analysis and detection of allele specific expression
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5020486/
https://www.ncbi.nlm.nih.gov/pubmed/27618913
http://dx.doi.org/10.1186/s12859-016-1238-8
work_keys_str_mv AT hodgkinsonalan ahaplotypebasednormalizationtechniquefortheanalysisanddetectionofallelespecificexpression
AT grenierjeanchristophe ahaplotypebasednormalizationtechniquefortheanalysisanddetectionofallelespecificexpression
AT gbehaelias ahaplotypebasednormalizationtechniquefortheanalysisanddetectionofallelespecificexpression
AT awadallaphilip ahaplotypebasednormalizationtechniquefortheanalysisanddetectionofallelespecificexpression
AT hodgkinsonalan haplotypebasednormalizationtechniquefortheanalysisanddetectionofallelespecificexpression
AT grenierjeanchristophe haplotypebasednormalizationtechniquefortheanalysisanddetectionofallelespecificexpression
AT gbehaelias haplotypebasednormalizationtechniquefortheanalysisanddetectionofallelespecificexpression
AT awadallaphilip haplotypebasednormalizationtechniquefortheanalysisanddetectionofallelespecificexpression