Cargando…

ASEQ: fast allele-specific studies from next-generation sequencing data

BACKGROUND: Single base level information from next-generation sequencing (NGS) allows for the quantitative assessment of biological phenomena such as mosaicism or allele-specific features in healthy and diseased cells. Such studies often present with computationally challenging burdens that hinder...

Descripción completa

Detalles Bibliográficos
Autores principales: Romanel, Alessandro, Lago, Sara, Prandi, Davide, Sboner, Andrea, Demichelis, Francesca
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4363342/
https://www.ncbi.nlm.nih.gov/pubmed/25889339
http://dx.doi.org/10.1186/s12920-015-0084-2
_version_ 1782361898791993344
author Romanel, Alessandro
Lago, Sara
Prandi, Davide
Sboner, Andrea
Demichelis, Francesca
author_facet Romanel, Alessandro
Lago, Sara
Prandi, Davide
Sboner, Andrea
Demichelis, Francesca
author_sort Romanel, Alessandro
collection PubMed
description BACKGROUND: Single base level information from next-generation sequencing (NGS) allows for the quantitative assessment of biological phenomena such as mosaicism or allele-specific features in healthy and diseased cells. Such studies often present with computationally challenging burdens that hinder genome-wide investigations across large datasets that are now becoming available through the 1,000 Genomes Project and The Cancer Genome Atlas (TCGA) initiatives. RESULTS: We present ASEQ, a tool to perform gene-level allele-specific expression (ASE) analysis from paired genomic and transcriptomic NGS data without requiring paternal and maternal genome data. ASEQ offers an easy-to-use set of modes that transparently to the user takes full advantage of a built-in fast computational engine. We report its performances on a set of 20 individuals from the 1,000 Genomes Project and show its detection power on imprinted genes. Next we demonstrate high level of ASE calls concordance when comparing it to AlleleSeq and MBASED tools. Finally, using a prostate cancer dataset we report on a higher fraction of ASE genes with respect to healthy individuals and show allele-specific events nominated by ASEQ in genes that are implicated in the disease. CONCLUSIONS: ASEQ can be used to rapidly and reliably screen large NGS datasets for the identification of allele specific features. It can be integrated in any NGS pipeline and runs on computer systems with multiple CPUs, CPUs with multiple cores or across clusters of machines. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12920-015-0084-2) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4363342
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-43633422015-03-19 ASEQ: fast allele-specific studies from next-generation sequencing data Romanel, Alessandro Lago, Sara Prandi, Davide Sboner, Andrea Demichelis, Francesca BMC Med Genomics Software BACKGROUND: Single base level information from next-generation sequencing (NGS) allows for the quantitative assessment of biological phenomena such as mosaicism or allele-specific features in healthy and diseased cells. Such studies often present with computationally challenging burdens that hinder genome-wide investigations across large datasets that are now becoming available through the 1,000 Genomes Project and The Cancer Genome Atlas (TCGA) initiatives. RESULTS: We present ASEQ, a tool to perform gene-level allele-specific expression (ASE) analysis from paired genomic and transcriptomic NGS data without requiring paternal and maternal genome data. ASEQ offers an easy-to-use set of modes that transparently to the user takes full advantage of a built-in fast computational engine. We report its performances on a set of 20 individuals from the 1,000 Genomes Project and show its detection power on imprinted genes. Next we demonstrate high level of ASE calls concordance when comparing it to AlleleSeq and MBASED tools. Finally, using a prostate cancer dataset we report on a higher fraction of ASE genes with respect to healthy individuals and show allele-specific events nominated by ASEQ in genes that are implicated in the disease. CONCLUSIONS: ASEQ can be used to rapidly and reliably screen large NGS datasets for the identification of allele specific features. It can be integrated in any NGS pipeline and runs on computer systems with multiple CPUs, CPUs with multiple cores or across clusters of machines. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12920-015-0084-2) contains supplementary material, which is available to authorized users. BioMed Central 2015-03-01 /pmc/articles/PMC4363342/ /pubmed/25889339 http://dx.doi.org/10.1186/s12920-015-0084-2 Text en © Romanel et al.; licensee BioMed Central. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Romanel, Alessandro
Lago, Sara
Prandi, Davide
Sboner, Andrea
Demichelis, Francesca
ASEQ: fast allele-specific studies from next-generation sequencing data
title ASEQ: fast allele-specific studies from next-generation sequencing data
title_full ASEQ: fast allele-specific studies from next-generation sequencing data
title_fullStr ASEQ: fast allele-specific studies from next-generation sequencing data
title_full_unstemmed ASEQ: fast allele-specific studies from next-generation sequencing data
title_short ASEQ: fast allele-specific studies from next-generation sequencing data
title_sort aseq: fast allele-specific studies from next-generation sequencing data
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4363342/
https://www.ncbi.nlm.nih.gov/pubmed/25889339
http://dx.doi.org/10.1186/s12920-015-0084-2
work_keys_str_mv AT romanelalessandro aseqfastallelespecificstudiesfromnextgenerationsequencingdata
AT lagosara aseqfastallelespecificstudiesfromnextgenerationsequencingdata
AT prandidavide aseqfastallelespecificstudiesfromnextgenerationsequencingdata
AT sbonerandrea aseqfastallelespecificstudiesfromnextgenerationsequencingdata
AT demichelisfrancesca aseqfastallelespecificstudiesfromnextgenerationsequencingdata