Cargando…

SuperTranscripts: a data driven reference for analysis and visualisation of transcriptomes

Numerous methods have been developed to analyse RNA sequencing (RNA-seq) data, but most rely on the availability of a reference genome, making them unsuitable for non-model organisms. Here we present superTranscripts, a substitute for a reference genome, where each gene with multiple transcripts is...

Descripción completa

Detalles Bibliográficos
Autores principales: Davidson, Nadia M., Hawkins, Anthony D. K., Oshlack, Alicia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5543425/
https://www.ncbi.nlm.nih.gov/pubmed/28778180
http://dx.doi.org/10.1186/s13059-017-1284-1
_version_ 1783255146183000064
author Davidson, Nadia M.
Hawkins, Anthony D. K.
Oshlack, Alicia
author_facet Davidson, Nadia M.
Hawkins, Anthony D. K.
Oshlack, Alicia
author_sort Davidson, Nadia M.
collection PubMed
description Numerous methods have been developed to analyse RNA sequencing (RNA-seq) data, but most rely on the availability of a reference genome, making them unsuitable for non-model organisms. Here we present superTranscripts, a substitute for a reference genome, where each gene with multiple transcripts is represented by a single sequence. The Lace software is provided to construct superTranscripts from any set of transcripts, including de novo assemblies. We demonstrate how superTranscripts enable visualisation, variant detection and differential isoform detection in non-model organisms. We further use Lace to combine reference and assembled transcriptomes for chicken and recover hundreds of gaps in the reference genome. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13059-017-1284-1) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5543425
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-55434252017-08-07 SuperTranscripts: a data driven reference for analysis and visualisation of transcriptomes Davidson, Nadia M. Hawkins, Anthony D. K. Oshlack, Alicia Genome Biol Method Numerous methods have been developed to analyse RNA sequencing (RNA-seq) data, but most rely on the availability of a reference genome, making them unsuitable for non-model organisms. Here we present superTranscripts, a substitute for a reference genome, where each gene with multiple transcripts is represented by a single sequence. The Lace software is provided to construct superTranscripts from any set of transcripts, including de novo assemblies. We demonstrate how superTranscripts enable visualisation, variant detection and differential isoform detection in non-model organisms. We further use Lace to combine reference and assembled transcriptomes for chicken and recover hundreds of gaps in the reference genome. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13059-017-1284-1) contains supplementary material, which is available to authorized users. BioMed Central 2017-08-04 /pmc/articles/PMC5543425/ /pubmed/28778180 http://dx.doi.org/10.1186/s13059-017-1284-1 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Method
Davidson, Nadia M.
Hawkins, Anthony D. K.
Oshlack, Alicia
SuperTranscripts: a data driven reference for analysis and visualisation of transcriptomes
title SuperTranscripts: a data driven reference for analysis and visualisation of transcriptomes
title_full SuperTranscripts: a data driven reference for analysis and visualisation of transcriptomes
title_fullStr SuperTranscripts: a data driven reference for analysis and visualisation of transcriptomes
title_full_unstemmed SuperTranscripts: a data driven reference for analysis and visualisation of transcriptomes
title_short SuperTranscripts: a data driven reference for analysis and visualisation of transcriptomes
title_sort supertranscripts: a data driven reference for analysis and visualisation of transcriptomes
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5543425/
https://www.ncbi.nlm.nih.gov/pubmed/28778180
http://dx.doi.org/10.1186/s13059-017-1284-1
work_keys_str_mv AT davidsonnadiam supertranscriptsadatadrivenreferenceforanalysisandvisualisationoftranscriptomes
AT hawkinsanthonydk supertranscriptsadatadrivenreferenceforanalysisandvisualisationoftranscriptomes
AT oshlackalicia supertranscriptsadatadrivenreferenceforanalysisandvisualisationoftranscriptomes