Cargando…
Corset: enabling differential gene expression analysis for de novo assembled transcriptomes
Next generation sequencing has made it possible to perform differential gene expression studies in non-model organisms. For these studies, the need for a reference genome is circumvented by performing de novo assembly on the RNA-seq data. However, transcriptome assembly produces a multitude of conti...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4165373/ https://www.ncbi.nlm.nih.gov/pubmed/25063469 http://dx.doi.org/10.1186/s13059-014-0410-6 |
_version_ | 1782335094425387008 |
---|---|
author | Davidson, Nadia M Oshlack, Alicia |
author_facet | Davidson, Nadia M Oshlack, Alicia |
author_sort | Davidson, Nadia M |
collection | PubMed |
description | Next generation sequencing has made it possible to perform differential gene expression studies in non-model organisms. For these studies, the need for a reference genome is circumvented by performing de novo assembly on the RNA-seq data. However, transcriptome assembly produces a multitude of contigs, which must be clustered into genes prior to differential gene expression detection. Here we present Corset, a method that hierarchically clusters contigs using shared reads and expression, then summarizes read counts to clusters, ready for statistical testing. Using a range of metrics, we demonstrate that Corset out-performs alternative methods. Corset is available from https://code.google.com/p/corset-project/. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13059-014-0410-6) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4165373 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-41653732014-09-26 Corset: enabling differential gene expression analysis for de novo assembled transcriptomes Davidson, Nadia M Oshlack, Alicia Genome Biol Method Next generation sequencing has made it possible to perform differential gene expression studies in non-model organisms. For these studies, the need for a reference genome is circumvented by performing de novo assembly on the RNA-seq data. However, transcriptome assembly produces a multitude of contigs, which must be clustered into genes prior to differential gene expression detection. Here we present Corset, a method that hierarchically clusters contigs using shared reads and expression, then summarizes read counts to clusters, ready for statistical testing. Using a range of metrics, we demonstrate that Corset out-performs alternative methods. Corset is available from https://code.google.com/p/corset-project/. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13059-014-0410-6) contains supplementary material, which is available to authorized users. BioMed Central 2014-07-26 2014 /pmc/articles/PMC4165373/ /pubmed/25063469 http://dx.doi.org/10.1186/s13059-014-0410-6 Text en © Davidson and Oshlack; licensee BioMed Central Ltd 2014 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Method Davidson, Nadia M Oshlack, Alicia Corset: enabling differential gene expression analysis for de novo assembled transcriptomes |
title | Corset: enabling differential gene expression analysis for de novo assembled transcriptomes |
title_full | Corset: enabling differential gene expression analysis for de novo assembled transcriptomes |
title_fullStr | Corset: enabling differential gene expression analysis for de novo assembled transcriptomes |
title_full_unstemmed | Corset: enabling differential gene expression analysis for de novo assembled transcriptomes |
title_short | Corset: enabling differential gene expression analysis for de novo assembled transcriptomes |
title_sort | corset: enabling differential gene expression analysis for de novo assembled transcriptomes |
topic | Method |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4165373/ https://www.ncbi.nlm.nih.gov/pubmed/25063469 http://dx.doi.org/10.1186/s13059-014-0410-6 |
work_keys_str_mv | AT davidsonnadiam corsetenablingdifferentialgeneexpressionanalysisfordenovoassembledtranscriptomes AT oshlackalicia corsetenablingdifferentialgeneexpressionanalysisfordenovoassembledtranscriptomes |