Cargando…

Improving transcriptome assembly through error correction of high-throughput sequence reads

The study of functional genomics, particularly in non-model organisms, has been dramatically improved over the last few years by the use of transcriptomes and RNAseq. While these studies are potentially extremely powerful, a computationally intensive procedure, the de novo construction of a referenc...

Descripción completa

Detalles Bibliográficos
Autores principales:	MacManes, Matthew D., Eisen, Michael B.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	PeerJ Inc. 2013
Materias:	Bioinformatics
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3728768/ https://www.ncbi.nlm.nih.gov/pubmed/23904992 http://dx.doi.org/10.7717/peerj.113

_version_	1782278908259860480
author	MacManes, Matthew D. Eisen, Michael B.
author_facet	MacManes, Matthew D. Eisen, Michael B.
author_sort	MacManes, Matthew D.
collection	PubMed
description	The study of functional genomics, particularly in non-model organisms, has been dramatically improved over the last few years by the use of transcriptomes and RNAseq. While these studies are potentially extremely powerful, a computationally intensive procedure, the de novo construction of a reference transcriptome must be completed as a prerequisite to further analyses. The accurate reference is critically important as all downstream steps, including estimating transcript abundance are critically dependent on the construction of an accurate reference. Though a substantial amount of research has been done on assembly, only recently have the pre-assembly procedures been studied in detail. Specifically, several stand-alone error correction modules have been reported on and, while they have shown to be effective in reducing errors at the level of sequencing reads, how error correction impacts assembly accuracy is largely unknown. Here, we show via use of a simulated and empiric dataset, that applying error correction to sequencing reads has significant positive effects on assembly accuracy, and should be applied to all datasets. A complete collection of commands which will allow for the production of Reptile corrected reads is available at https://github.com/macmanes/error_correction/tree/master/scripts and as File S1.
format	Online Article Text
id	pubmed-3728768
institution	National Center for Biotechnology Information
language	English
publishDate	2013
publisher	PeerJ Inc.
record_format	MEDLINE/PubMed
spelling	pubmed-37287682013-07-31 Improving transcriptome assembly through error correction of high-throughput sequence reads MacManes, Matthew D. Eisen, Michael B. PeerJ Bioinformatics The study of functional genomics, particularly in non-model organisms, has been dramatically improved over the last few years by the use of transcriptomes and RNAseq. While these studies are potentially extremely powerful, a computationally intensive procedure, the de novo construction of a reference transcriptome must be completed as a prerequisite to further analyses. The accurate reference is critically important as all downstream steps, including estimating transcript abundance are critically dependent on the construction of an accurate reference. Though a substantial amount of research has been done on assembly, only recently have the pre-assembly procedures been studied in detail. Specifically, several stand-alone error correction modules have been reported on and, while they have shown to be effective in reducing errors at the level of sequencing reads, how error correction impacts assembly accuracy is largely unknown. Here, we show via use of a simulated and empiric dataset, that applying error correction to sequencing reads has significant positive effects on assembly accuracy, and should be applied to all datasets. A complete collection of commands which will allow for the production of Reptile corrected reads is available at https://github.com/macmanes/error_correction/tree/master/scripts and as File S1. PeerJ Inc. 2013-07-23 /pmc/articles/PMC3728768/ /pubmed/23904992 http://dx.doi.org/10.7717/peerj.113 Text en © 2013 MacManes and Eisen http://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Bioinformatics MacManes, Matthew D. Eisen, Michael B. Improving transcriptome assembly through error correction of high-throughput sequence reads
title	Improving transcriptome assembly through error correction of high-throughput sequence reads
title_full	Improving transcriptome assembly through error correction of high-throughput sequence reads
title_fullStr	Improving transcriptome assembly through error correction of high-throughput sequence reads
title_full_unstemmed	Improving transcriptome assembly through error correction of high-throughput sequence reads
title_short	Improving transcriptome assembly through error correction of high-throughput sequence reads
title_sort	improving transcriptome assembly through error correction of high-throughput sequence reads
topic	Bioinformatics
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3728768/ https://www.ncbi.nlm.nih.gov/pubmed/23904992 http://dx.doi.org/10.7717/peerj.113
work_keys_str_mv	AT macmanesmatthewd improvingtranscriptomeassemblythrougherrorcorrectionofhighthroughputsequencereads AT eisenmichaelb improvingtranscriptomeassemblythrougherrorcorrectionofhighthroughputsequencereads

Improving transcriptome assembly through error correction of high-throughput sequence reads

Ejemplares similares