Cargando…

Addressing challenges in the production and analysis of illumina sequencing data

Advances in DNA sequencing technologies have made it possible to generate large amounts of sequence data very rapidly and at substantially lower cost than capillary sequencing. These new technologies have specific characteristics and limitations that require either consideration during project desig...

Descripción completa

Detalles Bibliográficos
Autores principales:	Kircher, Martin, Heyn, Patricia, Kelso, Janet
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2011
Materias:	Correspondence
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3163567/ https://www.ncbi.nlm.nih.gov/pubmed/21801405 http://dx.doi.org/10.1186/1471-2164-12-382

_version_	1782210963137626112
author	Kircher, Martin Heyn, Patricia Kelso, Janet
author_facet	Kircher, Martin Heyn, Patricia Kelso, Janet
author_sort	Kircher, Martin
collection	PubMed
description	Advances in DNA sequencing technologies have made it possible to generate large amounts of sequence data very rapidly and at substantially lower cost than capillary sequencing. These new technologies have specific characteristics and limitations that require either consideration during project design, or which must be addressed during data analysis. Specialist skills, both at the laboratory and the computational stages of project design and analysis, are crucial to the generation of high quality data from these new platforms. The Illumina sequencers (including the Genome Analyzers I/II/IIe/IIx and the new HiScan and HiSeq) represent a widely used platform providing parallel readout of several hundred million immobilized sequences using fluorescent-dye reversible-terminator chemistry. Sequencing library quality, sample handling, instrument settings and sequencing chemistry have a strong impact on sequencing run quality. The presence of adapter chimeras and adapter sequences at the end of short-insert molecules, as well as increased error rates and short read lengths complicate many computational analyses. We discuss here some of the factors that influence the frequency and severity of these problems and provide solutions for circumventing these. Further, we present a set of general principles for good analysis practice that enable problems with sequencing runs to be identified and dealt with.
format	Online Article Text
id	pubmed-3163567
institution	National Center for Biotechnology Information
language	English
publishDate	2011
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-31635672011-08-30 Addressing challenges in the production and analysis of illumina sequencing data Kircher, Martin Heyn, Patricia Kelso, Janet BMC Genomics Correspondence Advances in DNA sequencing technologies have made it possible to generate large amounts of sequence data very rapidly and at substantially lower cost than capillary sequencing. These new technologies have specific characteristics and limitations that require either consideration during project design, or which must be addressed during data analysis. Specialist skills, both at the laboratory and the computational stages of project design and analysis, are crucial to the generation of high quality data from these new platforms. The Illumina sequencers (including the Genome Analyzers I/II/IIe/IIx and the new HiScan and HiSeq) represent a widely used platform providing parallel readout of several hundred million immobilized sequences using fluorescent-dye reversible-terminator chemistry. Sequencing library quality, sample handling, instrument settings and sequencing chemistry have a strong impact on sequencing run quality. The presence of adapter chimeras and adapter sequences at the end of short-insert molecules, as well as increased error rates and short read lengths complicate many computational analyses. We discuss here some of the factors that influence the frequency and severity of these problems and provide solutions for circumventing these. Further, we present a set of general principles for good analysis practice that enable problems with sequencing runs to be identified and dealt with. BioMed Central 2011-07-29 /pmc/articles/PMC3163567/ /pubmed/21801405 http://dx.doi.org/10.1186/1471-2164-12-382 Text en Copyright ©2011 Kircher et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Correspondence Kircher, Martin Heyn, Patricia Kelso, Janet Addressing challenges in the production and analysis of illumina sequencing data
title	Addressing challenges in the production and analysis of illumina sequencing data
title_full	Addressing challenges in the production and analysis of illumina sequencing data
title_fullStr	Addressing challenges in the production and analysis of illumina sequencing data
title_full_unstemmed	Addressing challenges in the production and analysis of illumina sequencing data
title_short	Addressing challenges in the production and analysis of illumina sequencing data
title_sort	addressing challenges in the production and analysis of illumina sequencing data
topic	Correspondence
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3163567/ https://www.ncbi.nlm.nih.gov/pubmed/21801405 http://dx.doi.org/10.1186/1471-2164-12-382
work_keys_str_mv	AT kirchermartin addressingchallengesintheproductionandanalysisofilluminasequencingdata AT heynpatricia addressingchallengesintheproductionandanalysisofilluminasequencingdata AT kelsojanet addressingchallengesintheproductionandanalysisofilluminasequencingdata

Addressing challenges in the production and analysis of illumina sequencing data

Ejemplares similares