Cargando…

Performance comparison of second- and third-generation sequencers using a bacterial genome with two chromosomes

BACKGROUND: The availability of diverse second- and third-generation sequencing technologies enables the rapid determination of the sequences of bacterial genomes. However, identifying the sequencing technology most suitable for producing a finished genome with multiple chromosomes remains a challen...

Descripción completa

Detalles Bibliográficos
Autores principales: Miyamoto, Mari, Motooka, Daisuke, Gotoh, Kazuyoshi, Imai, Takamasa, Yoshitake, Kazutoshi, Goto, Naohisa, Iida, Tetsuya, Yasunaga, Teruo, Horii, Toshihiro, Arakawa, Kazuharu, Kasahara, Masahiro, Nakamura, Shota
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4159541/
https://www.ncbi.nlm.nih.gov/pubmed/25142801
http://dx.doi.org/10.1186/1471-2164-15-699
_version_ 1782334242140717056
author Miyamoto, Mari
Motooka, Daisuke
Gotoh, Kazuyoshi
Imai, Takamasa
Yoshitake, Kazutoshi
Goto, Naohisa
Iida, Tetsuya
Yasunaga, Teruo
Horii, Toshihiro
Arakawa, Kazuharu
Kasahara, Masahiro
Nakamura, Shota
author_facet Miyamoto, Mari
Motooka, Daisuke
Gotoh, Kazuyoshi
Imai, Takamasa
Yoshitake, Kazutoshi
Goto, Naohisa
Iida, Tetsuya
Yasunaga, Teruo
Horii, Toshihiro
Arakawa, Kazuharu
Kasahara, Masahiro
Nakamura, Shota
author_sort Miyamoto, Mari
collection PubMed
description BACKGROUND: The availability of diverse second- and third-generation sequencing technologies enables the rapid determination of the sequences of bacterial genomes. However, identifying the sequencing technology most suitable for producing a finished genome with multiple chromosomes remains a challenge. We evaluated the abilities of the following three second-generation sequencers: Roche 454 GS Junior (GS Jr), Life Technologies Ion PGM (Ion PGM), and Illumina MiSeq (MiSeq) and a third-generation sequencer, the Pacific Biosciences RS sequencer (PacBio), by sequencing and assembling the genome of Vibrio parahaemolyticus, which consists of a 5-Mb genome comprising two circular chromosomes. RESULTS: We sequenced the genome of V. parahaemolyticus with GS Jr, Ion PGM, MiSeq, and PacBio and performed de novo assembly with several genome assemblers. Although GS Jr generated the longest mean read length of 418 bp among the second-generation sequencers, the maximum contig length of the best assembly from GS Jr was 165 kbp, and the number of contigs was 309. Single runs of Ion PGM and MiSeq produced data of considerably greater sequencing coverage, 279× and 1,927×, respectively. The optimized result for Ion PGM contained 61 contigs assembled from reads of 77× coverage, and the longest contig was 895 kbp in size. Those for MiSeq were 34 contigs, 58× coverage, and 733 kbp, respectively. These results suggest that higher coverage depth is unnecessary for a better assembly result. We observed that multiple rRNA coding regions were fragmented in the assemblies from the second-generation sequencers, whereas PacBio generated two exceptionally long contigs of 3,288,561 and 1,875,537 bps, each of which was from a single chromosome, with 73× coverage and mean read length 3,119 bp, allowing us to determine the absolute positions of all rRNA operons. CONCLUSIONS: PacBio outperformed the other sequencers in terms of the length of contigs and reconstructed the greatest portion of the genome, achieving a genome assembly of “finished grade” because of its long reads. It showed the potential to assemble more complex genomes with multiple chromosomes containing more repetitive sequences. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2164-15-699) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4159541
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-41595412014-09-19 Performance comparison of second- and third-generation sequencers using a bacterial genome with two chromosomes Miyamoto, Mari Motooka, Daisuke Gotoh, Kazuyoshi Imai, Takamasa Yoshitake, Kazutoshi Goto, Naohisa Iida, Tetsuya Yasunaga, Teruo Horii, Toshihiro Arakawa, Kazuharu Kasahara, Masahiro Nakamura, Shota BMC Genomics Research Article BACKGROUND: The availability of diverse second- and third-generation sequencing technologies enables the rapid determination of the sequences of bacterial genomes. However, identifying the sequencing technology most suitable for producing a finished genome with multiple chromosomes remains a challenge. We evaluated the abilities of the following three second-generation sequencers: Roche 454 GS Junior (GS Jr), Life Technologies Ion PGM (Ion PGM), and Illumina MiSeq (MiSeq) and a third-generation sequencer, the Pacific Biosciences RS sequencer (PacBio), by sequencing and assembling the genome of Vibrio parahaemolyticus, which consists of a 5-Mb genome comprising two circular chromosomes. RESULTS: We sequenced the genome of V. parahaemolyticus with GS Jr, Ion PGM, MiSeq, and PacBio and performed de novo assembly with several genome assemblers. Although GS Jr generated the longest mean read length of 418 bp among the second-generation sequencers, the maximum contig length of the best assembly from GS Jr was 165 kbp, and the number of contigs was 309. Single runs of Ion PGM and MiSeq produced data of considerably greater sequencing coverage, 279× and 1,927×, respectively. The optimized result for Ion PGM contained 61 contigs assembled from reads of 77× coverage, and the longest contig was 895 kbp in size. Those for MiSeq were 34 contigs, 58× coverage, and 733 kbp, respectively. These results suggest that higher coverage depth is unnecessary for a better assembly result. We observed that multiple rRNA coding regions were fragmented in the assemblies from the second-generation sequencers, whereas PacBio generated two exceptionally long contigs of 3,288,561 and 1,875,537 bps, each of which was from a single chromosome, with 73× coverage and mean read length 3,119 bp, allowing us to determine the absolute positions of all rRNA operons. CONCLUSIONS: PacBio outperformed the other sequencers in terms of the length of contigs and reconstructed the greatest portion of the genome, achieving a genome assembly of “finished grade” because of its long reads. It showed the potential to assemble more complex genomes with multiple chromosomes containing more repetitive sequences. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2164-15-699) contains supplementary material, which is available to authorized users. BioMed Central 2014-08-21 /pmc/articles/PMC4159541/ /pubmed/25142801 http://dx.doi.org/10.1186/1471-2164-15-699 Text en © Miyamoto et al.; licensee BioMed Central Ltd. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Miyamoto, Mari
Motooka, Daisuke
Gotoh, Kazuyoshi
Imai, Takamasa
Yoshitake, Kazutoshi
Goto, Naohisa
Iida, Tetsuya
Yasunaga, Teruo
Horii, Toshihiro
Arakawa, Kazuharu
Kasahara, Masahiro
Nakamura, Shota
Performance comparison of second- and third-generation sequencers using a bacterial genome with two chromosomes
title Performance comparison of second- and third-generation sequencers using a bacterial genome with two chromosomes
title_full Performance comparison of second- and third-generation sequencers using a bacterial genome with two chromosomes
title_fullStr Performance comparison of second- and third-generation sequencers using a bacterial genome with two chromosomes
title_full_unstemmed Performance comparison of second- and third-generation sequencers using a bacterial genome with two chromosomes
title_short Performance comparison of second- and third-generation sequencers using a bacterial genome with two chromosomes
title_sort performance comparison of second- and third-generation sequencers using a bacterial genome with two chromosomes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4159541/
https://www.ncbi.nlm.nih.gov/pubmed/25142801
http://dx.doi.org/10.1186/1471-2164-15-699
work_keys_str_mv AT miyamotomari performancecomparisonofsecondandthirdgenerationsequencersusingabacterialgenomewithtwochromosomes
AT motookadaisuke performancecomparisonofsecondandthirdgenerationsequencersusingabacterialgenomewithtwochromosomes
AT gotohkazuyoshi performancecomparisonofsecondandthirdgenerationsequencersusingabacterialgenomewithtwochromosomes
AT imaitakamasa performancecomparisonofsecondandthirdgenerationsequencersusingabacterialgenomewithtwochromosomes
AT yoshitakekazutoshi performancecomparisonofsecondandthirdgenerationsequencersusingabacterialgenomewithtwochromosomes
AT gotonaohisa performancecomparisonofsecondandthirdgenerationsequencersusingabacterialgenomewithtwochromosomes
AT iidatetsuya performancecomparisonofsecondandthirdgenerationsequencersusingabacterialgenomewithtwochromosomes
AT yasunagateruo performancecomparisonofsecondandthirdgenerationsequencersusingabacterialgenomewithtwochromosomes
AT horiitoshihiro performancecomparisonofsecondandthirdgenerationsequencersusingabacterialgenomewithtwochromosomes
AT arakawakazuharu performancecomparisonofsecondandthirdgenerationsequencersusingabacterialgenomewithtwochromosomes
AT kasaharamasahiro performancecomparisonofsecondandthirdgenerationsequencersusingabacterialgenomewithtwochromosomes
AT nakamurashota performancecomparisonofsecondandthirdgenerationsequencersusingabacterialgenomewithtwochromosomes