Cargando…
Performance comparison of second- and third-generation sequencers using a bacterial genome with two chromosomes
BACKGROUND: The availability of diverse second- and third-generation sequencing technologies enables the rapid determination of the sequences of bacterial genomes. However, identifying the sequencing technology most suitable for producing a finished genome with multiple chromosomes remains a challen...
Autores principales: | , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4159541/ https://www.ncbi.nlm.nih.gov/pubmed/25142801 http://dx.doi.org/10.1186/1471-2164-15-699 |
_version_ | 1782334242140717056 |
---|---|
author | Miyamoto, Mari Motooka, Daisuke Gotoh, Kazuyoshi Imai, Takamasa Yoshitake, Kazutoshi Goto, Naohisa Iida, Tetsuya Yasunaga, Teruo Horii, Toshihiro Arakawa, Kazuharu Kasahara, Masahiro Nakamura, Shota |
author_facet | Miyamoto, Mari Motooka, Daisuke Gotoh, Kazuyoshi Imai, Takamasa Yoshitake, Kazutoshi Goto, Naohisa Iida, Tetsuya Yasunaga, Teruo Horii, Toshihiro Arakawa, Kazuharu Kasahara, Masahiro Nakamura, Shota |
author_sort | Miyamoto, Mari |
collection | PubMed |
description | BACKGROUND: The availability of diverse second- and third-generation sequencing technologies enables the rapid determination of the sequences of bacterial genomes. However, identifying the sequencing technology most suitable for producing a finished genome with multiple chromosomes remains a challenge. We evaluated the abilities of the following three second-generation sequencers: Roche 454 GS Junior (GS Jr), Life Technologies Ion PGM (Ion PGM), and Illumina MiSeq (MiSeq) and a third-generation sequencer, the Pacific Biosciences RS sequencer (PacBio), by sequencing and assembling the genome of Vibrio parahaemolyticus, which consists of a 5-Mb genome comprising two circular chromosomes. RESULTS: We sequenced the genome of V. parahaemolyticus with GS Jr, Ion PGM, MiSeq, and PacBio and performed de novo assembly with several genome assemblers. Although GS Jr generated the longest mean read length of 418 bp among the second-generation sequencers, the maximum contig length of the best assembly from GS Jr was 165 kbp, and the number of contigs was 309. Single runs of Ion PGM and MiSeq produced data of considerably greater sequencing coverage, 279× and 1,927×, respectively. The optimized result for Ion PGM contained 61 contigs assembled from reads of 77× coverage, and the longest contig was 895 kbp in size. Those for MiSeq were 34 contigs, 58× coverage, and 733 kbp, respectively. These results suggest that higher coverage depth is unnecessary for a better assembly result. We observed that multiple rRNA coding regions were fragmented in the assemblies from the second-generation sequencers, whereas PacBio generated two exceptionally long contigs of 3,288,561 and 1,875,537 bps, each of which was from a single chromosome, with 73× coverage and mean read length 3,119 bp, allowing us to determine the absolute positions of all rRNA operons. CONCLUSIONS: PacBio outperformed the other sequencers in terms of the length of contigs and reconstructed the greatest portion of the genome, achieving a genome assembly of “finished grade” because of its long reads. It showed the potential to assemble more complex genomes with multiple chromosomes containing more repetitive sequences. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2164-15-699) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4159541 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-41595412014-09-19 Performance comparison of second- and third-generation sequencers using a bacterial genome with two chromosomes Miyamoto, Mari Motooka, Daisuke Gotoh, Kazuyoshi Imai, Takamasa Yoshitake, Kazutoshi Goto, Naohisa Iida, Tetsuya Yasunaga, Teruo Horii, Toshihiro Arakawa, Kazuharu Kasahara, Masahiro Nakamura, Shota BMC Genomics Research Article BACKGROUND: The availability of diverse second- and third-generation sequencing technologies enables the rapid determination of the sequences of bacterial genomes. However, identifying the sequencing technology most suitable for producing a finished genome with multiple chromosomes remains a challenge. We evaluated the abilities of the following three second-generation sequencers: Roche 454 GS Junior (GS Jr), Life Technologies Ion PGM (Ion PGM), and Illumina MiSeq (MiSeq) and a third-generation sequencer, the Pacific Biosciences RS sequencer (PacBio), by sequencing and assembling the genome of Vibrio parahaemolyticus, which consists of a 5-Mb genome comprising two circular chromosomes. RESULTS: We sequenced the genome of V. parahaemolyticus with GS Jr, Ion PGM, MiSeq, and PacBio and performed de novo assembly with several genome assemblers. Although GS Jr generated the longest mean read length of 418 bp among the second-generation sequencers, the maximum contig length of the best assembly from GS Jr was 165 kbp, and the number of contigs was 309. Single runs of Ion PGM and MiSeq produced data of considerably greater sequencing coverage, 279× and 1,927×, respectively. The optimized result for Ion PGM contained 61 contigs assembled from reads of 77× coverage, and the longest contig was 895 kbp in size. Those for MiSeq were 34 contigs, 58× coverage, and 733 kbp, respectively. These results suggest that higher coverage depth is unnecessary for a better assembly result. We observed that multiple rRNA coding regions were fragmented in the assemblies from the second-generation sequencers, whereas PacBio generated two exceptionally long contigs of 3,288,561 and 1,875,537 bps, each of which was from a single chromosome, with 73× coverage and mean read length 3,119 bp, allowing us to determine the absolute positions of all rRNA operons. CONCLUSIONS: PacBio outperformed the other sequencers in terms of the length of contigs and reconstructed the greatest portion of the genome, achieving a genome assembly of “finished grade” because of its long reads. It showed the potential to assemble more complex genomes with multiple chromosomes containing more repetitive sequences. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2164-15-699) contains supplementary material, which is available to authorized users. BioMed Central 2014-08-21 /pmc/articles/PMC4159541/ /pubmed/25142801 http://dx.doi.org/10.1186/1471-2164-15-699 Text en © Miyamoto et al.; licensee BioMed Central Ltd. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Miyamoto, Mari Motooka, Daisuke Gotoh, Kazuyoshi Imai, Takamasa Yoshitake, Kazutoshi Goto, Naohisa Iida, Tetsuya Yasunaga, Teruo Horii, Toshihiro Arakawa, Kazuharu Kasahara, Masahiro Nakamura, Shota Performance comparison of second- and third-generation sequencers using a bacterial genome with two chromosomes |
title | Performance comparison of second- and third-generation sequencers using a bacterial genome with two chromosomes |
title_full | Performance comparison of second- and third-generation sequencers using a bacterial genome with two chromosomes |
title_fullStr | Performance comparison of second- and third-generation sequencers using a bacterial genome with two chromosomes |
title_full_unstemmed | Performance comparison of second- and third-generation sequencers using a bacterial genome with two chromosomes |
title_short | Performance comparison of second- and third-generation sequencers using a bacterial genome with two chromosomes |
title_sort | performance comparison of second- and third-generation sequencers using a bacterial genome with two chromosomes |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4159541/ https://www.ncbi.nlm.nih.gov/pubmed/25142801 http://dx.doi.org/10.1186/1471-2164-15-699 |
work_keys_str_mv | AT miyamotomari performancecomparisonofsecondandthirdgenerationsequencersusingabacterialgenomewithtwochromosomes AT motookadaisuke performancecomparisonofsecondandthirdgenerationsequencersusingabacterialgenomewithtwochromosomes AT gotohkazuyoshi performancecomparisonofsecondandthirdgenerationsequencersusingabacterialgenomewithtwochromosomes AT imaitakamasa performancecomparisonofsecondandthirdgenerationsequencersusingabacterialgenomewithtwochromosomes AT yoshitakekazutoshi performancecomparisonofsecondandthirdgenerationsequencersusingabacterialgenomewithtwochromosomes AT gotonaohisa performancecomparisonofsecondandthirdgenerationsequencersusingabacterialgenomewithtwochromosomes AT iidatetsuya performancecomparisonofsecondandthirdgenerationsequencersusingabacterialgenomewithtwochromosomes AT yasunagateruo performancecomparisonofsecondandthirdgenerationsequencersusingabacterialgenomewithtwochromosomes AT horiitoshihiro performancecomparisonofsecondandthirdgenerationsequencersusingabacterialgenomewithtwochromosomes AT arakawakazuharu performancecomparisonofsecondandthirdgenerationsequencersusingabacterialgenomewithtwochromosomes AT kasaharamasahiro performancecomparisonofsecondandthirdgenerationsequencersusingabacterialgenomewithtwochromosomes AT nakamurashota performancecomparisonofsecondandthirdgenerationsequencersusingabacterialgenomewithtwochromosomes |