Cargando…

A cost-effective and universal strategy for complete prokaryotic genomic sequencing proposed by computer simulation

BACKGROUND: Pyrosequencing techniques allow scientists to perform prokaryotic genome sequencing to achieve the draft genomic sequences within a few days. However, the assemblies with shotgun sequencing are usually composed of hundreds of contigs. A further multiplex PCR procedure is needed to fill a...

Descripción completa

Detalles Bibliográficos
Autores principales: Jiang, Jingwei, Li, Jun, Kwan, Hoi Shan, Au, Chun Hang, Wan Law, Patrick Tik, Li, Lei, Kam, Kai Man, Lun Ling, Julia Mei, Leung, Frederick C
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3296665/
https://www.ncbi.nlm.nih.gov/pubmed/22289569
http://dx.doi.org/10.1186/1756-0500-5-80
_version_ 1782225770661281792
author Jiang, Jingwei
Li, Jun
Kwan, Hoi Shan
Au, Chun Hang
Wan Law, Patrick Tik
Li, Lei
Kam, Kai Man
Lun Ling, Julia Mei
Leung, Frederick C
author_facet Jiang, Jingwei
Li, Jun
Kwan, Hoi Shan
Au, Chun Hang
Wan Law, Patrick Tik
Li, Lei
Kam, Kai Man
Lun Ling, Julia Mei
Leung, Frederick C
author_sort Jiang, Jingwei
collection PubMed
description BACKGROUND: Pyrosequencing techniques allow scientists to perform prokaryotic genome sequencing to achieve the draft genomic sequences within a few days. However, the assemblies with shotgun sequencing are usually composed of hundreds of contigs. A further multiplex PCR procedure is needed to fill all the gaps and link contigs into complete chromosomal sequence, which is the basis for prokaryotic comparative genomic studies. In this article, we study various pyrosequencing strategies by simulated assembling from 100 prokaryotic genomes. FINDINGS: Simulation study shows that a single end 454 Jr. run combined with a paired end 454 Jr. run (8 kb library) can produce: 1) ~90% of 100 assemblies with < 10 scaffolds and ~95% of 100 assemblies with < 150 contigs; 2) average contig N50 size is over 331 kb; 3) average single base accuracy is > 99.99%; 4) average false gene duplication rate is < 0.7%; 5) average false gene loss rate is < 0.4%. CONCLUSIONS: A single end 454 Jr. run combined with a paired end 454 Jr. run (8 kb library) is a cost-effective way for prokaryotic whole genome sequencing. This strategy provides solution to produce high quality draft assemblies for most of prokaryotic organisms within days. Due to the small number of assembled scaffolds, the following multiplex PCR procedure (for gap filling) would be easy. As a result, large scale prokaryotic whole genome sequencing projects may be finished within weeks.
format Online
Article
Text
id pubmed-3296665
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32966652012-03-08 A cost-effective and universal strategy for complete prokaryotic genomic sequencing proposed by computer simulation Jiang, Jingwei Li, Jun Kwan, Hoi Shan Au, Chun Hang Wan Law, Patrick Tik Li, Lei Kam, Kai Man Lun Ling, Julia Mei Leung, Frederick C BMC Res Notes Short Report BACKGROUND: Pyrosequencing techniques allow scientists to perform prokaryotic genome sequencing to achieve the draft genomic sequences within a few days. However, the assemblies with shotgun sequencing are usually composed of hundreds of contigs. A further multiplex PCR procedure is needed to fill all the gaps and link contigs into complete chromosomal sequence, which is the basis for prokaryotic comparative genomic studies. In this article, we study various pyrosequencing strategies by simulated assembling from 100 prokaryotic genomes. FINDINGS: Simulation study shows that a single end 454 Jr. run combined with a paired end 454 Jr. run (8 kb library) can produce: 1) ~90% of 100 assemblies with < 10 scaffolds and ~95% of 100 assemblies with < 150 contigs; 2) average contig N50 size is over 331 kb; 3) average single base accuracy is > 99.99%; 4) average false gene duplication rate is < 0.7%; 5) average false gene loss rate is < 0.4%. CONCLUSIONS: A single end 454 Jr. run combined with a paired end 454 Jr. run (8 kb library) is a cost-effective way for prokaryotic whole genome sequencing. This strategy provides solution to produce high quality draft assemblies for most of prokaryotic organisms within days. Due to the small number of assembled scaffolds, the following multiplex PCR procedure (for gap filling) would be easy. As a result, large scale prokaryotic whole genome sequencing projects may be finished within weeks. BioMed Central 2012-01-31 /pmc/articles/PMC3296665/ /pubmed/22289569 http://dx.doi.org/10.1186/1756-0500-5-80 Text en Copyright ©2012 Jiang et al; BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Short Report
Jiang, Jingwei
Li, Jun
Kwan, Hoi Shan
Au, Chun Hang
Wan Law, Patrick Tik
Li, Lei
Kam, Kai Man
Lun Ling, Julia Mei
Leung, Frederick C
A cost-effective and universal strategy for complete prokaryotic genomic sequencing proposed by computer simulation
title A cost-effective and universal strategy for complete prokaryotic genomic sequencing proposed by computer simulation
title_full A cost-effective and universal strategy for complete prokaryotic genomic sequencing proposed by computer simulation
title_fullStr A cost-effective and universal strategy for complete prokaryotic genomic sequencing proposed by computer simulation
title_full_unstemmed A cost-effective and universal strategy for complete prokaryotic genomic sequencing proposed by computer simulation
title_short A cost-effective and universal strategy for complete prokaryotic genomic sequencing proposed by computer simulation
title_sort cost-effective and universal strategy for complete prokaryotic genomic sequencing proposed by computer simulation
topic Short Report
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3296665/
https://www.ncbi.nlm.nih.gov/pubmed/22289569
http://dx.doi.org/10.1186/1756-0500-5-80
work_keys_str_mv AT jiangjingwei acosteffectiveanduniversalstrategyforcompleteprokaryoticgenomicsequencingproposedbycomputersimulation
AT lijun acosteffectiveanduniversalstrategyforcompleteprokaryoticgenomicsequencingproposedbycomputersimulation
AT kwanhoishan acosteffectiveanduniversalstrategyforcompleteprokaryoticgenomicsequencingproposedbycomputersimulation
AT auchunhang acosteffectiveanduniversalstrategyforcompleteprokaryoticgenomicsequencingproposedbycomputersimulation
AT wanlawpatricktik acosteffectiveanduniversalstrategyforcompleteprokaryoticgenomicsequencingproposedbycomputersimulation
AT lilei acosteffectiveanduniversalstrategyforcompleteprokaryoticgenomicsequencingproposedbycomputersimulation
AT kamkaiman acosteffectiveanduniversalstrategyforcompleteprokaryoticgenomicsequencingproposedbycomputersimulation
AT lunlingjuliamei acosteffectiveanduniversalstrategyforcompleteprokaryoticgenomicsequencingproposedbycomputersimulation
AT leungfrederickc acosteffectiveanduniversalstrategyforcompleteprokaryoticgenomicsequencingproposedbycomputersimulation
AT jiangjingwei costeffectiveanduniversalstrategyforcompleteprokaryoticgenomicsequencingproposedbycomputersimulation
AT lijun costeffectiveanduniversalstrategyforcompleteprokaryoticgenomicsequencingproposedbycomputersimulation
AT kwanhoishan costeffectiveanduniversalstrategyforcompleteprokaryoticgenomicsequencingproposedbycomputersimulation
AT auchunhang costeffectiveanduniversalstrategyforcompleteprokaryoticgenomicsequencingproposedbycomputersimulation
AT wanlawpatricktik costeffectiveanduniversalstrategyforcompleteprokaryoticgenomicsequencingproposedbycomputersimulation
AT lilei costeffectiveanduniversalstrategyforcompleteprokaryoticgenomicsequencingproposedbycomputersimulation
AT kamkaiman costeffectiveanduniversalstrategyforcompleteprokaryoticgenomicsequencingproposedbycomputersimulation
AT lunlingjuliamei costeffectiveanduniversalstrategyforcompleteprokaryoticgenomicsequencingproposedbycomputersimulation
AT leungfrederickc costeffectiveanduniversalstrategyforcompleteprokaryoticgenomicsequencingproposedbycomputersimulation