Cargando…
A cost-effective and universal strategy for complete prokaryotic genomic sequencing proposed by computer simulation
BACKGROUND: Pyrosequencing techniques allow scientists to perform prokaryotic genome sequencing to achieve the draft genomic sequences within a few days. However, the assemblies with shotgun sequencing are usually composed of hundreds of contigs. A further multiplex PCR procedure is needed to fill a...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3296665/ https://www.ncbi.nlm.nih.gov/pubmed/22289569 http://dx.doi.org/10.1186/1756-0500-5-80 |
_version_ | 1782225770661281792 |
---|---|
author | Jiang, Jingwei Li, Jun Kwan, Hoi Shan Au, Chun Hang Wan Law, Patrick Tik Li, Lei Kam, Kai Man Lun Ling, Julia Mei Leung, Frederick C |
author_facet | Jiang, Jingwei Li, Jun Kwan, Hoi Shan Au, Chun Hang Wan Law, Patrick Tik Li, Lei Kam, Kai Man Lun Ling, Julia Mei Leung, Frederick C |
author_sort | Jiang, Jingwei |
collection | PubMed |
description | BACKGROUND: Pyrosequencing techniques allow scientists to perform prokaryotic genome sequencing to achieve the draft genomic sequences within a few days. However, the assemblies with shotgun sequencing are usually composed of hundreds of contigs. A further multiplex PCR procedure is needed to fill all the gaps and link contigs into complete chromosomal sequence, which is the basis for prokaryotic comparative genomic studies. In this article, we study various pyrosequencing strategies by simulated assembling from 100 prokaryotic genomes. FINDINGS: Simulation study shows that a single end 454 Jr. run combined with a paired end 454 Jr. run (8 kb library) can produce: 1) ~90% of 100 assemblies with < 10 scaffolds and ~95% of 100 assemblies with < 150 contigs; 2) average contig N50 size is over 331 kb; 3) average single base accuracy is > 99.99%; 4) average false gene duplication rate is < 0.7%; 5) average false gene loss rate is < 0.4%. CONCLUSIONS: A single end 454 Jr. run combined with a paired end 454 Jr. run (8 kb library) is a cost-effective way for prokaryotic whole genome sequencing. This strategy provides solution to produce high quality draft assemblies for most of prokaryotic organisms within days. Due to the small number of assembled scaffolds, the following multiplex PCR procedure (for gap filling) would be easy. As a result, large scale prokaryotic whole genome sequencing projects may be finished within weeks. |
format | Online Article Text |
id | pubmed-3296665 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-32966652012-03-08 A cost-effective and universal strategy for complete prokaryotic genomic sequencing proposed by computer simulation Jiang, Jingwei Li, Jun Kwan, Hoi Shan Au, Chun Hang Wan Law, Patrick Tik Li, Lei Kam, Kai Man Lun Ling, Julia Mei Leung, Frederick C BMC Res Notes Short Report BACKGROUND: Pyrosequencing techniques allow scientists to perform prokaryotic genome sequencing to achieve the draft genomic sequences within a few days. However, the assemblies with shotgun sequencing are usually composed of hundreds of contigs. A further multiplex PCR procedure is needed to fill all the gaps and link contigs into complete chromosomal sequence, which is the basis for prokaryotic comparative genomic studies. In this article, we study various pyrosequencing strategies by simulated assembling from 100 prokaryotic genomes. FINDINGS: Simulation study shows that a single end 454 Jr. run combined with a paired end 454 Jr. run (8 kb library) can produce: 1) ~90% of 100 assemblies with < 10 scaffolds and ~95% of 100 assemblies with < 150 contigs; 2) average contig N50 size is over 331 kb; 3) average single base accuracy is > 99.99%; 4) average false gene duplication rate is < 0.7%; 5) average false gene loss rate is < 0.4%. CONCLUSIONS: A single end 454 Jr. run combined with a paired end 454 Jr. run (8 kb library) is a cost-effective way for prokaryotic whole genome sequencing. This strategy provides solution to produce high quality draft assemblies for most of prokaryotic organisms within days. Due to the small number of assembled scaffolds, the following multiplex PCR procedure (for gap filling) would be easy. As a result, large scale prokaryotic whole genome sequencing projects may be finished within weeks. BioMed Central 2012-01-31 /pmc/articles/PMC3296665/ /pubmed/22289569 http://dx.doi.org/10.1186/1756-0500-5-80 Text en Copyright ©2012 Jiang et al; BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Short Report Jiang, Jingwei Li, Jun Kwan, Hoi Shan Au, Chun Hang Wan Law, Patrick Tik Li, Lei Kam, Kai Man Lun Ling, Julia Mei Leung, Frederick C A cost-effective and universal strategy for complete prokaryotic genomic sequencing proposed by computer simulation |
title | A cost-effective and universal strategy for complete prokaryotic genomic sequencing proposed by computer simulation |
title_full | A cost-effective and universal strategy for complete prokaryotic genomic sequencing proposed by computer simulation |
title_fullStr | A cost-effective and universal strategy for complete prokaryotic genomic sequencing proposed by computer simulation |
title_full_unstemmed | A cost-effective and universal strategy for complete prokaryotic genomic sequencing proposed by computer simulation |
title_short | A cost-effective and universal strategy for complete prokaryotic genomic sequencing proposed by computer simulation |
title_sort | cost-effective and universal strategy for complete prokaryotic genomic sequencing proposed by computer simulation |
topic | Short Report |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3296665/ https://www.ncbi.nlm.nih.gov/pubmed/22289569 http://dx.doi.org/10.1186/1756-0500-5-80 |
work_keys_str_mv | AT jiangjingwei acosteffectiveanduniversalstrategyforcompleteprokaryoticgenomicsequencingproposedbycomputersimulation AT lijun acosteffectiveanduniversalstrategyforcompleteprokaryoticgenomicsequencingproposedbycomputersimulation AT kwanhoishan acosteffectiveanduniversalstrategyforcompleteprokaryoticgenomicsequencingproposedbycomputersimulation AT auchunhang acosteffectiveanduniversalstrategyforcompleteprokaryoticgenomicsequencingproposedbycomputersimulation AT wanlawpatricktik acosteffectiveanduniversalstrategyforcompleteprokaryoticgenomicsequencingproposedbycomputersimulation AT lilei acosteffectiveanduniversalstrategyforcompleteprokaryoticgenomicsequencingproposedbycomputersimulation AT kamkaiman acosteffectiveanduniversalstrategyforcompleteprokaryoticgenomicsequencingproposedbycomputersimulation AT lunlingjuliamei acosteffectiveanduniversalstrategyforcompleteprokaryoticgenomicsequencingproposedbycomputersimulation AT leungfrederickc acosteffectiveanduniversalstrategyforcompleteprokaryoticgenomicsequencingproposedbycomputersimulation AT jiangjingwei costeffectiveanduniversalstrategyforcompleteprokaryoticgenomicsequencingproposedbycomputersimulation AT lijun costeffectiveanduniversalstrategyforcompleteprokaryoticgenomicsequencingproposedbycomputersimulation AT kwanhoishan costeffectiveanduniversalstrategyforcompleteprokaryoticgenomicsequencingproposedbycomputersimulation AT auchunhang costeffectiveanduniversalstrategyforcompleteprokaryoticgenomicsequencingproposedbycomputersimulation AT wanlawpatricktik costeffectiveanduniversalstrategyforcompleteprokaryoticgenomicsequencingproposedbycomputersimulation AT lilei costeffectiveanduniversalstrategyforcompleteprokaryoticgenomicsequencingproposedbycomputersimulation AT kamkaiman costeffectiveanduniversalstrategyforcompleteprokaryoticgenomicsequencingproposedbycomputersimulation AT lunlingjuliamei costeffectiveanduniversalstrategyforcompleteprokaryoticgenomicsequencingproposedbycomputersimulation AT leungfrederickc costeffectiveanduniversalstrategyforcompleteprokaryoticgenomicsequencingproposedbycomputersimulation |