Cargando…

Gene-Boosted Assembly of a Novel Bacterial Genome from Very Short Reads

Recent improvements in technology have made DNA sequencing dramatically faster and more efficient than ever before. The new technologies produce highly accurate sequences, but one drawback is that the most efficient technology produces the shortest read lengths. Short-read sequencing has been applie...

Descripción completa

Detalles Bibliográficos
Autores principales: Salzberg, Steven L., Sommer, Daniel D., Puiu, Daniela, Lee, Vincent T.
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2529408/
https://www.ncbi.nlm.nih.gov/pubmed/18818729
http://dx.doi.org/10.1371/journal.pcbi.1000186
_version_ 1782158928034922496
author Salzberg, Steven L.
Sommer, Daniel D.
Puiu, Daniela
Lee, Vincent T.
author_facet Salzberg, Steven L.
Sommer, Daniel D.
Puiu, Daniela
Lee, Vincent T.
author_sort Salzberg, Steven L.
collection PubMed
description Recent improvements in technology have made DNA sequencing dramatically faster and more efficient than ever before. The new technologies produce highly accurate sequences, but one drawback is that the most efficient technology produces the shortest read lengths. Short-read sequencing has been applied successfully to resequence the human genome and those of other species but not to whole-genome sequencing of novel organisms. Here we describe the sequencing and assembly of a novel clinical isolate of Pseudomonas aeruginosa, strain PAb1, using very short read technology. From 8,627,900 reads, each 33 nucleotides in length, we assembled the genome into one scaffold of 76 ordered contiguous sequences containing 6,290,005 nucleotides, including one contig spanning 512,638 nucleotides, plus an additional 436 unordered contigs containing 416,897 nucleotides. Our method includes a novel gene-boosting algorithm that uses amino acid sequences from predicted proteins to build a better assembly. This study demonstrates the feasibility of very short read sequencing for the sequencing of bacterial genomes, particularly those for which a related species has been sequenced previously, and expands the potential application of this new technology to most known prokaryotic species.
format Text
id pubmed-2529408
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-25294082008-09-26 Gene-Boosted Assembly of a Novel Bacterial Genome from Very Short Reads Salzberg, Steven L. Sommer, Daniel D. Puiu, Daniela Lee, Vincent T. PLoS Comput Biol Research Article Recent improvements in technology have made DNA sequencing dramatically faster and more efficient than ever before. The new technologies produce highly accurate sequences, but one drawback is that the most efficient technology produces the shortest read lengths. Short-read sequencing has been applied successfully to resequence the human genome and those of other species but not to whole-genome sequencing of novel organisms. Here we describe the sequencing and assembly of a novel clinical isolate of Pseudomonas aeruginosa, strain PAb1, using very short read technology. From 8,627,900 reads, each 33 nucleotides in length, we assembled the genome into one scaffold of 76 ordered contiguous sequences containing 6,290,005 nucleotides, including one contig spanning 512,638 nucleotides, plus an additional 436 unordered contigs containing 416,897 nucleotides. Our method includes a novel gene-boosting algorithm that uses amino acid sequences from predicted proteins to build a better assembly. This study demonstrates the feasibility of very short read sequencing for the sequencing of bacterial genomes, particularly those for which a related species has been sequenced previously, and expands the potential application of this new technology to most known prokaryotic species. Public Library of Science 2008-09-26 /pmc/articles/PMC2529408/ /pubmed/18818729 http://dx.doi.org/10.1371/journal.pcbi.1000186 Text en Salzberg et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Salzberg, Steven L.
Sommer, Daniel D.
Puiu, Daniela
Lee, Vincent T.
Gene-Boosted Assembly of a Novel Bacterial Genome from Very Short Reads
title Gene-Boosted Assembly of a Novel Bacterial Genome from Very Short Reads
title_full Gene-Boosted Assembly of a Novel Bacterial Genome from Very Short Reads
title_fullStr Gene-Boosted Assembly of a Novel Bacterial Genome from Very Short Reads
title_full_unstemmed Gene-Boosted Assembly of a Novel Bacterial Genome from Very Short Reads
title_short Gene-Boosted Assembly of a Novel Bacterial Genome from Very Short Reads
title_sort gene-boosted assembly of a novel bacterial genome from very short reads
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2529408/
https://www.ncbi.nlm.nih.gov/pubmed/18818729
http://dx.doi.org/10.1371/journal.pcbi.1000186
work_keys_str_mv AT salzbergstevenl geneboostedassemblyofanovelbacterialgenomefromveryshortreads
AT sommerdanield geneboostedassemblyofanovelbacterialgenomefromveryshortreads
AT puiudaniela geneboostedassemblyofanovelbacterialgenomefromveryshortreads
AT leevincentt geneboostedassemblyofanovelbacterialgenomefromveryshortreads