Cargando…
Assessing the feasibility of GS FLX Pyrosequencing for sequencing the Atlantic salmon genome
BACKGROUND: With a whole genome duplication event and wealth of biological data, salmonids are excellent model organisms for studying evolutionary processes, fates of duplicated genes and genetic and physiological processes associated with complex behavioral phenotypes. It is surprising therefore, t...
Autores principales: | , , , , , , , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2008
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2532694/ https://www.ncbi.nlm.nih.gov/pubmed/18755037 http://dx.doi.org/10.1186/1471-2164-9-404 |
_version_ | 1782158989148028928 |
---|---|
author | Quinn, Nicole L Levenkova, Natasha Chow, William Bouffard, Pascal Boroevich, Keith A Knight, James R Jarvie, Thomas P Lubieniecki, Krzysztof P Desany, Brian A Koop, Ben F Harkins, Timothy T Davidson, William S |
author_facet | Quinn, Nicole L Levenkova, Natasha Chow, William Bouffard, Pascal Boroevich, Keith A Knight, James R Jarvie, Thomas P Lubieniecki, Krzysztof P Desany, Brian A Koop, Ben F Harkins, Timothy T Davidson, William S |
author_sort | Quinn, Nicole L |
collection | PubMed |
description | BACKGROUND: With a whole genome duplication event and wealth of biological data, salmonids are excellent model organisms for studying evolutionary processes, fates of duplicated genes and genetic and physiological processes associated with complex behavioral phenotypes. It is surprising therefore, that no salmonid genome has been sequenced. Atlantic salmon (Salmo salar) is a good representative salmonid for sequencing given its importance in aquaculture and the genomic resources available. However, the size and complexity of the genome combined with the lack of a sequenced reference genome from a closely related fish makes assembly challenging. Given the cost and time limitations of Sanger sequencing as well as recent improvements to next generation sequencing technologies, we examined the feasibility of using the Genome Sequencer (GS) FLX pyrosequencing system to obtain the sequence of a salmonid genome. Eight pooled BACs belonging to a minimum tiling path covering ~1 Mb of the Atlantic salmon genome were sequenced by GS FLX shotgun and Long Paired End sequencing and compared with a ninth BAC sequenced by Sanger sequencing of a shotgun library. RESULTS: An initial assembly using only GS FLX shotgun sequences (average read length 248.5 bp) with ~30× coverage allowed gene identification, but was incomplete even when 126 Sanger-generated BAC-end sequences (~0.09× coverage) were incorporated. The addition of paired end sequencing reads (additional ~26× coverage) produced a final assembly comprising 175 contigs assembled into four scaffolds with 171 gaps. Sanger sequencing of the ninth BAC (~10.5× coverage) produced nine contigs and two scaffolds. The number of scaffolds produced by the GS FLX assembly was comparable to Sanger-generated sequencing; however, the number of gaps was much higher in the GS FLX assembly. CONCLUSION: These results represent the first use of GS FLX paired end reads for de novo sequence assembly. Our data demonstrated that this improved the GS FLX assemblies; however, with respect to de novo sequencing of complex genomes, the GS FLX technology is limited to gene mining and establishing a set of ordered sequence contigs. Currently, for a salmonid reference sequence, it appears that a substantial portion of sequencing should be done using Sanger technology. |
format | Text |
id | pubmed-2532694 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2008 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-25326942008-09-09 Assessing the feasibility of GS FLX Pyrosequencing for sequencing the Atlantic salmon genome Quinn, Nicole L Levenkova, Natasha Chow, William Bouffard, Pascal Boroevich, Keith A Knight, James R Jarvie, Thomas P Lubieniecki, Krzysztof P Desany, Brian A Koop, Ben F Harkins, Timothy T Davidson, William S BMC Genomics Research Article BACKGROUND: With a whole genome duplication event and wealth of biological data, salmonids are excellent model organisms for studying evolutionary processes, fates of duplicated genes and genetic and physiological processes associated with complex behavioral phenotypes. It is surprising therefore, that no salmonid genome has been sequenced. Atlantic salmon (Salmo salar) is a good representative salmonid for sequencing given its importance in aquaculture and the genomic resources available. However, the size and complexity of the genome combined with the lack of a sequenced reference genome from a closely related fish makes assembly challenging. Given the cost and time limitations of Sanger sequencing as well as recent improvements to next generation sequencing technologies, we examined the feasibility of using the Genome Sequencer (GS) FLX pyrosequencing system to obtain the sequence of a salmonid genome. Eight pooled BACs belonging to a minimum tiling path covering ~1 Mb of the Atlantic salmon genome were sequenced by GS FLX shotgun and Long Paired End sequencing and compared with a ninth BAC sequenced by Sanger sequencing of a shotgun library. RESULTS: An initial assembly using only GS FLX shotgun sequences (average read length 248.5 bp) with ~30× coverage allowed gene identification, but was incomplete even when 126 Sanger-generated BAC-end sequences (~0.09× coverage) were incorporated. The addition of paired end sequencing reads (additional ~26× coverage) produced a final assembly comprising 175 contigs assembled into four scaffolds with 171 gaps. Sanger sequencing of the ninth BAC (~10.5× coverage) produced nine contigs and two scaffolds. The number of scaffolds produced by the GS FLX assembly was comparable to Sanger-generated sequencing; however, the number of gaps was much higher in the GS FLX assembly. CONCLUSION: These results represent the first use of GS FLX paired end reads for de novo sequence assembly. Our data demonstrated that this improved the GS FLX assemblies; however, with respect to de novo sequencing of complex genomes, the GS FLX technology is limited to gene mining and establishing a set of ordered sequence contigs. Currently, for a salmonid reference sequence, it appears that a substantial portion of sequencing should be done using Sanger technology. BioMed Central 2008-08-28 /pmc/articles/PMC2532694/ /pubmed/18755037 http://dx.doi.org/10.1186/1471-2164-9-404 Text en Copyright © 2008 Quinn et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Quinn, Nicole L Levenkova, Natasha Chow, William Bouffard, Pascal Boroevich, Keith A Knight, James R Jarvie, Thomas P Lubieniecki, Krzysztof P Desany, Brian A Koop, Ben F Harkins, Timothy T Davidson, William S Assessing the feasibility of GS FLX Pyrosequencing for sequencing the Atlantic salmon genome |
title | Assessing the feasibility of GS FLX Pyrosequencing for sequencing the Atlantic salmon genome |
title_full | Assessing the feasibility of GS FLX Pyrosequencing for sequencing the Atlantic salmon genome |
title_fullStr | Assessing the feasibility of GS FLX Pyrosequencing for sequencing the Atlantic salmon genome |
title_full_unstemmed | Assessing the feasibility of GS FLX Pyrosequencing for sequencing the Atlantic salmon genome |
title_short | Assessing the feasibility of GS FLX Pyrosequencing for sequencing the Atlantic salmon genome |
title_sort | assessing the feasibility of gs flx pyrosequencing for sequencing the atlantic salmon genome |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2532694/ https://www.ncbi.nlm.nih.gov/pubmed/18755037 http://dx.doi.org/10.1186/1471-2164-9-404 |
work_keys_str_mv | AT quinnnicolel assessingthefeasibilityofgsflxpyrosequencingforsequencingtheatlanticsalmongenome AT levenkovanatasha assessingthefeasibilityofgsflxpyrosequencingforsequencingtheatlanticsalmongenome AT chowwilliam assessingthefeasibilityofgsflxpyrosequencingforsequencingtheatlanticsalmongenome AT bouffardpascal assessingthefeasibilityofgsflxpyrosequencingforsequencingtheatlanticsalmongenome AT boroevichkeitha assessingthefeasibilityofgsflxpyrosequencingforsequencingtheatlanticsalmongenome AT knightjamesr assessingthefeasibilityofgsflxpyrosequencingforsequencingtheatlanticsalmongenome AT jarviethomasp assessingthefeasibilityofgsflxpyrosequencingforsequencingtheatlanticsalmongenome AT lubienieckikrzysztofp assessingthefeasibilityofgsflxpyrosequencingforsequencingtheatlanticsalmongenome AT desanybriana assessingthefeasibilityofgsflxpyrosequencingforsequencingtheatlanticsalmongenome AT koopbenf assessingthefeasibilityofgsflxpyrosequencingforsequencingtheatlanticsalmongenome AT harkinstimothyt assessingthefeasibilityofgsflxpyrosequencingforsequencingtheatlanticsalmongenome AT davidsonwilliams assessingthefeasibilityofgsflxpyrosequencingforsequencingtheatlanticsalmongenome |