Cargando…

Effort required to finish shotgun-generated genome sequences differs significantly among vertebrates

BACKGROUND: The approaches for shotgun-based sequencing of vertebrate genomes are now well-established, and have resulted in the generation of numerous draft whole-genome sequence assemblies. In contrast, the process of refining those assemblies to improve contiguity and increase accuracy (known as...

Descripción completa

Detalles Bibliográficos
Autores principales: Blakesley, Robert W, Hansen, Nancy F, Gupta, Jyoti, McDowell, Jennifer C, Maskeri, Baishali, Barnabas, Beatrice B, Brooks, Shelise Y, Coleman, Holly, Haghighi, Payam, Ho, Shi-Ling, Schandler, Karen, Stantripop, Sirintorn, Vogt, Jennifer L, Thomas, Pamela J, Bouffard, Gerard G, Green, Eric D
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2827409/
https://www.ncbi.nlm.nih.gov/pubmed/20064230
http://dx.doi.org/10.1186/1471-2164-11-21
_version_ 1782177937040080896
author Blakesley, Robert W
Hansen, Nancy F
Gupta, Jyoti
McDowell, Jennifer C
Maskeri, Baishali
Barnabas, Beatrice B
Brooks, Shelise Y
Coleman, Holly
Haghighi, Payam
Ho, Shi-Ling
Schandler, Karen
Stantripop, Sirintorn
Vogt, Jennifer L
Thomas, Pamela J
Bouffard, Gerard G
Green, Eric D
author_facet Blakesley, Robert W
Hansen, Nancy F
Gupta, Jyoti
McDowell, Jennifer C
Maskeri, Baishali
Barnabas, Beatrice B
Brooks, Shelise Y
Coleman, Holly
Haghighi, Payam
Ho, Shi-Ling
Schandler, Karen
Stantripop, Sirintorn
Vogt, Jennifer L
Thomas, Pamela J
Bouffard, Gerard G
Green, Eric D
author_sort Blakesley, Robert W
collection PubMed
description BACKGROUND: The approaches for shotgun-based sequencing of vertebrate genomes are now well-established, and have resulted in the generation of numerous draft whole-genome sequence assemblies. In contrast, the process of refining those assemblies to improve contiguity and increase accuracy (known as 'sequence finishing') remains tedious, labor-intensive, and expensive. As a result, the vast majority of vertebrate genome sequences generated to date remain at a draft stage. RESULTS: To date, our genome sequencing efforts have focused on comparative studies of targeted genomic regions, requiring sequence finishing of large blocks of orthologous sequence (average size 0.5-2 Mb) from various subsets of 75 vertebrates. This experience has provided a unique opportunity to compare the relative effort required to finish shotgun-generated genome sequence assemblies from different species, which we report here. Importantly, we found that the sequence assemblies generated for the same orthologous regions from various vertebrates show substantial variation with respect to misassemblies and, in particular, the frequency and characteristics of sequence gaps. As a consequence, the work required to finish different species' sequences varied greatly. Application of the same standardized methods for finishing provided a novel opportunity to "assay" characteristics of genome sequences among many vertebrate species. It is important to note that many of the problems we have encountered during sequence finishing reflect unique architectural features of a particular vertebrate's genome, which in some cases may have important functional and/or evolutionary implications. Finally, based on our analyses, we have been able to improve our procedures to overcome some of these problems and to increase the overall efficiency of the sequence-finishing process, although significant challenges still remain. CONCLUSION: Our findings have important implications for the eventual finishing of the draft whole-genome sequences that have now been generated for a large number of vertebrates.
format Text
id pubmed-2827409
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28274092010-02-24 Effort required to finish shotgun-generated genome sequences differs significantly among vertebrates Blakesley, Robert W Hansen, Nancy F Gupta, Jyoti McDowell, Jennifer C Maskeri, Baishali Barnabas, Beatrice B Brooks, Shelise Y Coleman, Holly Haghighi, Payam Ho, Shi-Ling Schandler, Karen Stantripop, Sirintorn Vogt, Jennifer L Thomas, Pamela J Bouffard, Gerard G Green, Eric D BMC Genomics Research Article BACKGROUND: The approaches for shotgun-based sequencing of vertebrate genomes are now well-established, and have resulted in the generation of numerous draft whole-genome sequence assemblies. In contrast, the process of refining those assemblies to improve contiguity and increase accuracy (known as 'sequence finishing') remains tedious, labor-intensive, and expensive. As a result, the vast majority of vertebrate genome sequences generated to date remain at a draft stage. RESULTS: To date, our genome sequencing efforts have focused on comparative studies of targeted genomic regions, requiring sequence finishing of large blocks of orthologous sequence (average size 0.5-2 Mb) from various subsets of 75 vertebrates. This experience has provided a unique opportunity to compare the relative effort required to finish shotgun-generated genome sequence assemblies from different species, which we report here. Importantly, we found that the sequence assemblies generated for the same orthologous regions from various vertebrates show substantial variation with respect to misassemblies and, in particular, the frequency and characteristics of sequence gaps. As a consequence, the work required to finish different species' sequences varied greatly. Application of the same standardized methods for finishing provided a novel opportunity to "assay" characteristics of genome sequences among many vertebrate species. It is important to note that many of the problems we have encountered during sequence finishing reflect unique architectural features of a particular vertebrate's genome, which in some cases may have important functional and/or evolutionary implications. Finally, based on our analyses, we have been able to improve our procedures to overcome some of these problems and to increase the overall efficiency of the sequence-finishing process, although significant challenges still remain. CONCLUSION: Our findings have important implications for the eventual finishing of the draft whole-genome sequences that have now been generated for a large number of vertebrates. BioMed Central 2010-01-11 /pmc/articles/PMC2827409/ /pubmed/20064230 http://dx.doi.org/10.1186/1471-2164-11-21 Text en Copyright ©2010 Blakesley et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Blakesley, Robert W
Hansen, Nancy F
Gupta, Jyoti
McDowell, Jennifer C
Maskeri, Baishali
Barnabas, Beatrice B
Brooks, Shelise Y
Coleman, Holly
Haghighi, Payam
Ho, Shi-Ling
Schandler, Karen
Stantripop, Sirintorn
Vogt, Jennifer L
Thomas, Pamela J
Bouffard, Gerard G
Green, Eric D
Effort required to finish shotgun-generated genome sequences differs significantly among vertebrates
title Effort required to finish shotgun-generated genome sequences differs significantly among vertebrates
title_full Effort required to finish shotgun-generated genome sequences differs significantly among vertebrates
title_fullStr Effort required to finish shotgun-generated genome sequences differs significantly among vertebrates
title_full_unstemmed Effort required to finish shotgun-generated genome sequences differs significantly among vertebrates
title_short Effort required to finish shotgun-generated genome sequences differs significantly among vertebrates
title_sort effort required to finish shotgun-generated genome sequences differs significantly among vertebrates
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2827409/
https://www.ncbi.nlm.nih.gov/pubmed/20064230
http://dx.doi.org/10.1186/1471-2164-11-21
work_keys_str_mv AT blakesleyrobertw effortrequiredtofinishshotgungeneratedgenomesequencesdifferssignificantlyamongvertebrates
AT hansennancyf effortrequiredtofinishshotgungeneratedgenomesequencesdifferssignificantlyamongvertebrates
AT guptajyoti effortrequiredtofinishshotgungeneratedgenomesequencesdifferssignificantlyamongvertebrates
AT mcdowelljenniferc effortrequiredtofinishshotgungeneratedgenomesequencesdifferssignificantlyamongvertebrates
AT maskeribaishali effortrequiredtofinishshotgungeneratedgenomesequencesdifferssignificantlyamongvertebrates
AT barnabasbeatriceb effortrequiredtofinishshotgungeneratedgenomesequencesdifferssignificantlyamongvertebrates
AT brooksshelisey effortrequiredtofinishshotgungeneratedgenomesequencesdifferssignificantlyamongvertebrates
AT colemanholly effortrequiredtofinishshotgungeneratedgenomesequencesdifferssignificantlyamongvertebrates
AT haghighipayam effortrequiredtofinishshotgungeneratedgenomesequencesdifferssignificantlyamongvertebrates
AT hoshiling effortrequiredtofinishshotgungeneratedgenomesequencesdifferssignificantlyamongvertebrates
AT schandlerkaren effortrequiredtofinishshotgungeneratedgenomesequencesdifferssignificantlyamongvertebrates
AT stantripopsirintorn effortrequiredtofinishshotgungeneratedgenomesequencesdifferssignificantlyamongvertebrates
AT vogtjenniferl effortrequiredtofinishshotgungeneratedgenomesequencesdifferssignificantlyamongvertebrates
AT thomaspamelaj effortrequiredtofinishshotgungeneratedgenomesequencesdifferssignificantlyamongvertebrates
AT effortrequiredtofinishshotgungeneratedgenomesequencesdifferssignificantlyamongvertebrates
AT bouffardgerardg effortrequiredtofinishshotgungeneratedgenomesequencesdifferssignificantlyamongvertebrates
AT greenericd effortrequiredtofinishshotgungeneratedgenomesequencesdifferssignificantlyamongvertebrates