
An improved genome reference for the African cichlid, Metriaclima zebra

BACKGROUND: Problems associated with using draft genome assemblies are well documented and have become more pronounced with the use of short read data for de novo genome assembly. We set out to improve the draft genome assembly of the African cichlid fish, Metriaclima zebra, using a set of Pacific B...


Bibliographic Details
Main Authors: Conte, Matthew A., Kocher, Thomas D.
Format: Online Article Text
Language: English
Published: BioMed Central 2015
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4580222/
https://www.ncbi.nlm.nih.gov/pubmed/26394688
http://dx.doi.org/10.1186/s12864-015-1930-5
author Conte, Matthew A.
Kocher, Thomas D.
collection PubMed
description BACKGROUND: Problems associated with using draft genome assemblies are well documented and have become more pronounced with the use of short-read data for de novo genome assembly. We set out to improve the draft genome assembly of the African cichlid fish, Metriaclima zebra, using a set of Pacific Biosciences SMRT sequencing reads corresponding to 16.5× coverage of the genome. Here we characterize the improvements that these long reads allowed us to make to the state-of-the-art draft genome previously assembled from short-read data. RESULTS: Our new assembly closed 68% of the existing gaps and added 90.6 Mbp of new non-gap sequence to the existing draft assembly of M. zebra. Comparison of the new assembly to the sequence of several bacterial artificial chromosome clones confirmed the accuracy of the new assembly. The closure of sequence gaps revealed thousands of new exons, allowing significant improvement in gene models. We corrected one known misassembly, and identified and fixed other likely misassemblies. 63.5 Mbp (70%) of the new sequence was classified as repetitive, and the new sequence allowed for the assembly of many more transposable elements. CONCLUSIONS: Our improvements to the M. zebra draft genome suggest that a reasonable investment in long reads could greatly improve many comparable vertebrate draft genome assemblies. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-1930-5) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4580222
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-4580222 2015-09-24 An improved genome reference for the African cichlid, Metriaclima zebra Conte, Matthew A. Kocher, Thomas D. BMC Genomics Research Article BioMed Central 2015-09-22 /pmc/articles/PMC4580222/ /pubmed/26394688 http://dx.doi.org/10.1186/s12864-015-1930-5 Text en © Conte and Kocher. 2015 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
title An improved genome reference for the African cichlid, Metriaclima zebra
topic Research Article