Cargando…

An improved genome assembly uncovers prolific tandem repeats in Atlantic cod

BACKGROUND: The first Atlantic cod (Gadus morhua) genome assembly published in 2011 was one of the early genome assemblies exclusively based on high-throughput 454 pyrosequencing. Since then, rapid advances in sequencing technologies have led to a multitude of assemblies generated for complex genome...

Descripción completa

Detalles Bibliográficos
Autores principales: Tørresen, Ole K., Star, Bastiaan, Jentoft, Sissel, Reinar, William B., Grove, Harald, Miller, Jason R., Walenz, Brian P., Knight, James, Ekholm, Jenny M., Peluso, Paul, Edvardsen, Rolf B., Tooming-Klunderud, Ave, Skage, Morten, Lien, Sigbjørn, Jakobsen, Kjetill S., Nederbragt, Alexander J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5241972/
https://www.ncbi.nlm.nih.gov/pubmed/28100185
http://dx.doi.org/10.1186/s12864-016-3448-x
_version_ 1782496273148936192
author Tørresen, Ole K.
Star, Bastiaan
Jentoft, Sissel
Reinar, William B.
Grove, Harald
Miller, Jason R.
Walenz, Brian P.
Knight, James
Ekholm, Jenny M.
Peluso, Paul
Edvardsen, Rolf B.
Tooming-Klunderud, Ave
Skage, Morten
Lien, Sigbjørn
Jakobsen, Kjetill S.
Nederbragt, Alexander J.
author_facet Tørresen, Ole K.
Star, Bastiaan
Jentoft, Sissel
Reinar, William B.
Grove, Harald
Miller, Jason R.
Walenz, Brian P.
Knight, James
Ekholm, Jenny M.
Peluso, Paul
Edvardsen, Rolf B.
Tooming-Klunderud, Ave
Skage, Morten
Lien, Sigbjørn
Jakobsen, Kjetill S.
Nederbragt, Alexander J.
author_sort Tørresen, Ole K.
collection PubMed
description BACKGROUND: The first Atlantic cod (Gadus morhua) genome assembly published in 2011 was one of the early genome assemblies exclusively based on high-throughput 454 pyrosequencing. Since then, rapid advances in sequencing technologies have led to a multitude of assemblies generated for complex genomes, although many of these are of a fragmented nature with a significant fraction of bases in gaps. The development of long-read sequencing and improved software now enable the generation of more contiguous genome assemblies. RESULTS: By combining data from Illumina, 454 and the longer PacBio sequencing technologies, as well as integrating the results of multiple assembly programs, we have created a substantially improved version of the Atlantic cod genome assembly. The sequence contiguity of this assembly is increased fifty-fold and the proportion of gap-bases has been reduced fifteen-fold. Compared to other vertebrates, the assembly contains an unusual high density of tandem repeats (TRs). Indeed, retrospective analyses reveal that gaps in the first genome assembly were largely associated with these TRs. We show that 21% of the TRs across the assembly, 19% in the promoter regions and 12% in the coding sequences are heterozygous in the sequenced individual. CONCLUSIONS: The inclusion of PacBio reads combined with the use of multiple assembly programs drastically improved the Atlantic cod genome assembly by successfully resolving long TRs. The high frequency of heterozygous TRs within or in the vicinity of genes in the genome indicate a considerable standing genomic variation in Atlantic cod populations, which is likely of evolutionary importance. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-3448-x) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5241972
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-52419722017-01-23 An improved genome assembly uncovers prolific tandem repeats in Atlantic cod Tørresen, Ole K. Star, Bastiaan Jentoft, Sissel Reinar, William B. Grove, Harald Miller, Jason R. Walenz, Brian P. Knight, James Ekholm, Jenny M. Peluso, Paul Edvardsen, Rolf B. Tooming-Klunderud, Ave Skage, Morten Lien, Sigbjørn Jakobsen, Kjetill S. Nederbragt, Alexander J. BMC Genomics Research Article BACKGROUND: The first Atlantic cod (Gadus morhua) genome assembly published in 2011 was one of the early genome assemblies exclusively based on high-throughput 454 pyrosequencing. Since then, rapid advances in sequencing technologies have led to a multitude of assemblies generated for complex genomes, although many of these are of a fragmented nature with a significant fraction of bases in gaps. The development of long-read sequencing and improved software now enable the generation of more contiguous genome assemblies. RESULTS: By combining data from Illumina, 454 and the longer PacBio sequencing technologies, as well as integrating the results of multiple assembly programs, we have created a substantially improved version of the Atlantic cod genome assembly. The sequence contiguity of this assembly is increased fifty-fold and the proportion of gap-bases has been reduced fifteen-fold. Compared to other vertebrates, the assembly contains an unusual high density of tandem repeats (TRs). Indeed, retrospective analyses reveal that gaps in the first genome assembly were largely associated with these TRs. We show that 21% of the TRs across the assembly, 19% in the promoter regions and 12% in the coding sequences are heterozygous in the sequenced individual. CONCLUSIONS: The inclusion of PacBio reads combined with the use of multiple assembly programs drastically improved the Atlantic cod genome assembly by successfully resolving long TRs. The high frequency of heterozygous TRs within or in the vicinity of genes in the genome indicate a considerable standing genomic variation in Atlantic cod populations, which is likely of evolutionary importance. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-3448-x) contains supplementary material, which is available to authorized users. BioMed Central 2017-01-18 /pmc/articles/PMC5241972/ /pubmed/28100185 http://dx.doi.org/10.1186/s12864-016-3448-x Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Tørresen, Ole K.
Star, Bastiaan
Jentoft, Sissel
Reinar, William B.
Grove, Harald
Miller, Jason R.
Walenz, Brian P.
Knight, James
Ekholm, Jenny M.
Peluso, Paul
Edvardsen, Rolf B.
Tooming-Klunderud, Ave
Skage, Morten
Lien, Sigbjørn
Jakobsen, Kjetill S.
Nederbragt, Alexander J.
An improved genome assembly uncovers prolific tandem repeats in Atlantic cod
title An improved genome assembly uncovers prolific tandem repeats in Atlantic cod
title_full An improved genome assembly uncovers prolific tandem repeats in Atlantic cod
title_fullStr An improved genome assembly uncovers prolific tandem repeats in Atlantic cod
title_full_unstemmed An improved genome assembly uncovers prolific tandem repeats in Atlantic cod
title_short An improved genome assembly uncovers prolific tandem repeats in Atlantic cod
title_sort improved genome assembly uncovers prolific tandem repeats in atlantic cod
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5241972/
https://www.ncbi.nlm.nih.gov/pubmed/28100185
http://dx.doi.org/10.1186/s12864-016-3448-x
work_keys_str_mv AT tørresenolek animprovedgenomeassemblyuncoversprolifictandemrepeatsinatlanticcod
AT starbastiaan animprovedgenomeassemblyuncoversprolifictandemrepeatsinatlanticcod
AT jentoftsissel animprovedgenomeassemblyuncoversprolifictandemrepeatsinatlanticcod
AT reinarwilliamb animprovedgenomeassemblyuncoversprolifictandemrepeatsinatlanticcod
AT groveharald animprovedgenomeassemblyuncoversprolifictandemrepeatsinatlanticcod
AT millerjasonr animprovedgenomeassemblyuncoversprolifictandemrepeatsinatlanticcod
AT walenzbrianp animprovedgenomeassemblyuncoversprolifictandemrepeatsinatlanticcod
AT knightjames animprovedgenomeassemblyuncoversprolifictandemrepeatsinatlanticcod
AT ekholmjennym animprovedgenomeassemblyuncoversprolifictandemrepeatsinatlanticcod
AT pelusopaul animprovedgenomeassemblyuncoversprolifictandemrepeatsinatlanticcod
AT edvardsenrolfb animprovedgenomeassemblyuncoversprolifictandemrepeatsinatlanticcod
AT toomingklunderudave animprovedgenomeassemblyuncoversprolifictandemrepeatsinatlanticcod
AT skagemorten animprovedgenomeassemblyuncoversprolifictandemrepeatsinatlanticcod
AT liensigbjørn animprovedgenomeassemblyuncoversprolifictandemrepeatsinatlanticcod
AT jakobsenkjetills animprovedgenomeassemblyuncoversprolifictandemrepeatsinatlanticcod
AT nederbragtalexanderj animprovedgenomeassemblyuncoversprolifictandemrepeatsinatlanticcod
AT tørresenolek improvedgenomeassemblyuncoversprolifictandemrepeatsinatlanticcod
AT starbastiaan improvedgenomeassemblyuncoversprolifictandemrepeatsinatlanticcod
AT jentoftsissel improvedgenomeassemblyuncoversprolifictandemrepeatsinatlanticcod
AT reinarwilliamb improvedgenomeassemblyuncoversprolifictandemrepeatsinatlanticcod
AT groveharald improvedgenomeassemblyuncoversprolifictandemrepeatsinatlanticcod
AT millerjasonr improvedgenomeassemblyuncoversprolifictandemrepeatsinatlanticcod
AT walenzbrianp improvedgenomeassemblyuncoversprolifictandemrepeatsinatlanticcod
AT knightjames improvedgenomeassemblyuncoversprolifictandemrepeatsinatlanticcod
AT ekholmjennym improvedgenomeassemblyuncoversprolifictandemrepeatsinatlanticcod
AT pelusopaul improvedgenomeassemblyuncoversprolifictandemrepeatsinatlanticcod
AT edvardsenrolfb improvedgenomeassemblyuncoversprolifictandemrepeatsinatlanticcod
AT toomingklunderudave improvedgenomeassemblyuncoversprolifictandemrepeatsinatlanticcod
AT skagemorten improvedgenomeassemblyuncoversprolifictandemrepeatsinatlanticcod
AT liensigbjørn improvedgenomeassemblyuncoversprolifictandemrepeatsinatlanticcod
AT jakobsenkjetills improvedgenomeassemblyuncoversprolifictandemrepeatsinatlanticcod
AT nederbragtalexanderj improvedgenomeassemblyuncoversprolifictandemrepeatsinatlanticcod