Cargando…

A hybrid de novo genome assembly of the honeybee, Apis mellifera, with chromosome-length scaffolds

BACKGROUND: The ability to generate long sequencing reads and access long-range linkage information is revolutionizing the quality and completeness of genome assemblies. Here we use a hybrid approach that combines data from four genome sequencing and mapping technologies to generate a new genome ass...

Descripción completa

Detalles Bibliográficos
Autores principales: Wallberg, Andreas, Bunikis, Ignas, Pettersson, Olga Vinnere, Mosbech, Mai-Britt, Childers, Anna K., Evans, Jay D., Mikheyev, Alexander S., Robertson, Hugh M., Robinson, Gene E., Webster, Matthew T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6454739/
https://www.ncbi.nlm.nih.gov/pubmed/30961563
http://dx.doi.org/10.1186/s12864-019-5642-0
_version_ 1783409600618299392
author Wallberg, Andreas
Bunikis, Ignas
Pettersson, Olga Vinnere
Mosbech, Mai-Britt
Childers, Anna K.
Evans, Jay D.
Mikheyev, Alexander S.
Robertson, Hugh M.
Robinson, Gene E.
Webster, Matthew T.
author_facet Wallberg, Andreas
Bunikis, Ignas
Pettersson, Olga Vinnere
Mosbech, Mai-Britt
Childers, Anna K.
Evans, Jay D.
Mikheyev, Alexander S.
Robertson, Hugh M.
Robinson, Gene E.
Webster, Matthew T.
author_sort Wallberg, Andreas
collection PubMed
description BACKGROUND: The ability to generate long sequencing reads and access long-range linkage information is revolutionizing the quality and completeness of genome assemblies. Here we use a hybrid approach that combines data from four genome sequencing and mapping technologies to generate a new genome assembly of the honeybee Apis mellifera. We first generated contigs based on PacBio sequencing libraries, which were then merged with linked-read 10x Chromium data followed by scaffolding using a BioNano optical genome map and a Hi-C chromatin interaction map, complemented by a genetic linkage map. RESULTS: Each of the assembly steps reduced the number of gaps and incorporated a substantial amount of additional sequence into scaffolds. The new assembly (Amel_HAv3) is significantly more contiguous and complete than the previous one (Amel_4.5), based mainly on Sanger sequencing reads. N50 of contigs is 120-fold higher (5.381 Mbp compared to 0.053 Mbp) and we anchor > 98% of the sequence to chromosomes. All of the 16 chromosomes are represented as single scaffolds with an average of three sequence gaps per chromosome. The improvements are largely due to the inclusion of repetitive sequence that was unplaced in previous assemblies. In particular, our assembly is highly contiguous across centromeres and telomeres and includes hundreds of AvaI and AluI repeats associated with these features. CONCLUSIONS: The improved assembly will be of utility for refining gene models, studying genome function, mapping functional genetic variation, identification of structural variants, and comparative genomics. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-019-5642-0) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6454739
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-64547392019-04-19 A hybrid de novo genome assembly of the honeybee, Apis mellifera, with chromosome-length scaffolds Wallberg, Andreas Bunikis, Ignas Pettersson, Olga Vinnere Mosbech, Mai-Britt Childers, Anna K. Evans, Jay D. Mikheyev, Alexander S. Robertson, Hugh M. Robinson, Gene E. Webster, Matthew T. BMC Genomics Research Article BACKGROUND: The ability to generate long sequencing reads and access long-range linkage information is revolutionizing the quality and completeness of genome assemblies. Here we use a hybrid approach that combines data from four genome sequencing and mapping technologies to generate a new genome assembly of the honeybee Apis mellifera. We first generated contigs based on PacBio sequencing libraries, which were then merged with linked-read 10x Chromium data followed by scaffolding using a BioNano optical genome map and a Hi-C chromatin interaction map, complemented by a genetic linkage map. RESULTS: Each of the assembly steps reduced the number of gaps and incorporated a substantial amount of additional sequence into scaffolds. The new assembly (Amel_HAv3) is significantly more contiguous and complete than the previous one (Amel_4.5), based mainly on Sanger sequencing reads. N50 of contigs is 120-fold higher (5.381 Mbp compared to 0.053 Mbp) and we anchor > 98% of the sequence to chromosomes. All of the 16 chromosomes are represented as single scaffolds with an average of three sequence gaps per chromosome. The improvements are largely due to the inclusion of repetitive sequence that was unplaced in previous assemblies. In particular, our assembly is highly contiguous across centromeres and telomeres and includes hundreds of AvaI and AluI repeats associated with these features. CONCLUSIONS: The improved assembly will be of utility for refining gene models, studying genome function, mapping functional genetic variation, identification of structural variants, and comparative genomics. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-019-5642-0) contains supplementary material, which is available to authorized users. BioMed Central 2019-04-08 /pmc/articles/PMC6454739/ /pubmed/30961563 http://dx.doi.org/10.1186/s12864-019-5642-0 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Wallberg, Andreas
Bunikis, Ignas
Pettersson, Olga Vinnere
Mosbech, Mai-Britt
Childers, Anna K.
Evans, Jay D.
Mikheyev, Alexander S.
Robertson, Hugh M.
Robinson, Gene E.
Webster, Matthew T.
A hybrid de novo genome assembly of the honeybee, Apis mellifera, with chromosome-length scaffolds
title A hybrid de novo genome assembly of the honeybee, Apis mellifera, with chromosome-length scaffolds
title_full A hybrid de novo genome assembly of the honeybee, Apis mellifera, with chromosome-length scaffolds
title_fullStr A hybrid de novo genome assembly of the honeybee, Apis mellifera, with chromosome-length scaffolds
title_full_unstemmed A hybrid de novo genome assembly of the honeybee, Apis mellifera, with chromosome-length scaffolds
title_short A hybrid de novo genome assembly of the honeybee, Apis mellifera, with chromosome-length scaffolds
title_sort hybrid de novo genome assembly of the honeybee, apis mellifera, with chromosome-length scaffolds
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6454739/
https://www.ncbi.nlm.nih.gov/pubmed/30961563
http://dx.doi.org/10.1186/s12864-019-5642-0
work_keys_str_mv AT wallbergandreas ahybriddenovogenomeassemblyofthehoneybeeapismelliferawithchromosomelengthscaffolds
AT bunikisignas ahybriddenovogenomeassemblyofthehoneybeeapismelliferawithchromosomelengthscaffolds
AT petterssonolgavinnere ahybriddenovogenomeassemblyofthehoneybeeapismelliferawithchromosomelengthscaffolds
AT mosbechmaibritt ahybriddenovogenomeassemblyofthehoneybeeapismelliferawithchromosomelengthscaffolds
AT childersannak ahybriddenovogenomeassemblyofthehoneybeeapismelliferawithchromosomelengthscaffolds
AT evansjayd ahybriddenovogenomeassemblyofthehoneybeeapismelliferawithchromosomelengthscaffolds
AT mikheyevalexanders ahybriddenovogenomeassemblyofthehoneybeeapismelliferawithchromosomelengthscaffolds
AT robertsonhughm ahybriddenovogenomeassemblyofthehoneybeeapismelliferawithchromosomelengthscaffolds
AT robinsongenee ahybriddenovogenomeassemblyofthehoneybeeapismelliferawithchromosomelengthscaffolds
AT webstermatthewt ahybriddenovogenomeassemblyofthehoneybeeapismelliferawithchromosomelengthscaffolds
AT wallbergandreas hybriddenovogenomeassemblyofthehoneybeeapismelliferawithchromosomelengthscaffolds
AT bunikisignas hybriddenovogenomeassemblyofthehoneybeeapismelliferawithchromosomelengthscaffolds
AT petterssonolgavinnere hybriddenovogenomeassemblyofthehoneybeeapismelliferawithchromosomelengthscaffolds
AT mosbechmaibritt hybriddenovogenomeassemblyofthehoneybeeapismelliferawithchromosomelengthscaffolds
AT childersannak hybriddenovogenomeassemblyofthehoneybeeapismelliferawithchromosomelengthscaffolds
AT evansjayd hybriddenovogenomeassemblyofthehoneybeeapismelliferawithchromosomelengthscaffolds
AT mikheyevalexanders hybriddenovogenomeassemblyofthehoneybeeapismelliferawithchromosomelengthscaffolds
AT robertsonhughm hybriddenovogenomeassemblyofthehoneybeeapismelliferawithchromosomelengthscaffolds
AT robinsongenee hybriddenovogenomeassemblyofthehoneybeeapismelliferawithchromosomelengthscaffolds
AT webstermatthewt hybriddenovogenomeassemblyofthehoneybeeapismelliferawithchromosomelengthscaffolds