Cargando…

A haplotype-resolved, de novo genome assembly for the wood tiger moth (Arctia plantaginis) through trio binning

BACKGROUND: Diploid genome assembly is typically impeded by heterozygosity because it introduces errors when haplotypes are collapsed into a consensus sequence. Trio binning offers an innovative solution that exploits heterozygosity for assembly. Short, parental reads are used to assign parental ori...

Descripción completa

Detalles Bibliográficos
Autores principales: Yen, Eugenie C, McCarthy, Shane A, Galarza, Juan A, Generalovic, Tomas N, Pelan, Sarah, Nguyen, Petr, Meier, Joana I, Warren, Ian A, Mappes, Johanna, Durbin, Richard, Jiggins, Chris D
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7433188/
https://www.ncbi.nlm.nih.gov/pubmed/32808665
http://dx.doi.org/10.1093/gigascience/giaa088
_version_ 1783571956213219328
author Yen, Eugenie C
McCarthy, Shane A
Galarza, Juan A
Generalovic, Tomas N
Pelan, Sarah
Nguyen, Petr
Meier, Joana I
Warren, Ian A
Mappes, Johanna
Durbin, Richard
Jiggins, Chris D
author_facet Yen, Eugenie C
McCarthy, Shane A
Galarza, Juan A
Generalovic, Tomas N
Pelan, Sarah
Nguyen, Petr
Meier, Joana I
Warren, Ian A
Mappes, Johanna
Durbin, Richard
Jiggins, Chris D
author_sort Yen, Eugenie C
collection PubMed
description BACKGROUND: Diploid genome assembly is typically impeded by heterozygosity because it introduces errors when haplotypes are collapsed into a consensus sequence. Trio binning offers an innovative solution that exploits heterozygosity for assembly. Short, parental reads are used to assign parental origin to long reads from their F1 offspring before assembly, enabling complete haplotype resolution. Trio binning could therefore provide an effective strategy for assembling highly heterozygous genomes, which are traditionally problematic, such as insect genomes. This includes the wood tiger moth (Arctia plantaginis), which is an evolutionary study system for warning colour polymorphism. FINDINGS: We produced a high-quality, haplotype-resolved assembly for Arctia plantaginis through trio binning. We sequenced a same-species family (F1 heterozygosity ∼1.9%) and used parental Illumina reads to bin 99.98% of offspring Pacific Biosciences reads by parental origin, before assembling each haplotype separately and scaffolding with 10X linked reads. Both assemblies are contiguous (mean scaffold N50: 8.2 Mb) and complete (mean BUSCO completeness: 97.3%), with annotations and 31 chromosomes identified through karyotyping. We used the assembly to analyse genome-wide population structure and relationships between 40 wild resequenced individuals from 5 populations across Europe, revealing the Georgian population as the most genetically differentiated with the lowest genetic diversity. CONCLUSIONS: We present the first invertebrate genome to be assembled via trio binning. This assembly is one of the highest quality genomes available for Lepidoptera, supporting trio binning as a potent strategy for assembling heterozygous genomes. Using our assembly, we provide genomic insights into the geographic population structure of A. plantaginis.
format Online
Article
Text
id pubmed-7433188
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-74331882020-08-20 A haplotype-resolved, de novo genome assembly for the wood tiger moth (Arctia plantaginis) through trio binning Yen, Eugenie C McCarthy, Shane A Galarza, Juan A Generalovic, Tomas N Pelan, Sarah Nguyen, Petr Meier, Joana I Warren, Ian A Mappes, Johanna Durbin, Richard Jiggins, Chris D Gigascience Data Note BACKGROUND: Diploid genome assembly is typically impeded by heterozygosity because it introduces errors when haplotypes are collapsed into a consensus sequence. Trio binning offers an innovative solution that exploits heterozygosity for assembly. Short, parental reads are used to assign parental origin to long reads from their F1 offspring before assembly, enabling complete haplotype resolution. Trio binning could therefore provide an effective strategy for assembling highly heterozygous genomes, which are traditionally problematic, such as insect genomes. This includes the wood tiger moth (Arctia plantaginis), which is an evolutionary study system for warning colour polymorphism. FINDINGS: We produced a high-quality, haplotype-resolved assembly for Arctia plantaginis through trio binning. We sequenced a same-species family (F1 heterozygosity ∼1.9%) and used parental Illumina reads to bin 99.98% of offspring Pacific Biosciences reads by parental origin, before assembling each haplotype separately and scaffolding with 10X linked reads. Both assemblies are contiguous (mean scaffold N50: 8.2 Mb) and complete (mean BUSCO completeness: 97.3%), with annotations and 31 chromosomes identified through karyotyping. We used the assembly to analyse genome-wide population structure and relationships between 40 wild resequenced individuals from 5 populations across Europe, revealing the Georgian population as the most genetically differentiated with the lowest genetic diversity. CONCLUSIONS: We present the first invertebrate genome to be assembled via trio binning. This assembly is one of the highest quality genomes available for Lepidoptera, supporting trio binning as a potent strategy for assembling heterozygous genomes. Using our assembly, we provide genomic insights into the geographic population structure of A. plantaginis. Oxford University Press 2020-08-18 /pmc/articles/PMC7433188/ /pubmed/32808665 http://dx.doi.org/10.1093/gigascience/giaa088 Text en © The Author(s) 2020. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Data Note
Yen, Eugenie C
McCarthy, Shane A
Galarza, Juan A
Generalovic, Tomas N
Pelan, Sarah
Nguyen, Petr
Meier, Joana I
Warren, Ian A
Mappes, Johanna
Durbin, Richard
Jiggins, Chris D
A haplotype-resolved, de novo genome assembly for the wood tiger moth (Arctia plantaginis) through trio binning
title A haplotype-resolved, de novo genome assembly for the wood tiger moth (Arctia plantaginis) through trio binning
title_full A haplotype-resolved, de novo genome assembly for the wood tiger moth (Arctia plantaginis) through trio binning
title_fullStr A haplotype-resolved, de novo genome assembly for the wood tiger moth (Arctia plantaginis) through trio binning
title_full_unstemmed A haplotype-resolved, de novo genome assembly for the wood tiger moth (Arctia plantaginis) through trio binning
title_short A haplotype-resolved, de novo genome assembly for the wood tiger moth (Arctia plantaginis) through trio binning
title_sort haplotype-resolved, de novo genome assembly for the wood tiger moth (arctia plantaginis) through trio binning
topic Data Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7433188/
https://www.ncbi.nlm.nih.gov/pubmed/32808665
http://dx.doi.org/10.1093/gigascience/giaa088
work_keys_str_mv AT yeneugeniec ahaplotyperesolveddenovogenomeassemblyforthewoodtigermotharctiaplantaginisthroughtriobinning
AT mccarthyshanea ahaplotyperesolveddenovogenomeassemblyforthewoodtigermotharctiaplantaginisthroughtriobinning
AT galarzajuana ahaplotyperesolveddenovogenomeassemblyforthewoodtigermotharctiaplantaginisthroughtriobinning
AT generalovictomasn ahaplotyperesolveddenovogenomeassemblyforthewoodtigermotharctiaplantaginisthroughtriobinning
AT pelansarah ahaplotyperesolveddenovogenomeassemblyforthewoodtigermotharctiaplantaginisthroughtriobinning
AT nguyenpetr ahaplotyperesolveddenovogenomeassemblyforthewoodtigermotharctiaplantaginisthroughtriobinning
AT meierjoanai ahaplotyperesolveddenovogenomeassemblyforthewoodtigermotharctiaplantaginisthroughtriobinning
AT warreniana ahaplotyperesolveddenovogenomeassemblyforthewoodtigermotharctiaplantaginisthroughtriobinning
AT mappesjohanna ahaplotyperesolveddenovogenomeassemblyforthewoodtigermotharctiaplantaginisthroughtriobinning
AT durbinrichard ahaplotyperesolveddenovogenomeassemblyforthewoodtigermotharctiaplantaginisthroughtriobinning
AT jigginschrisd ahaplotyperesolveddenovogenomeassemblyforthewoodtigermotharctiaplantaginisthroughtriobinning
AT yeneugeniec haplotyperesolveddenovogenomeassemblyforthewoodtigermotharctiaplantaginisthroughtriobinning
AT mccarthyshanea haplotyperesolveddenovogenomeassemblyforthewoodtigermotharctiaplantaginisthroughtriobinning
AT galarzajuana haplotyperesolveddenovogenomeassemblyforthewoodtigermotharctiaplantaginisthroughtriobinning
AT generalovictomasn haplotyperesolveddenovogenomeassemblyforthewoodtigermotharctiaplantaginisthroughtriobinning
AT pelansarah haplotyperesolveddenovogenomeassemblyforthewoodtigermotharctiaplantaginisthroughtriobinning
AT nguyenpetr haplotyperesolveddenovogenomeassemblyforthewoodtigermotharctiaplantaginisthroughtriobinning
AT meierjoanai haplotyperesolveddenovogenomeassemblyforthewoodtigermotharctiaplantaginisthroughtriobinning
AT warreniana haplotyperesolveddenovogenomeassemblyforthewoodtigermotharctiaplantaginisthroughtriobinning
AT mappesjohanna haplotyperesolveddenovogenomeassemblyforthewoodtigermotharctiaplantaginisthroughtriobinning
AT durbinrichard haplotyperesolveddenovogenomeassemblyforthewoodtigermotharctiaplantaginisthroughtriobinning
AT jigginschrisd haplotyperesolveddenovogenomeassemblyforthewoodtigermotharctiaplantaginisthroughtriobinning