Cargando…

Haplogenome assembly reveals structural variation in Eucalyptus interspecific hybrids

BACKGROUND: De novo phased (haplo)genome assembly using long-read DNA sequencing data has improved the detection and characterization of structural variants (SVs) in plant and animal genomes. Able to span across haplotypes, long reads allow phased, haplogenome assembly in highly outbred organisms su...

Descripción completa

Detalles Bibliográficos
Autores principales: Lötter, Anneri, Duong, Tuan A, Candotti, Julia, Mizrachi, Eshchar, Wegrzyn, Jill L, Myburg, Alexander A
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10460159/
https://www.ncbi.nlm.nih.gov/pubmed/37632754
http://dx.doi.org/10.1093/gigascience/giad064
_version_ 1785097581176553472
author Lötter, Anneri
Duong, Tuan A
Candotti, Julia
Mizrachi, Eshchar
Wegrzyn, Jill L
Myburg, Alexander A
author_facet Lötter, Anneri
Duong, Tuan A
Candotti, Julia
Mizrachi, Eshchar
Wegrzyn, Jill L
Myburg, Alexander A
author_sort Lötter, Anneri
collection PubMed
description BACKGROUND: De novo phased (haplo)genome assembly using long-read DNA sequencing data has improved the detection and characterization of structural variants (SVs) in plant and animal genomes. Able to span across haplotypes, long reads allow phased, haplogenome assembly in highly outbred organisms such as forest trees. Eucalyptus tree species and interspecific hybrids are the most widely planted hardwood trees with F1 hybrids of Eucalyptus grandis and E. urophylla forming the bulk of fast-growing pulpwood plantations in subtropical regions. The extent of structural variation and its effect on interspecific hybridization is unknown in these trees. As a first step towards elucidating the extent of structural variation between the genomes of E. grandis and E. urophylla, we sequenced and assembled the haplogenomes contained in an F1 hybrid of the two species. FINDINGS: Using Nanopore sequencing and a trio-binning approach, we assembled the separate haplogenomes (566.7 Mb and 544.5 Mb) to 98.0% BUSCO completion. High-density SNP genetic linkage maps of both parents allowed scaffolding of 88.0% of the haplogenome contigs into 11 pseudo-chromosomes (scaffold N50 of 43.8 Mb and 42.5 Mb for the E. grandis and E. urophylla haplogenomes, respectively). We identify 48,729 SVs between the two haplogenomes providing the first detailed insight into genome structural rearrangement in these species. The two haplogenomes have similar gene content, 35,572 and 33,915 functionally annotated genes, of which 34.7% are contained in genome rearrangements. CONCLUSIONS: Knowledge of SV and haplotype diversity in the two species will form the basis for understanding the genetic basis of hybrid superiority in these trees.
format Online
Article
Text
id pubmed-10460159
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-104601592023-08-27 Haplogenome assembly reveals structural variation in Eucalyptus interspecific hybrids Lötter, Anneri Duong, Tuan A Candotti, Julia Mizrachi, Eshchar Wegrzyn, Jill L Myburg, Alexander A Gigascience Data Note BACKGROUND: De novo phased (haplo)genome assembly using long-read DNA sequencing data has improved the detection and characterization of structural variants (SVs) in plant and animal genomes. Able to span across haplotypes, long reads allow phased, haplogenome assembly in highly outbred organisms such as forest trees. Eucalyptus tree species and interspecific hybrids are the most widely planted hardwood trees with F1 hybrids of Eucalyptus grandis and E. urophylla forming the bulk of fast-growing pulpwood plantations in subtropical regions. The extent of structural variation and its effect on interspecific hybridization is unknown in these trees. As a first step towards elucidating the extent of structural variation between the genomes of E. grandis and E. urophylla, we sequenced and assembled the haplogenomes contained in an F1 hybrid of the two species. FINDINGS: Using Nanopore sequencing and a trio-binning approach, we assembled the separate haplogenomes (566.7 Mb and 544.5 Mb) to 98.0% BUSCO completion. High-density SNP genetic linkage maps of both parents allowed scaffolding of 88.0% of the haplogenome contigs into 11 pseudo-chromosomes (scaffold N50 of 43.8 Mb and 42.5 Mb for the E. grandis and E. urophylla haplogenomes, respectively). We identify 48,729 SVs between the two haplogenomes providing the first detailed insight into genome structural rearrangement in these species. The two haplogenomes have similar gene content, 35,572 and 33,915 functionally annotated genes, of which 34.7% are contained in genome rearrangements. CONCLUSIONS: Knowledge of SV and haplotype diversity in the two species will form the basis for understanding the genetic basis of hybrid superiority in these trees. Oxford University Press 2023-08-26 /pmc/articles/PMC10460159/ /pubmed/37632754 http://dx.doi.org/10.1093/gigascience/giad064 Text en © The Author(s) 2023. Published by Oxford University Press GigaScience. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Data Note
Lötter, Anneri
Duong, Tuan A
Candotti, Julia
Mizrachi, Eshchar
Wegrzyn, Jill L
Myburg, Alexander A
Haplogenome assembly reveals structural variation in Eucalyptus interspecific hybrids
title Haplogenome assembly reveals structural variation in Eucalyptus interspecific hybrids
title_full Haplogenome assembly reveals structural variation in Eucalyptus interspecific hybrids
title_fullStr Haplogenome assembly reveals structural variation in Eucalyptus interspecific hybrids
title_full_unstemmed Haplogenome assembly reveals structural variation in Eucalyptus interspecific hybrids
title_short Haplogenome assembly reveals structural variation in Eucalyptus interspecific hybrids
title_sort haplogenome assembly reveals structural variation in eucalyptus interspecific hybrids
topic Data Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10460159/
https://www.ncbi.nlm.nih.gov/pubmed/37632754
http://dx.doi.org/10.1093/gigascience/giad064
work_keys_str_mv AT lotteranneri haplogenomeassemblyrevealsstructuralvariationineucalyptusinterspecifichybrids
AT duongtuana haplogenomeassemblyrevealsstructuralvariationineucalyptusinterspecifichybrids
AT candottijulia haplogenomeassemblyrevealsstructuralvariationineucalyptusinterspecifichybrids
AT mizrachieshchar haplogenomeassemblyrevealsstructuralvariationineucalyptusinterspecifichybrids
AT wegrzynjilll haplogenomeassemblyrevealsstructuralvariationineucalyptusinterspecifichybrids
AT myburgalexandera haplogenomeassemblyrevealsstructuralvariationineucalyptusinterspecifichybrids