Cargando…

Chromosome-Scale Assembly and Annotation of the Macadamia Genome (Macadamia integrifolia HAES 741)

Macadamia integrifolia is a representative of the large basal eudicot family Proteaceae and the main progenitor species of the Australian native nut crop macadamia. Since its commercialisation in Hawaii fewer than 100 years ago, global production has expanded rapidly. However, genomic resources are...

Descripción completa

Detalles Bibliográficos
Autores principales: Nock, Catherine J., Baten, Abdul, Mauleon, Ramil, Langdon, Kirsty S., Topp, Bruce, Hardner, Craig, Furtado, Agnelo, Henry, Robert J., King, Graham J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Genetics Society of America 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7534425/
https://www.ncbi.nlm.nih.gov/pubmed/32747341
http://dx.doi.org/10.1534/g3.120.401326
_version_ 1783590310426705920
author Nock, Catherine J.
Baten, Abdul
Mauleon, Ramil
Langdon, Kirsty S.
Topp, Bruce
Hardner, Craig
Furtado, Agnelo
Henry, Robert J.
King, Graham J.
author_facet Nock, Catherine J.
Baten, Abdul
Mauleon, Ramil
Langdon, Kirsty S.
Topp, Bruce
Hardner, Craig
Furtado, Agnelo
Henry, Robert J.
King, Graham J.
author_sort Nock, Catherine J.
collection PubMed
description Macadamia integrifolia is a representative of the large basal eudicot family Proteaceae and the main progenitor species of the Australian native nut crop macadamia. Since its commercialisation in Hawaii fewer than 100 years ago, global production has expanded rapidly. However, genomic resources are limited in comparison to other horticultural crops. The first draft assembly of M. integrifolia had good coverage of the functional gene space but its high fragmentation has restricted its use in comparative genomics and association studies. Here we have generated an improved assembly of cultivar HAES 741 (4,094 scaffolds, 745 Mb, N50 413 kb) using a combination of Illumina paired and PacBio long read sequences. Scaffolds were anchored to 14 pseudo-chromosomes using seven genetic linkage maps. This assembly has improved contiguity and coverage, with >120 Gb of additional sequence. Following annotation, 34,274 protein-coding genes were predicted, representing 90% of the expected gene content. Our results indicate that the macadamia genome is repetitive and heterozygous. The total repeat content was 55% and genome-wide heterozygosity, estimated by read mapping, was 0.98% or an average of one SNP per 102 bp. This is the first chromosome-scale genome assembly for macadamia and the Proteaceae. It is expected to be a valuable resource for breeding, gene discovery, conservation and evolutionary genomics.
format Online
Article
Text
id pubmed-7534425
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Genetics Society of America
record_format MEDLINE/PubMed
spelling pubmed-75344252020-10-13 Chromosome-Scale Assembly and Annotation of the Macadamia Genome (Macadamia integrifolia HAES 741) Nock, Catherine J. Baten, Abdul Mauleon, Ramil Langdon, Kirsty S. Topp, Bruce Hardner, Craig Furtado, Agnelo Henry, Robert J. King, Graham J. G3 (Bethesda) Genome Report Macadamia integrifolia is a representative of the large basal eudicot family Proteaceae and the main progenitor species of the Australian native nut crop macadamia. Since its commercialisation in Hawaii fewer than 100 years ago, global production has expanded rapidly. However, genomic resources are limited in comparison to other horticultural crops. The first draft assembly of M. integrifolia had good coverage of the functional gene space but its high fragmentation has restricted its use in comparative genomics and association studies. Here we have generated an improved assembly of cultivar HAES 741 (4,094 scaffolds, 745 Mb, N50 413 kb) using a combination of Illumina paired and PacBio long read sequences. Scaffolds were anchored to 14 pseudo-chromosomes using seven genetic linkage maps. This assembly has improved contiguity and coverage, with >120 Gb of additional sequence. Following annotation, 34,274 protein-coding genes were predicted, representing 90% of the expected gene content. Our results indicate that the macadamia genome is repetitive and heterozygous. The total repeat content was 55% and genome-wide heterozygosity, estimated by read mapping, was 0.98% or an average of one SNP per 102 bp. This is the first chromosome-scale genome assembly for macadamia and the Proteaceae. It is expected to be a valuable resource for breeding, gene discovery, conservation and evolutionary genomics. Genetics Society of America 2020-08-03 /pmc/articles/PMC7534425/ /pubmed/32747341 http://dx.doi.org/10.1534/g3.120.401326 Text en Copyright © 2020 Nock et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Genome Report
Nock, Catherine J.
Baten, Abdul
Mauleon, Ramil
Langdon, Kirsty S.
Topp, Bruce
Hardner, Craig
Furtado, Agnelo
Henry, Robert J.
King, Graham J.
Chromosome-Scale Assembly and Annotation of the Macadamia Genome (Macadamia integrifolia HAES 741)
title Chromosome-Scale Assembly and Annotation of the Macadamia Genome (Macadamia integrifolia HAES 741)
title_full Chromosome-Scale Assembly and Annotation of the Macadamia Genome (Macadamia integrifolia HAES 741)
title_fullStr Chromosome-Scale Assembly and Annotation of the Macadamia Genome (Macadamia integrifolia HAES 741)
title_full_unstemmed Chromosome-Scale Assembly and Annotation of the Macadamia Genome (Macadamia integrifolia HAES 741)
title_short Chromosome-Scale Assembly and Annotation of the Macadamia Genome (Macadamia integrifolia HAES 741)
title_sort chromosome-scale assembly and annotation of the macadamia genome (macadamia integrifolia haes 741)
topic Genome Report
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7534425/
https://www.ncbi.nlm.nih.gov/pubmed/32747341
http://dx.doi.org/10.1534/g3.120.401326
work_keys_str_mv AT nockcatherinej chromosomescaleassemblyandannotationofthemacadamiagenomemacadamiaintegrifoliahaes741
AT batenabdul chromosomescaleassemblyandannotationofthemacadamiagenomemacadamiaintegrifoliahaes741
AT mauleonramil chromosomescaleassemblyandannotationofthemacadamiagenomemacadamiaintegrifoliahaes741
AT langdonkirstys chromosomescaleassemblyandannotationofthemacadamiagenomemacadamiaintegrifoliahaes741
AT toppbruce chromosomescaleassemblyandannotationofthemacadamiagenomemacadamiaintegrifoliahaes741
AT hardnercraig chromosomescaleassemblyandannotationofthemacadamiagenomemacadamiaintegrifoliahaes741
AT furtadoagnelo chromosomescaleassemblyandannotationofthemacadamiagenomemacadamiaintegrifoliahaes741
AT henryrobertj chromosomescaleassemblyandannotationofthemacadamiagenomemacadamiaintegrifoliahaes741
AT kinggrahamj chromosomescaleassemblyandannotationofthemacadamiagenomemacadamiaintegrifoliahaes741