The Chromosome-Scale Reference Genome of Macadamia tetraphylla Provides Insights Into Fatty Acid Biosynthesis

Bibliographic Details
Main authors: Niu, Yingfeng; Li, Guohua; Ni, Shubang; He, Xiyong; Zheng, Cheng; Liu, Ziyan; Gong, Lidan; Kong, Guanghong; Li, Wei; Liu, Jin
Format: Online Article Text
Language: English
Published: Frontiers Media S.A. 2022
Subjects: Genetics
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8906886/
https://www.ncbi.nlm.nih.gov/pubmed/35281801
http://dx.doi.org/10.3389/fgene.2022.835363
collection PubMed
description Macadamia is an evergreen tree belonging to the Proteaceae family. The two commercial macadamia species, Macadamia integrifolia and M. tetraphylla, are highly prized for their edible kernels. The M. integrifolia genome was recently sequenced, but the genome of M. tetraphylla has not yet been published, which limits biological research and breeding in this species. This study reports a high-quality genome sequence of M. tetraphylla based on Oxford Nanopore Technologies sequencing and high-throughput chromosome conformation capture (Hi-C). An assembly of 750.87 Mb with an N50 length of 51.11 Mb was generated, close to the size estimates of 740 Mb from flow cytometry and 758 Mb from k-mer analysis. Genome annotation indicated that 61.42% of the genome is composed of repetitive sequences and 34.95% is composed of long terminal repeat retrotransposons. A total of 31,571 protein-coding genes were predicted, of which 92.59% were functionally annotated. The average gene length was 6,055 bp. Comparative genome analysis revealed that gene families associated with defense response, lipid transport, steroid biosynthesis, triglyceride lipase activity, and fatty acid metabolism are expanded in the M. tetraphylla genome. The distribution of fourfold synonymous third-codon transversion (4DTv) values indicated a recent whole-genome duplication event in M. tetraphylla. Genomic and transcriptomic analysis identified 187 genes encoding 33 crucial oil biosynthesis enzymes, providing a comprehensive map of macadamia lipid biosynthesis. In addition, the 55 identified WRKY genes were preferentially expressed in root relative to other tissues. The genome sequence of M. tetraphylla provides new insights for breeding new varieties and for the genetic improvement of agronomic traits.
id pubmed-8906886
institution National Center for Biotechnology Information
record_format MEDLINE/PubMed
spelling pubmed-8906886 2022-03-10 Front Genet Genetics Frontiers Media S.A. 2022-02-23 /pmc/articles/PMC8906886/ /pubmed/35281801 http://dx.doi.org/10.3389/fgene.2022.835363 Text en
Copyright © 2022 Niu, Li, Ni, He, Zheng, Liu, Gong, Kong, Li and Liu. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
topic Genetics