A telomere-to-telomere reference genome provides genetic insight into the pentacyclic triterpenoid biosynthesis in Chaenomeles speciosa
Chaenomeles speciosa (2n = 34), a medicinal and edible plant in the Rosaceae, is commonly used in traditional Chinese medicine. To date, the lack of genomic sequence and genetic studies has impeded efforts to improve its medicinal value. Herein, we report the use of an integrative approach involving...
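Main authors: | He, Shaofang; Weng, Duanyang; Zhang, Yipeng; Kong, Qiusheng; Wang, Keyue; Jing, Naliang; Li, Fengfeng; Ge, Yuebin; Xiong, Hui; Wu, Lei; Xie, De-Yu; Feng, Shengqiu; Yu, Xiaqing; Wang, Xuekui; Shu, Shaohua; Mei, Zhinan |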
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Oxford University Press, 2023 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10623406/ https://www.ncbi.nlm.nih.gov/pubmed/37927407 http://dx.doi.org/10.1093/hr/uhad183 |
_version_ | 1785130733114753024 |
---|---|
author | He, Shaofang; Weng, Duanyang; Zhang, Yipeng; Kong, Qiusheng; Wang, Keyue; Jing, Naliang; Li, Fengfeng; Ge, Yuebin; Xiong, Hui; Wu, Lei; Xie, De-Yu; Feng, Shengqiu; Yu, Xiaqing; Wang, Xuekui; Shu, Shaohua; Mei, Zhinan |
author_facet | He, Shaofang; Weng, Duanyang; Zhang, Yipeng; Kong, Qiusheng; Wang, Keyue; Jing, Naliang; Li, Fengfeng; Ge, Yuebin; Xiong, Hui; Wu, Lei; Xie, De-Yu; Feng, Shengqiu; Yu, Xiaqing; Wang, Xuekui; Shu, Shaohua; Mei, Zhinan |
author_sort | He, Shaofang |
collection | PubMed |
description | Chaenomeles speciosa (2n = 34), a medicinal and edible plant in the Rosaceae, is commonly used in traditional Chinese medicine. To date, the lack of genomic sequence and genetic studies has impeded efforts to improve its medicinal value. Herein, we report the use of an integrative approach involving PacBio HiFi (third-generation) sequencing and Hi-C scaffolding to assemble a high-quality telomere-to-telomere genome of C. speciosa. The genome comprised 650.4 Mb with a contig N50 of 35.5 Mb. Of these, 632.3 Mb were anchored to 17 pseudo-chromosomes, in which 12, 4, and 1 pseudo-chromosomes were represented by a single contig, two contigs, and four contigs, respectively. Eleven pseudo-chromosomes had telomere repeats at both ends, and four had telomere repeats at a single end. Repetitive sequences accounted for 49.5% of the genome, while a total of 45 515 protein-coding genes have been annotated. The genome size of C. speciosa was relatively similar to that of Malus domestica. Expanded or contracted gene families were identified and investigated for their association with different plant metabolisms or biological processes. In particular, functional annotation characterized gene families that were associated with the biosynthetic pathway of oleanolic and ursolic acids, two abundant pentacyclic triterpenoids in the fruits of C. speciosa. Taken together, this telomere-to-telomere and chromosome-level genome of C. speciosa not only provides a valuable resource to enhance understanding of the biosynthesis of medicinal compounds in tissues, but also promotes understanding of the evolution of the Rosaceae. |
format | Online Article Text |
id | pubmed-10623406 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-10623406 2023-11-04 A telomere-to-telomere reference genome provides genetic insight into the pentacyclic triterpenoid biosynthesis in Chaenomeles speciosa He, Shaofang Weng, Duanyang Zhang, Yipeng Kong, Qiusheng Wang, Keyue Jing, Naliang Li, Fengfeng Ge, Yuebin Xiong, Hui Wu, Lei Xie, De-Yu Feng, Shengqiu Yu, Xiaqing Wang, Xuekui Shu, Shaohua Mei, Zhinan Hortic Res Article Chaenomeles speciosa (2n = 34), a medicinal and edible plant in the Rosaceae, is commonly used in traditional Chinese medicine. To date, the lack of genomic sequence and genetic studies has impeded efforts to improve its medicinal value. Herein, we report the use of an integrative approach involving PacBio HiFi (third-generation) sequencing and Hi-C scaffolding to assemble a high-quality telomere-to-telomere genome of C. speciosa. The genome comprised 650.4 Mb with a contig N50 of 35.5 Mb. Of these, 632.3 Mb were anchored to 17 pseudo-chromosomes, in which 12, 4, and 1 pseudo-chromosomes were represented by a single contig, two contigs, and four contigs, respectively. Eleven pseudo-chromosomes had telomere repeats at both ends, and four had telomere repeats at a single end. Repetitive sequences accounted for 49.5% of the genome, while a total of 45 515 protein-coding genes have been annotated. The genome size of C. speciosa was relatively similar to that of Malus domestica. Expanded or contracted gene families were identified and investigated for their association with different plant metabolisms or biological processes. In particular, functional annotation characterized gene families that were associated with the biosynthetic pathway of oleanolic and ursolic acids, two abundant pentacyclic triterpenoids in the fruits of C. speciosa. Taken together, this telomere-to-telomere and chromosome-level genome of C. speciosa not only provides a valuable resource to enhance understanding of the biosynthesis of medicinal compounds in tissues, but also promotes understanding of the evolution of the Rosaceae. Oxford University Press 2023-09-14 /pmc/articles/PMC10623406/ /pubmed/37927407 http://dx.doi.org/10.1093/hr/uhad183 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of Nanjing Agricultural University. https://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Article He, Shaofang Weng, Duanyang Zhang, Yipeng Kong, Qiusheng Wang, Keyue Jing, Naliang Li, Fengfeng Ge, Yuebin Xiong, Hui Wu, Lei Xie, De-Yu Feng, Shengqiu Yu, Xiaqing Wang, Xuekui Shu, Shaohua Mei, Zhinan A telomere-to-telomere reference genome provides genetic insight into the pentacyclic triterpenoid biosynthesis in Chaenomeles speciosa |
title | A telomere-to-telomere reference genome provides genetic insight into the pentacyclic triterpenoid biosynthesis in Chaenomeles speciosa |
title_sort | telomere-to-telomere reference genome provides genetic insight into the pentacyclic triterpenoid biosynthesis in chaenomeles speciosa |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10623406/ https://www.ncbi.nlm.nih.gov/pubmed/37927407 http://dx.doi.org/10.1093/hr/uhad183 |