A High-Quality Reference Genome Assembly of Prinsepia uniflora (Rosaceae)

This study introduces a meticulously constructed genome assembly at the chromosome level for the Rosaceae family species Prinsepia uniflora, a traditional Chinese medicinal herb. The final assembly encompasses 1272.71 megabases (Mb) distributed across 16 pseudochromosomes, boasting contig and super-...

Bibliographic Details
Main authors: Zhang, Lei; Zhang, Chaopan; An, Yajing; Zhu, Qiang; Wang, Mingcheng
Format: Online Article Text
Language: English
Published: MDPI 2023
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10671140/
https://www.ncbi.nlm.nih.gov/pubmed/38002978
http://dx.doi.org/10.3390/genes14112035
_version_ 1785140084698251264
author Zhang, Lei
Zhang, Chaopan
An, Yajing
Zhu, Qiang
Wang, Mingcheng
author_facet Zhang, Lei
Zhang, Chaopan
An, Yajing
Zhu, Qiang
Wang, Mingcheng
author_sort Zhang, Lei
collection PubMed
description This study introduces a meticulously constructed genome assembly at the chromosome level for the Rosaceae family species Prinsepia uniflora, a traditional Chinese medicinal herb. The final assembly encompasses 1272.71 megabases (Mb) distributed across 16 pseudochromosomes, boasting contig and super-scaffold N50 values of 2.77 and 79.32 Mb, respectively. Annotated within this genome is a substantial 875.99 Mb of repetitive sequences, with transposable elements occupying 777.28 Mb, constituting 61.07% of the entire genome. Our predictive efforts identified 49,261 protein-coding genes within the repeat-masked assembly, with 45,256 (91.87%) having functional annotations, 5127 (10.41%) demonstrating tandem duplication, and 2373 (4.82%) classified as transcription factor genes. Additionally, our investigation unveiled 3080 non-coding RNAs spanning 0.51 Mb of the genome sequences. According to our evolutionary study, P. uniflora underwent recent whole-genome duplication following its separation from Prunus salicina. The presented reference-level genome assembly and annotation for P. uniflora will significantly facilitate the in-depth exploration of genomic information pertaining to this species, offering substantial utility in comparative genomics and evolutionary analyses involving Rosaceae species.
format Online
Article
Text
id pubmed-10671140
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-10671140 2023-11-02 A High-Quality Reference Genome Assembly of Prinsepia uniflora (Rosaceae) Zhang, Lei Zhang, Chaopan An, Yajing Zhu, Qiang Wang, Mingcheng Genes (Basel) Brief Report MDPI 2023-11-02 /pmc/articles/PMC10671140/ /pubmed/38002978 http://dx.doi.org/10.3390/genes14112035 Text en © 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
title A High-Quality Reference Genome Assembly of Prinsepia uniflora (Rosaceae)
title_full A High-Quality Reference Genome Assembly of Prinsepia uniflora (Rosaceae)
title_fullStr A High-Quality Reference Genome Assembly of Prinsepia uniflora (Rosaceae)
title_full_unstemmed A High-Quality Reference Genome Assembly of Prinsepia uniflora (Rosaceae)
title_short A High-Quality Reference Genome Assembly of Prinsepia uniflora (Rosaceae)
title_sort high-quality reference genome assembly of prinsepia uniflora (rosaceae)
topic Brief Report
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10671140/
https://www.ncbi.nlm.nih.gov/pubmed/38002978
http://dx.doi.org/10.3390/genes14112035