Cargando…

Improved assembly and annotation of the sesame genome

Sesame (Sesamum indicum L.) is an important oilseed crop that produces abundant seed oil and has a pleasant flavor and high nutritional value. To date, several Illumina-based genome assemblies corresponding to different sesame genotypes have been published and widely used in genetic and genomic stud...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Mingcheng, Huang, Jianwei, Liu, Song, Liu, Xiaofeng, Li, Rui, Luo, Junjia, Fu, Zhixi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9724774/
https://www.ncbi.nlm.nih.gov/pubmed/36355766
http://dx.doi.org/10.1093/dnares/dsac041
_version_ 1784844487677181952
author Wang, Mingcheng
Huang, Jianwei
Liu, Song
Liu, Xiaofeng
Li, Rui
Luo, Junjia
Fu, Zhixi
author_facet Wang, Mingcheng
Huang, Jianwei
Liu, Song
Liu, Xiaofeng
Li, Rui
Luo, Junjia
Fu, Zhixi
author_sort Wang, Mingcheng
collection PubMed
description Sesame (Sesamum indicum L.) is an important oilseed crop that produces abundant seed oil and has a pleasant flavor and high nutritional value. To date, several Illumina-based genome assemblies corresponding to different sesame genotypes have been published and widely used in genetic and genomic studies of sesame. However, these assemblies consistently showed low continuity with numerous gaps. Here, we reported a high-quality, reference-level sesame genome assembly by integrating PacBio high-fidelity sequencing and Hi-C technology. Our updated sesame assembly was 309.35 Mb in size with a high chromosome anchoring rate (97.54%) and contig N50 size (13.48 Mb), which were better than previously published genomes. We identified 163.38 Mb repetitive elements and 24,345 high-confidence protein-coding genes in the updated sesame assembly. Comparative genomic analysis showed that sesame shared an ancient whole-genome duplication event with two Lamiales species. A total of 2,782 genes were tandemly duplicated. We also identified several genes that were likely involved in fatty acid and triacylglycerol biosynthesis. Our improved sesame assembly and annotation will facilitate future genetic studies and genomics-assisted breeding of sesame.
format Online
Article
Text
id pubmed-9724774
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-97247742022-12-07 Improved assembly and annotation of the sesame genome Wang, Mingcheng Huang, Jianwei Liu, Song Liu, Xiaofeng Li, Rui Luo, Junjia Fu, Zhixi DNA Res Resource Article: Genomes Explored Sesame (Sesamum indicum L.) is an important oilseed crop that produces abundant seed oil and has a pleasant flavor and high nutritional value. To date, several Illumina-based genome assemblies corresponding to different sesame genotypes have been published and widely used in genetic and genomic studies of sesame. However, these assemblies consistently showed low continuity with numerous gaps. Here, we reported a high-quality, reference-level sesame genome assembly by integrating PacBio high-fidelity sequencing and Hi-C technology. Our updated sesame assembly was 309.35 Mb in size with a high chromosome anchoring rate (97.54%) and contig N50 size (13.48 Mb), which were better than previously published genomes. We identified 163.38 Mb repetitive elements and 24,345 high-confidence protein-coding genes in the updated sesame assembly. Comparative genomic analysis showed that sesame shared an ancient whole-genome duplication event with two Lamiales species. A total of 2,782 genes were tandemly duplicated. We also identified several genes that were likely involved in fatty acid and triacylglycerol biosynthesis. Our improved sesame assembly and annotation will facilitate future genetic studies and genomics-assisted breeding of sesame. Oxford University Press 2022-11-10 /pmc/articles/PMC9724774/ /pubmed/36355766 http://dx.doi.org/10.1093/dnares/dsac041 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of Kazusa DNA Research Institute. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Resource Article: Genomes Explored
Wang, Mingcheng
Huang, Jianwei
Liu, Song
Liu, Xiaofeng
Li, Rui
Luo, Junjia
Fu, Zhixi
Improved assembly and annotation of the sesame genome
title Improved assembly and annotation of the sesame genome
title_full Improved assembly and annotation of the sesame genome
title_fullStr Improved assembly and annotation of the sesame genome
title_full_unstemmed Improved assembly and annotation of the sesame genome
title_short Improved assembly and annotation of the sesame genome
title_sort improved assembly and annotation of the sesame genome
topic Resource Article: Genomes Explored
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9724774/
https://www.ncbi.nlm.nih.gov/pubmed/36355766
http://dx.doi.org/10.1093/dnares/dsac041
work_keys_str_mv AT wangmingcheng improvedassemblyandannotationofthesesamegenome
AT huangjianwei improvedassemblyandannotationofthesesamegenome
AT liusong improvedassemblyandannotationofthesesamegenome
AT liuxiaofeng improvedassemblyandannotationofthesesamegenome
AT lirui improvedassemblyandannotationofthesesamegenome
AT luojunjia improvedassemblyandannotationofthesesamegenome
AT fuzhixi improvedassemblyandannotationofthesesamegenome