Cargando…

A chromosome-scale genome assembly of cucumber (Cucumis sativus L.)

BACKGROUND: Accurate and complete reference genome assemblies are fundamental for biological research. Cucumber is an important vegetable crop and model system for sex determination and vascular biology. Low-coverage Sanger sequences and high-coverage short Illumina sequences have been used to assem...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Qing, Li, Hongbo, Huang, Wu, Xu, Yuanchao, Zhou, Qian, Wang, Shenhao, Ruan, Jue, Huang, Sanwen, Zhang, Zhonghua
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6582320/
https://www.ncbi.nlm.nih.gov/pubmed/31216035
http://dx.doi.org/10.1093/gigascience/giz072
_version_ 1783428299142201344
author Li, Qing
Li, Hongbo
Huang, Wu
Xu, Yuanchao
Zhou, Qian
Wang, Shenhao
Ruan, Jue
Huang, Sanwen
Zhang, Zhonghua
author_facet Li, Qing
Li, Hongbo
Huang, Wu
Xu, Yuanchao
Zhou, Qian
Wang, Shenhao
Ruan, Jue
Huang, Sanwen
Zhang, Zhonghua
author_sort Li, Qing
collection PubMed
description BACKGROUND: Accurate and complete reference genome assemblies are fundamental for biological research. Cucumber is an important vegetable crop and model system for sex determination and vascular biology. Low-coverage Sanger sequences and high-coverage short Illumina sequences have been used to assemble draft cucumber genomes, but the incompleteness and low quality of these genomes limit their use in comparative genomics and genetic research. A high-quality and complete cucumber genome assembly is therefore essential. FINDINGS: We assembled single-molecule real-time (SMRT) long reads to generate an improved cucumber reference genome. This version contains 174 contigs with a total length of 226.2 Mb and an N50 of 8.9 Mb, and provides 29.0 Mb more sequence data than previous versions. Using 10X Genomics and high-throughput chromosome conformation capture (Hi-C) data, 89 contigs (∼211.0 Mb) were directly linked into 7 pseudo-chromosome sequences. The newly assembled regions show much higher guanine-cytosine or adenine-thymine content than found previously, which is likely to have been inaccessible to Illumina sequencing. The new assembly contains 1,374 full-length long terminal retrotransposons and 1,078 novel genes including 239 tandemly duplicated genes. For example, we found 4 tandemly duplicated tyrosylprotein sulfotransferases, in contrast to the single copy of the gene found previously and in most other plants. CONCLUSION: This high-quality genome presents novel features of the cucumber genome and will serve as a valuable resource for genetic research in cucumber and plant comparative genomics.
format Online
Article
Text
id pubmed-6582320
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-65823202019-06-21 A chromosome-scale genome assembly of cucumber (Cucumis sativus L.) Li, Qing Li, Hongbo Huang, Wu Xu, Yuanchao Zhou, Qian Wang, Shenhao Ruan, Jue Huang, Sanwen Zhang, Zhonghua Gigascience Data Note BACKGROUND: Accurate and complete reference genome assemblies are fundamental for biological research. Cucumber is an important vegetable crop and model system for sex determination and vascular biology. Low-coverage Sanger sequences and high-coverage short Illumina sequences have been used to assemble draft cucumber genomes, but the incompleteness and low quality of these genomes limit their use in comparative genomics and genetic research. A high-quality and complete cucumber genome assembly is therefore essential. FINDINGS: We assembled single-molecule real-time (SMRT) long reads to generate an improved cucumber reference genome. This version contains 174 contigs with a total length of 226.2 Mb and an N50 of 8.9 Mb, and provides 29.0 Mb more sequence data than previous versions. Using 10X Genomics and high-throughput chromosome conformation capture (Hi-C) data, 89 contigs (∼211.0 Mb) were directly linked into 7 pseudo-chromosome sequences. The newly assembled regions show much higher guanine-cytosine or adenine-thymine content than found previously, which is likely to have been inaccessible to Illumina sequencing. The new assembly contains 1,374 full-length long terminal retrotransposons and 1,078 novel genes including 239 tandemly duplicated genes. For example, we found 4 tandemly duplicated tyrosylprotein sulfotransferases, in contrast to the single copy of the gene found previously and in most other plants. CONCLUSION: This high-quality genome presents novel features of the cucumber genome and will serve as a valuable resource for genetic research in cucumber and plant comparative genomics. Oxford University Press 2019-06-18 /pmc/articles/PMC6582320/ /pubmed/31216035 http://dx.doi.org/10.1093/gigascience/giz072 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Data Note
Li, Qing
Li, Hongbo
Huang, Wu
Xu, Yuanchao
Zhou, Qian
Wang, Shenhao
Ruan, Jue
Huang, Sanwen
Zhang, Zhonghua
A chromosome-scale genome assembly of cucumber (Cucumis sativus L.)
title A chromosome-scale genome assembly of cucumber (Cucumis sativus L.)
title_full A chromosome-scale genome assembly of cucumber (Cucumis sativus L.)
title_fullStr A chromosome-scale genome assembly of cucumber (Cucumis sativus L.)
title_full_unstemmed A chromosome-scale genome assembly of cucumber (Cucumis sativus L.)
title_short A chromosome-scale genome assembly of cucumber (Cucumis sativus L.)
title_sort chromosome-scale genome assembly of cucumber (cucumis sativus l.)
topic Data Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6582320/
https://www.ncbi.nlm.nih.gov/pubmed/31216035
http://dx.doi.org/10.1093/gigascience/giz072
work_keys_str_mv AT liqing achromosomescalegenomeassemblyofcucumbercucumissativusl
AT lihongbo achromosomescalegenomeassemblyofcucumbercucumissativusl
AT huangwu achromosomescalegenomeassemblyofcucumbercucumissativusl
AT xuyuanchao achromosomescalegenomeassemblyofcucumbercucumissativusl
AT zhouqian achromosomescalegenomeassemblyofcucumbercucumissativusl
AT wangshenhao achromosomescalegenomeassemblyofcucumbercucumissativusl
AT ruanjue achromosomescalegenomeassemblyofcucumbercucumissativusl
AT huangsanwen achromosomescalegenomeassemblyofcucumbercucumissativusl
AT zhangzhonghua achromosomescalegenomeassemblyofcucumbercucumissativusl
AT liqing chromosomescalegenomeassemblyofcucumbercucumissativusl
AT lihongbo chromosomescalegenomeassemblyofcucumbercucumissativusl
AT huangwu chromosomescalegenomeassemblyofcucumbercucumissativusl
AT xuyuanchao chromosomescalegenomeassemblyofcucumbercucumissativusl
AT zhouqian chromosomescalegenomeassemblyofcucumbercucumissativusl
AT wangshenhao chromosomescalegenomeassemblyofcucumbercucumissativusl
AT ruanjue chromosomescalegenomeassemblyofcucumbercucumissativusl
AT huangsanwen chromosomescalegenomeassemblyofcucumbercucumissativusl
AT zhangzhonghua chromosomescalegenomeassemblyofcucumbercucumissativusl