Cargando…
A chromosome-scale genome assembly of cucumber (Cucumis sativus L.)
BACKGROUND: Accurate and complete reference genome assemblies are fundamental for biological research. Cucumber is an important vegetable crop and model system for sex determination and vascular biology. Low-coverage Sanger sequences and high-coverage short Illumina sequences have been used to assem...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6582320/ https://www.ncbi.nlm.nih.gov/pubmed/31216035 http://dx.doi.org/10.1093/gigascience/giz072 |
_version_ | 1783428299142201344 |
---|---|
author | Li, Qing Li, Hongbo Huang, Wu Xu, Yuanchao Zhou, Qian Wang, Shenhao Ruan, Jue Huang, Sanwen Zhang, Zhonghua |
author_facet | Li, Qing Li, Hongbo Huang, Wu Xu, Yuanchao Zhou, Qian Wang, Shenhao Ruan, Jue Huang, Sanwen Zhang, Zhonghua |
author_sort | Li, Qing |
collection | PubMed |
description | BACKGROUND: Accurate and complete reference genome assemblies are fundamental for biological research. Cucumber is an important vegetable crop and model system for sex determination and vascular biology. Low-coverage Sanger sequences and high-coverage short Illumina sequences have been used to assemble draft cucumber genomes, but the incompleteness and low quality of these genomes limit their use in comparative genomics and genetic research. A high-quality and complete cucumber genome assembly is therefore essential. FINDINGS: We assembled single-molecule real-time (SMRT) long reads to generate an improved cucumber reference genome. This version contains 174 contigs with a total length of 226.2 Mb and an N50 of 8.9 Mb, and provides 29.0 Mb more sequence data than previous versions. Using 10X Genomics and high-throughput chromosome conformation capture (Hi-C) data, 89 contigs (∼211.0 Mb) were directly linked into 7 pseudo-chromosome sequences. The newly assembled regions show much higher guanine-cytosine or adenine-thymine content than found previously, which is likely to have been inaccessible to Illumina sequencing. The new assembly contains 1,374 full-length long terminal retrotransposons and 1,078 novel genes including 239 tandemly duplicated genes. For example, we found 4 tandemly duplicated tyrosylprotein sulfotransferases, in contrast to the single copy of the gene found previously and in most other plants. CONCLUSION: This high-quality genome presents novel features of the cucumber genome and will serve as a valuable resource for genetic research in cucumber and plant comparative genomics. |
format | Online Article Text |
id | pubmed-6582320 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-65823202019-06-21 A chromosome-scale genome assembly of cucumber (Cucumis sativus L.) Li, Qing Li, Hongbo Huang, Wu Xu, Yuanchao Zhou, Qian Wang, Shenhao Ruan, Jue Huang, Sanwen Zhang, Zhonghua Gigascience Data Note BACKGROUND: Accurate and complete reference genome assemblies are fundamental for biological research. Cucumber is an important vegetable crop and model system for sex determination and vascular biology. Low-coverage Sanger sequences and high-coverage short Illumina sequences have been used to assemble draft cucumber genomes, but the incompleteness and low quality of these genomes limit their use in comparative genomics and genetic research. A high-quality and complete cucumber genome assembly is therefore essential. FINDINGS: We assembled single-molecule real-time (SMRT) long reads to generate an improved cucumber reference genome. This version contains 174 contigs with a total length of 226.2 Mb and an N50 of 8.9 Mb, and provides 29.0 Mb more sequence data than previous versions. Using 10X Genomics and high-throughput chromosome conformation capture (Hi-C) data, 89 contigs (∼211.0 Mb) were directly linked into 7 pseudo-chromosome sequences. The newly assembled regions show much higher guanine-cytosine or adenine-thymine content than found previously, which is likely to have been inaccessible to Illumina sequencing. The new assembly contains 1,374 full-length long terminal retrotransposons and 1,078 novel genes including 239 tandemly duplicated genes. For example, we found 4 tandemly duplicated tyrosylprotein sulfotransferases, in contrast to the single copy of the gene found previously and in most other plants. CONCLUSION: This high-quality genome presents novel features of the cucumber genome and will serve as a valuable resource for genetic research in cucumber and plant comparative genomics. Oxford University Press 2019-06-18 /pmc/articles/PMC6582320/ /pubmed/31216035 http://dx.doi.org/10.1093/gigascience/giz072 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Data Note Li, Qing Li, Hongbo Huang, Wu Xu, Yuanchao Zhou, Qian Wang, Shenhao Ruan, Jue Huang, Sanwen Zhang, Zhonghua A chromosome-scale genome assembly of cucumber (Cucumis sativus L.) |
title | A chromosome-scale genome assembly of cucumber (Cucumis sativus L.) |
title_full | A chromosome-scale genome assembly of cucumber (Cucumis sativus L.) |
title_fullStr | A chromosome-scale genome assembly of cucumber (Cucumis sativus L.) |
title_full_unstemmed | A chromosome-scale genome assembly of cucumber (Cucumis sativus L.) |
title_short | A chromosome-scale genome assembly of cucumber (Cucumis sativus L.) |
title_sort | chromosome-scale genome assembly of cucumber (cucumis sativus l.) |
topic | Data Note |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6582320/ https://www.ncbi.nlm.nih.gov/pubmed/31216035 http://dx.doi.org/10.1093/gigascience/giz072 |
work_keys_str_mv | AT liqing achromosomescalegenomeassemblyofcucumbercucumissativusl AT lihongbo achromosomescalegenomeassemblyofcucumbercucumissativusl AT huangwu achromosomescalegenomeassemblyofcucumbercucumissativusl AT xuyuanchao achromosomescalegenomeassemblyofcucumbercucumissativusl AT zhouqian achromosomescalegenomeassemblyofcucumbercucumissativusl AT wangshenhao achromosomescalegenomeassemblyofcucumbercucumissativusl AT ruanjue achromosomescalegenomeassemblyofcucumbercucumissativusl AT huangsanwen achromosomescalegenomeassemblyofcucumbercucumissativusl AT zhangzhonghua achromosomescalegenomeassemblyofcucumbercucumissativusl AT liqing chromosomescalegenomeassemblyofcucumbercucumissativusl AT lihongbo chromosomescalegenomeassemblyofcucumbercucumissativusl AT huangwu chromosomescalegenomeassemblyofcucumbercucumissativusl AT xuyuanchao chromosomescalegenomeassemblyofcucumbercucumissativusl AT zhouqian chromosomescalegenomeassemblyofcucumbercucumissativusl AT wangshenhao chromosomescalegenomeassemblyofcucumbercucumissativusl AT ruanjue chromosomescalegenomeassemblyofcucumbercucumissativusl AT huangsanwen chromosomescalegenomeassemblyofcucumbercucumissativusl AT zhangzhonghua chromosomescalegenomeassemblyofcucumbercucumissativusl |