Chloroplast Genomes of Two Species of Cypripedium: Expanded Genome Size and Proliferation of AT-Biased Repeat Sequences

The size of the chloroplast genome (plastome) of autotrophic angiosperms is generally conserved. However, the chloroplast genomes of some lineages are greatly expanded, which may render assembling these genomes from short read sequencing data more challenging. Here, we present the sequencing, assemb...


Bibliographic Details
Main authors: Guo, Yan-Yan; Yang, Jia-Xing; Li, Hong-Kun; Zhao, Hu-Sheng
Format: Online Article Text
Language: English
Published: Frontiers Media S.A. 2021
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7900419/
https://www.ncbi.nlm.nih.gov/pubmed/33633763
http://dx.doi.org/10.3389/fpls.2021.609729
_version_ 1783654207028461568
author Guo, Yan-Yan
Yang, Jia-Xing
Li, Hong-Kun
Zhao, Hu-Sheng
author_facet Guo, Yan-Yan
Yang, Jia-Xing
Li, Hong-Kun
Zhao, Hu-Sheng
author_sort Guo, Yan-Yan
collection PubMed
description The size of the chloroplast genome (plastome) of autotrophic angiosperms is generally conserved. However, the chloroplast genomes of some lineages are greatly expanded, which may render assembling these genomes from short read sequencing data more challenging. Here, we present the sequencing, assembly, and annotation of the chloroplast genomes of Cypripedium tibeticum and Cypripedium subtropicum. We de novo assembled the chloroplast genomes of the two species with a combination of short-read Illumina data and long-read PacBio data. The plastomes of the two species are characterized by expanded genome size, proliferated AT-rich repeat sequences, low GC content and gene density, as well as low substitution rates of the coding genes. The plastomes of C. tibeticum (197,815 bp) and C. subtropicum (212,668 bp) are substantially larger than those of the three species sequenced in previous studies. The plastome of C. subtropicum is the longest one of Orchidaceae to date. Despite the increase in genome size, the gene order and gene number of the plastomes are conserved, with the exception of an ∼75 kb large inversion in the large single copy (LSC) region shared by the two species. The most striking is the record-setting low GC content in C. subtropicum (28.2%). Moreover, the plastome expansion of the two species is strongly correlated with the proliferation of AT-biased non-coding regions: the non-coding content of C. subtropicum is in excess of 57%. The genus provides a typical example of plastome expansion induced by the expansion of non-coding regions. Considering the pros and cons of different sequencing technologies, we recommend hybrid assembly based on long and short reads applied to the sequencing of plastomes with AT-biased base composition.
format Online
Article
Text
id pubmed-7900419
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-79004192021-02-24 Chloroplast Genomes of Two Species of Cypripedium: Expanded Genome Size and Proliferation of AT-Biased Repeat Sequences Guo, Yan-Yan Yang, Jia-Xing Li, Hong-Kun Zhao, Hu-Sheng Front Plant Sci Plant Science The size of the chloroplast genome (plastome) of autotrophic angiosperms is generally conserved. However, the chloroplast genomes of some lineages are greatly expanded, which may render assembling these genomes from short read sequencing data more challenging. Here, we present the sequencing, assembly, and annotation of the chloroplast genomes of Cypripedium tibeticum and Cypripedium subtropicum. We de novo assembled the chloroplast genomes of the two species with a combination of short-read Illumina data and long-read PacBio data. The plastomes of the two species are characterized by expanded genome size, proliferated AT-rich repeat sequences, low GC content and gene density, as well as low substitution rates of the coding genes. The plastomes of C. tibeticum (197,815 bp) and C. subtropicum (212,668 bp) are substantially larger than those of the three species sequenced in previous studies. The plastome of C. subtropicum is the longest one of Orchidaceae to date. Despite the increase in genome size, the gene order and gene number of the plastomes are conserved, with the exception of an ∼75 kb large inversion in the large single copy (LSC) region shared by the two species. The most striking is the record-setting low GC content in C. subtropicum (28.2%). Moreover, the plastome expansion of the two species is strongly correlated with the proliferation of AT-biased non-coding regions: the non-coding content of C. subtropicum is in excess of 57%. The genus provides a typical example of plastome expansion induced by the expansion of non-coding regions. 
Considering the pros and cons of different sequencing technologies, we recommend hybrid assembly based on long and short reads applied to the sequencing of plastomes with AT-biased base composition. Frontiers Media S.A. 2021-02-09 /pmc/articles/PMC7900419/ /pubmed/33633763 http://dx.doi.org/10.3389/fpls.2021.609729 Text en Copyright © 2021 Guo, Yang, Li and Zhao. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Plant Science
Guo, Yan-Yan
Yang, Jia-Xing
Li, Hong-Kun
Zhao, Hu-Sheng
Chloroplast Genomes of Two Species of Cypripedium: Expanded Genome Size and Proliferation of AT-Biased Repeat Sequences
title Chloroplast Genomes of Two Species of Cypripedium: Expanded Genome Size and Proliferation of AT-Biased Repeat Sequences
title_sort chloroplast genomes of two species of cypripedium: expanded genome size and proliferation of at-biased repeat sequences
topic Plant Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7900419/
https://www.ncbi.nlm.nih.gov/pubmed/33633763
http://dx.doi.org/10.3389/fpls.2021.609729
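
The description above reports GC content as a percentage (28.2% for C. subtropicum). As a minimal sketch of how such a figure is conventionally computed, the Python below counts G and C bases as a fraction of all unambiguous bases. The function name `gc_content` and the toy sequence are assumptions made for illustration; they are not taken from the article, and this is not presented as the authors' own pipeline.

```python
from collections import Counter


def gc_content(seq: str) -> float:
    """Return the fraction of G and C among unambiguous bases (A, C, G, T).

    Ambiguity codes such as N are ignored, which is one common convention;
    other tools may count them in the denominator.
    """
    counts = Counter(seq.upper())
    gc = counts["G"] + counts["C"]
    total = gc + counts["A"] + counts["T"]
    if total == 0:
        raise ValueError("sequence contains no A/C/G/T bases")
    return gc / total


if __name__ == "__main__":
    # Toy sequence invented for illustration; not from either plastome.
    toy = "ATATTAGCATTAATATGCAT"
    print(f"GC content: {gc_content(toy):.1%}")
```

For a real plastome, the sequence string would be read from the assembled FASTA file rather than typed inline.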