Cargando…
Chloroplast Genomes of Two Species of Cypripedium: Expanded Genome Size and Proliferation of AT-Biased Repeat Sequences
The size of the chloroplast genome (plastome) of autotrophic angiosperms is generally conserved. However, the chloroplast genomes of some lineages are greatly expanded, which may render assembling these genomes from short read sequencing data more challenging. Here, we present the sequencing, assemb...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7900419/ https://www.ncbi.nlm.nih.gov/pubmed/33633763 http://dx.doi.org/10.3389/fpls.2021.609729 |
_version_ | 1783654207028461568 |
---|---|
author | Guo, Yan-Yan Yang, Jia-Xing Li, Hong-Kun Zhao, Hu-Sheng |
author_facet | Guo, Yan-Yan Yang, Jia-Xing Li, Hong-Kun Zhao, Hu-Sheng |
author_sort | Guo, Yan-Yan |
collection | PubMed |
description | The size of the chloroplast genome (plastome) of autotrophic angiosperms is generally conserved. However, the chloroplast genomes of some lineages are greatly expanded, which may render assembling these genomes from short read sequencing data more challenging. Here, we present the sequencing, assembly, and annotation of the chloroplast genomes of Cypripedium tibeticum and Cypripedium subtropicum. We de novo assembled the chloroplast genomes of the two species with a combination of short-read Illumina data and long-read PacBio data. The plastomes of the two species are characterized by expanded genome size, proliferated AT-rich repeat sequences, low GC content and gene density, as well as low substitution rates of the coding genes. The plastomes of C. tibeticum (197,815 bp) and C. subtropicum (212,668 bp) are substantially larger than those of the three species sequenced in previous studies. The plastome of C. subtropicum is the longest one of Orchidaceae to date. Despite the increase in genome size, the gene order and gene number of the plastomes are conserved, with the exception of an ∼75 kb large inversion in the large single copy (LSC) region shared by the two species. The most striking is the record-setting low GC content in C. subtropicum (28.2%). Moreover, the plastome expansion of the two species is strongly correlated with the proliferation of AT-biased non-coding regions: the non-coding content of C. subtropicum is in excess of 57%. The genus provides a typical example of plastome expansion induced by the expansion of non-coding regions. Considering the pros and cons of different sequencing technologies, we recommend hybrid assembly based on long and short reads applied to the sequencing of plastomes with AT-biased base composition. |
format | Online Article Text |
id | pubmed-7900419 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-79004192021-02-24 Chloroplast Genomes of Two Species of Cypripedium: Expanded Genome Size and Proliferation of AT-Biased Repeat Sequences Guo, Yan-Yan Yang, Jia-Xing Li, Hong-Kun Zhao, Hu-Sheng Front Plant Sci Plant Science The size of the chloroplast genome (plastome) of autotrophic angiosperms is generally conserved. However, the chloroplast genomes of some lineages are greatly expanded, which may render assembling these genomes from short read sequencing data more challenging. Here, we present the sequencing, assembly, and annotation of the chloroplast genomes of Cypripedium tibeticum and Cypripedium subtropicum. We de novo assembled the chloroplast genomes of the two species with a combination of short-read Illumina data and long-read PacBio data. The plastomes of the two species are characterized by expanded genome size, proliferated AT-rich repeat sequences, low GC content and gene density, as well as low substitution rates of the coding genes. The plastomes of C. tibeticum (197,815 bp) and C. subtropicum (212,668 bp) are substantially larger than those of the three species sequenced in previous studies. The plastome of C. subtropicum is the longest one of Orchidaceae to date. Despite the increase in genome size, the gene order and gene number of the plastomes are conserved, with the exception of an ∼75 kb large inversion in the large single copy (LSC) region shared by the two species. The most striking is the record-setting low GC content in C. subtropicum (28.2%). Moreover, the plastome expansion of the two species is strongly correlated with the proliferation of AT-biased non-coding regions: the non-coding content of C. subtropicum is in excess of 57%. The genus provides a typical example of plastome expansion induced by the expansion of non-coding regions. Considering the pros and cons of different sequencing technologies, we recommend hybrid assembly based on long and short reads applied to the sequencing of plastomes with AT-biased base composition. Frontiers Media S.A. 2021-02-09 /pmc/articles/PMC7900419/ /pubmed/33633763 http://dx.doi.org/10.3389/fpls.2021.609729 Text en Copyright © 2021 Guo, Yang, Li and Zhao. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Plant Science Guo, Yan-Yan Yang, Jia-Xing Li, Hong-Kun Zhao, Hu-Sheng Chloroplast Genomes of Two Species of Cypripedium: Expanded Genome Size and Proliferation of AT-Biased Repeat Sequences |
title | Chloroplast Genomes of Two Species of Cypripedium: Expanded Genome Size and Proliferation of AT-Biased Repeat Sequences |
title_full | Chloroplast Genomes of Two Species of Cypripedium: Expanded Genome Size and Proliferation of AT-Biased Repeat Sequences |
title_fullStr | Chloroplast Genomes of Two Species of Cypripedium: Expanded Genome Size and Proliferation of AT-Biased Repeat Sequences |
title_full_unstemmed | Chloroplast Genomes of Two Species of Cypripedium: Expanded Genome Size and Proliferation of AT-Biased Repeat Sequences |
title_short | Chloroplast Genomes of Two Species of Cypripedium: Expanded Genome Size and Proliferation of AT-Biased Repeat Sequences |
title_sort | chloroplast genomes of two species of cypripedium: expanded genome size and proliferation of at-biased repeat sequences |
topic | Plant Science |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7900419/ https://www.ncbi.nlm.nih.gov/pubmed/33633763 http://dx.doi.org/10.3389/fpls.2021.609729 |
work_keys_str_mv | AT guoyanyan chloroplastgenomesoftwospeciesofcypripediumexpandedgenomesizeandproliferationofatbiasedrepeatsequences AT yangjiaxing chloroplastgenomesoftwospeciesofcypripediumexpandedgenomesizeandproliferationofatbiasedrepeatsequences AT lihongkun chloroplastgenomesoftwospeciesofcypripediumexpandedgenomesizeandproliferationofatbiasedrepeatsequences AT zhaohusheng chloroplastgenomesoftwospeciesofcypripediumexpandedgenomesizeandproliferationofatbiasedrepeatsequences |