Cargando…

The complete chloroplast genome of greater duckweed (Spirodela polyrhiza 7498) using PacBio long reads: insights into the chloroplast evolution and transcription regulation

BACKGROUND: Duckweeds (Lemnaceae) are aquatic plants distributed all over the world. The chloroplast genome, as an efficient solar-powered reactor, is an invaluable resource to study biodiversity and to carry foreign genes. The chloroplast genome sequencing has become routine and less expensive with...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Yating, An, Dong, Li, Changsheng, Zhao, Zhixuan, Wang, Wenqin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6986005/
https://www.ncbi.nlm.nih.gov/pubmed/31992185
http://dx.doi.org/10.1186/s12864-020-6499-y
_version_ 1783491893974269952
author Zhang, Yating
An, Dong
Li, Changsheng
Zhao, Zhixuan
Wang, Wenqin
author_facet Zhang, Yating
An, Dong
Li, Changsheng
Zhao, Zhixuan
Wang, Wenqin
author_sort Zhang, Yating
collection PubMed
description BACKGROUND: Duckweeds (Lemnaceae) are aquatic plants distributed all over the world. The chloroplast genome, as an efficient solar-powered reactor, is an invaluable resource to study biodiversity and to carry foreign genes. The chloroplast genome sequencing has become routine and less expensive with the delivery of high-throughput sequencing technologies, allowing us to deeply investigate genomics and transcriptomics of duckweed organelles. RESULTS: Here, the complete chloroplast genome of Spirodela polyrhiza 7498 (SpV2) is assembled by PacBio sequencing. The length of 168,956 bp circular genome is composed of a pair of inverted repeats of 31,844 bp, a large single copy of 91,210 bp and a small single copy of 14,058 bp. Compared to the previous version (SpV1) assembled from short reads, the integrity and quality of SpV2 are improved, especially with the retrieval of two repeated fragments in ycf2 gene. There are a number of 107 unique genes, including 78 protein-coding genes, 25 tRNA genes and 4 rRNA genes. With the evidence of full-length cDNAs generated from PacBio isoform sequencing, seven genes (ycf3, clpP, atpF, rpoC1, rpl2, rps12 and ndhA) are detected to contain type-II introns. The ndhA intron has 50% more sequence divergence than the species-barcoding marker of atpF-atpH, showing the potential power to discriminate close species. A number of 37 RNA editing sites are recognized to have cytosine (C) to uracil (U) substitutions, eight of which are newly defined including six from the intergenic regions and two from the coding sequences of rpoC2 and ndhA genes. In addition, nine operon classes are identified using transcriptomic data. It is found that the operons contain multiple subunit genes encoding the same functional complexes comprising of ATP synthase, photosynthesis system, ribosomal proteins, et.al., which could be simultaneously transcribed and coordinately translated in response to the cell stimuli. CONCLUSIONS: The understanding of the chloroplast genomics and the transcriptomics of S.polyrhiza would greatly facilitate the study of phylogenetic evolution and the application of genetically engineering duckweeds.
format Online
Article
Text
id pubmed-6986005
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-69860052020-01-30 The complete chloroplast genome of greater duckweed (Spirodela polyrhiza 7498) using PacBio long reads: insights into the chloroplast evolution and transcription regulation Zhang, Yating An, Dong Li, Changsheng Zhao, Zhixuan Wang, Wenqin BMC Genomics Research Article BACKGROUND: Duckweeds (Lemnaceae) are aquatic plants distributed all over the world. The chloroplast genome, as an efficient solar-powered reactor, is an invaluable resource to study biodiversity and to carry foreign genes. The chloroplast genome sequencing has become routine and less expensive with the delivery of high-throughput sequencing technologies, allowing us to deeply investigate genomics and transcriptomics of duckweed organelles. RESULTS: Here, the complete chloroplast genome of Spirodela polyrhiza 7498 (SpV2) is assembled by PacBio sequencing. The length of 168,956 bp circular genome is composed of a pair of inverted repeats of 31,844 bp, a large single copy of 91,210 bp and a small single copy of 14,058 bp. Compared to the previous version (SpV1) assembled from short reads, the integrity and quality of SpV2 are improved, especially with the retrieval of two repeated fragments in ycf2 gene. There are a number of 107 unique genes, including 78 protein-coding genes, 25 tRNA genes and 4 rRNA genes. With the evidence of full-length cDNAs generated from PacBio isoform sequencing, seven genes (ycf3, clpP, atpF, rpoC1, rpl2, rps12 and ndhA) are detected to contain type-II introns. The ndhA intron has 50% more sequence divergence than the species-barcoding marker of atpF-atpH, showing the potential power to discriminate close species. A number of 37 RNA editing sites are recognized to have cytosine (C) to uracil (U) substitutions, eight of which are newly defined including six from the intergenic regions and two from the coding sequences of rpoC2 and ndhA genes. In addition, nine operon classes are identified using transcriptomic data. It is found that the operons contain multiple subunit genes encoding the same functional complexes comprising of ATP synthase, photosynthesis system, ribosomal proteins, et.al., which could be simultaneously transcribed and coordinately translated in response to the cell stimuli. CONCLUSIONS: The understanding of the chloroplast genomics and the transcriptomics of S.polyrhiza would greatly facilitate the study of phylogenetic evolution and the application of genetically engineering duckweeds. BioMed Central 2020-01-28 /pmc/articles/PMC6986005/ /pubmed/31992185 http://dx.doi.org/10.1186/s12864-020-6499-y Text en © The Author(s). 2020 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Zhang, Yating
An, Dong
Li, Changsheng
Zhao, Zhixuan
Wang, Wenqin
The complete chloroplast genome of greater duckweed (Spirodela polyrhiza 7498) using PacBio long reads: insights into the chloroplast evolution and transcription regulation
title The complete chloroplast genome of greater duckweed (Spirodela polyrhiza 7498) using PacBio long reads: insights into the chloroplast evolution and transcription regulation
title_full The complete chloroplast genome of greater duckweed (Spirodela polyrhiza 7498) using PacBio long reads: insights into the chloroplast evolution and transcription regulation
title_fullStr The complete chloroplast genome of greater duckweed (Spirodela polyrhiza 7498) using PacBio long reads: insights into the chloroplast evolution and transcription regulation
title_full_unstemmed The complete chloroplast genome of greater duckweed (Spirodela polyrhiza 7498) using PacBio long reads: insights into the chloroplast evolution and transcription regulation
title_short The complete chloroplast genome of greater duckweed (Spirodela polyrhiza 7498) using PacBio long reads: insights into the chloroplast evolution and transcription regulation
title_sort complete chloroplast genome of greater duckweed (spirodela polyrhiza 7498) using pacbio long reads: insights into the chloroplast evolution and transcription regulation
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6986005/
https://www.ncbi.nlm.nih.gov/pubmed/31992185
http://dx.doi.org/10.1186/s12864-020-6499-y
work_keys_str_mv AT zhangyating thecompletechloroplastgenomeofgreaterduckweedspirodelapolyrhiza7498usingpacbiolongreadsinsightsintothechloroplastevolutionandtranscriptionregulation
AT andong thecompletechloroplastgenomeofgreaterduckweedspirodelapolyrhiza7498usingpacbiolongreadsinsightsintothechloroplastevolutionandtranscriptionregulation
AT lichangsheng thecompletechloroplastgenomeofgreaterduckweedspirodelapolyrhiza7498usingpacbiolongreadsinsightsintothechloroplastevolutionandtranscriptionregulation
AT zhaozhixuan thecompletechloroplastgenomeofgreaterduckweedspirodelapolyrhiza7498usingpacbiolongreadsinsightsintothechloroplastevolutionandtranscriptionregulation
AT wangwenqin thecompletechloroplastgenomeofgreaterduckweedspirodelapolyrhiza7498usingpacbiolongreadsinsightsintothechloroplastevolutionandtranscriptionregulation
AT zhangyating completechloroplastgenomeofgreaterduckweedspirodelapolyrhiza7498usingpacbiolongreadsinsightsintothechloroplastevolutionandtranscriptionregulation
AT andong completechloroplastgenomeofgreaterduckweedspirodelapolyrhiza7498usingpacbiolongreadsinsightsintothechloroplastevolutionandtranscriptionregulation
AT lichangsheng completechloroplastgenomeofgreaterduckweedspirodelapolyrhiza7498usingpacbiolongreadsinsightsintothechloroplastevolutionandtranscriptionregulation
AT zhaozhixuan completechloroplastgenomeofgreaterduckweedspirodelapolyrhiza7498usingpacbiolongreadsinsightsintothechloroplastevolutionandtranscriptionregulation
AT wangwenqin completechloroplastgenomeofgreaterduckweedspirodelapolyrhiza7498usingpacbiolongreadsinsightsintothechloroplastevolutionandtranscriptionregulation