Cargando…

PacBio full-length transcriptome analysis provides new insights into transcription of chloroplast genomes

Chloroplast and mitochondrial DNA (cpDNA and mtDNA) are apart from nuclear DNA (nuDNA) in a eukaryotic cell. The transcription system of chloroplasts differs from those of mitochondria and eukaryotes. In contrast to nuDNA and animal mtDNA, the transcription of cpDNA is still not well understood, pri...

Descripción completa

Detalles Bibliográficos
Autores principales: Shi, Jinsong, Yan, Shuangyong, Li, Wenjing, Yang, Xiurong, Cui, Zhongqiu, Li, Junling, Li, Guangsheng, Li, Yuejiao, Hu, Yanping, Gao, Shan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Taylor & Francis 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10228390/
https://www.ncbi.nlm.nih.gov/pubmed/37231782
http://dx.doi.org/10.1080/15476286.2023.2214435
_version_ 1785050953311846400
author Shi, Jinsong
Yan, Shuangyong
Li, Wenjing
Yang, Xiurong
Cui, Zhongqiu
Li, Junling
Li, Guangsheng
Li, Yuejiao
Hu, Yanping
Gao, Shan
author_facet Shi, Jinsong
Yan, Shuangyong
Li, Wenjing
Yang, Xiurong
Cui, Zhongqiu
Li, Junling
Li, Guangsheng
Li, Yuejiao
Hu, Yanping
Gao, Shan
author_sort Shi, Jinsong
collection PubMed
description Chloroplast and mitochondrial DNA (cpDNA and mtDNA) are apart from nuclear DNA (nuDNA) in a eukaryotic cell. The transcription system of chloroplasts differs from those of mitochondria and eukaryotes. In contrast to nuDNA and animal mtDNA, the transcription of cpDNA is still not well understood, primarily due to the unresolved identification of transcription initiation sites (TISs) and transcription termination sites (TTSs) on the genome scale. In the present study, we characterized the transcription of chloroplast (cp) genes with greater accuracy and comprehensive information using PacBio full-length transcriptome data from Arabidopsis thaliana. The major findings included the discovery of four types of artifacts, the validation and correction of cp gene annotations, the exact identification of TISs that start with G, and the discovery of polyA-like sites as TTSs. Notably, we proposed a new model to explain cp transcription initiation and termination at the whole-genome level. Four types of artifacts, degraded RNAs and splicing intermediates deserve the attention from researchers working with PacBio full-length transcriptome data, as these contaminant sequences can lead to incorrect downstream analysis. Cp transcription initiates at multiple promoters and terminates at polyA-like sites. Our study provides new insights into cp transcription and new clues to study the evolution of promoters, TISs, TTSs and polyA tails of eukaryotic genes.
format Online
Article
Text
id pubmed-10228390
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Taylor & Francis
record_format MEDLINE/PubMed
spelling pubmed-102283902023-05-31 PacBio full-length transcriptome analysis provides new insights into transcription of chloroplast genomes Shi, Jinsong Yan, Shuangyong Li, Wenjing Yang, Xiurong Cui, Zhongqiu Li, Junling Li, Guangsheng Li, Yuejiao Hu, Yanping Gao, Shan RNA Biol Research Paper Chloroplast and mitochondrial DNA (cpDNA and mtDNA) are apart from nuclear DNA (nuDNA) in a eukaryotic cell. The transcription system of chloroplasts differs from those of mitochondria and eukaryotes. In contrast to nuDNA and animal mtDNA, the transcription of cpDNA is still not well understood, primarily due to the unresolved identification of transcription initiation sites (TISs) and transcription termination sites (TTSs) on the genome scale. In the present study, we characterized the transcription of chloroplast (cp) genes with greater accuracy and comprehensive information using PacBio full-length transcriptome data from Arabidopsis thaliana. The major findings included the discovery of four types of artifacts, the validation and correction of cp gene annotations, the exact identification of TISs that start with G, and the discovery of polyA-like sites as TTSs. Notably, we proposed a new model to explain cp transcription initiation and termination at the whole-genome level. Four types of artifacts, degraded RNAs and splicing intermediates deserve the attention from researchers working with PacBio full-length transcriptome data, as these contaminant sequences can lead to incorrect downstream analysis. Cp transcription initiates at multiple promoters and terminates at polyA-like sites. Our study provides new insights into cp transcription and new clues to study the evolution of promoters, TISs, TTSs and polyA tails of eukaryotic genes. Taylor & Francis 2023-05-25 /pmc/articles/PMC10228390/ /pubmed/37231782 http://dx.doi.org/10.1080/15476286.2023.2214435 Text en © 2023 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The terms on which this article has been published allow the posting of the Accepted Manuscript in a repository by the author(s) or with their consent.
spellingShingle Research Paper
Shi, Jinsong
Yan, Shuangyong
Li, Wenjing
Yang, Xiurong
Cui, Zhongqiu
Li, Junling
Li, Guangsheng
Li, Yuejiao
Hu, Yanping
Gao, Shan
PacBio full-length transcriptome analysis provides new insights into transcription of chloroplast genomes
title PacBio full-length transcriptome analysis provides new insights into transcription of chloroplast genomes
title_full PacBio full-length transcriptome analysis provides new insights into transcription of chloroplast genomes
title_fullStr PacBio full-length transcriptome analysis provides new insights into transcription of chloroplast genomes
title_full_unstemmed PacBio full-length transcriptome analysis provides new insights into transcription of chloroplast genomes
title_short PacBio full-length transcriptome analysis provides new insights into transcription of chloroplast genomes
title_sort pacbio full-length transcriptome analysis provides new insights into transcription of chloroplast genomes
topic Research Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10228390/
https://www.ncbi.nlm.nih.gov/pubmed/37231782
http://dx.doi.org/10.1080/15476286.2023.2214435
work_keys_str_mv AT shijinsong pacbiofulllengthtranscriptomeanalysisprovidesnewinsightsintotranscriptionofchloroplastgenomes
AT yanshuangyong pacbiofulllengthtranscriptomeanalysisprovidesnewinsightsintotranscriptionofchloroplastgenomes
AT liwenjing pacbiofulllengthtranscriptomeanalysisprovidesnewinsightsintotranscriptionofchloroplastgenomes
AT yangxiurong pacbiofulllengthtranscriptomeanalysisprovidesnewinsightsintotranscriptionofchloroplastgenomes
AT cuizhongqiu pacbiofulllengthtranscriptomeanalysisprovidesnewinsightsintotranscriptionofchloroplastgenomes
AT lijunling pacbiofulllengthtranscriptomeanalysisprovidesnewinsightsintotranscriptionofchloroplastgenomes
AT liguangsheng pacbiofulllengthtranscriptomeanalysisprovidesnewinsightsintotranscriptionofchloroplastgenomes
AT liyuejiao pacbiofulllengthtranscriptomeanalysisprovidesnewinsightsintotranscriptionofchloroplastgenomes
AT huyanping pacbiofulllengthtranscriptomeanalysisprovidesnewinsightsintotranscriptionofchloroplastgenomes
AT gaoshan pacbiofulllengthtranscriptomeanalysisprovidesnewinsightsintotranscriptionofchloroplastgenomes