Revealing the novel complexity of plant long non-coding RNA by strand-specific and whole transcriptome sequencing for evolutionarily representative plant species

BACKGROUND: Previous studies on plant long noncoding RNAs (lncRNAs) lacked consistency and suffered from many factors like heterogeneous data sources and experimental protocols, different plant tissues, inconsistent bioinformatics pipelines, etc. For example, the sequencing of RNAs with poly(A) tail...

Bibliographic Details
Main authors: Zhu, Yan; Chen, Longxian; Hong, Xiangna; Shi, Han; Li, Xuan
Format: Online Article Text
Language: English
Published: BioMed Central 2022
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9118565/
https://www.ncbi.nlm.nih.gov/pubmed/35590257
http://dx.doi.org/10.1186/s12864-022-08602-9
author Zhu, Yan
Chen, Longxian
Hong, Xiangna
Shi, Han
Li, Xuan
collection PubMed
description BACKGROUND: Previous studies on plant long noncoding RNAs (lncRNAs) lacked consistency and suffered from factors such as heterogeneous data sources and experimental protocols, different plant tissues, and inconsistent bioinformatics pipelines. For example, the sequencing of RNAs with poly(A) tails excluded a large portion of lncRNAs without poly(A), and the use of regular RNA-sequencing techniques did not distinguish the direction of lncRNA transcripts. The current study was designed to systematically discover and analyze lncRNAs across eight evolutionarily representative plant species, using strand-specific (directional) and whole transcriptome sequencing (RiboMinus) techniques. RESULTS: A total of 39,945 lncRNAs (25,350 lincRNAs and 14,595 lncNATs) were identified, which showed molecular features of lncRNAs that are consistent across divergent plant species but different from those of mRNA. Further, transposable elements (TEs) were found to play key roles in the origination of lncRNAs, as a significantly large number of lncRNAs were found to contain TEs in the gene body and promoter region, and transcription of many lncRNAs was driven by TE promoters. The lncRNA sequences were divergent even in closely related species, and most plant lncRNAs were genus/species-specific, amid rapid turnover in evolution. Evaluated with PhastCons scores, plant lncRNAs showed a conservation level similar to that of intergenic sequences, suggesting that most lincRNAs were evolutionarily young. INDUCED BY PHOSPHATE STARVATION (IPS) is so far the only plant lncRNA group found to have conserved motifs, which may play important roles in adaptation to terrestrial life during the migration from aquatic to terrestrial environments. Most highly and specially expressed lncRNAs formed co-expression networks with coding genes, and their functions are believed to be closely related to those of their co-expressed genes. CONCLUSION: The study revealed novel features and complexity of lncRNAs in plants through systematic analysis, providing important insights into the origination and evolution of plant lncRNAs. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-022-08602-9.
format Online
Article
Text
id pubmed-9118565
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-9118565 2022-05-20 BMC Genomics Research BioMed Central 2022-05-19 /pmc/articles/PMC9118565/ /pubmed/35590257 http://dx.doi.org/10.1186/s12864-022-08602-9 Text en
© The Author(s) 2022. Open Access. This article is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
title Revealing the novel complexity of plant long non-coding RNA by strand-specific and whole transcriptome sequencing for evolutionarily representative plant species
topic Research