Cargando…

Weighted Gene Co-Expression Analyses Point to Long Non-Coding RNA Hub Genes at Different Schistosoma mansoni Life-Cycle Stages

Long non-coding RNAs (lncRNAs) (>200 nt) are expressed at levels lower than those of the protein-coding mRNAs, and in all eukaryotic model species where they have been characterized, they are transcribed from thousands of different genomic loci. In humans, some four dozen lncRNAs have been studie...

Descripción completa

Detalles Bibliográficos
Autores principales: Maciel, Lucas F., Morales-Vicente, David A., Silveira, Gilbert O., Ribeiro, Raphael O., Olberg, Giovanna G. O., Pires, David S., Amaral, Murilo S., Verjovski-Almeida, Sergio
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6752179/
https://www.ncbi.nlm.nih.gov/pubmed/31572441
http://dx.doi.org/10.3389/fgene.2019.00823
_version_ 1783452746870947840
author Maciel, Lucas F.
Morales-Vicente, David A.
Silveira, Gilbert O.
Ribeiro, Raphael O.
Olberg, Giovanna G. O.
Pires, David S.
Amaral, Murilo S.
Verjovski-Almeida, Sergio
author_facet Maciel, Lucas F.
Morales-Vicente, David A.
Silveira, Gilbert O.
Ribeiro, Raphael O.
Olberg, Giovanna G. O.
Pires, David S.
Amaral, Murilo S.
Verjovski-Almeida, Sergio
author_sort Maciel, Lucas F.
collection PubMed
description Long non-coding RNAs (lncRNAs) (>200 nt) are expressed at levels lower than those of the protein-coding mRNAs, and in all eukaryotic model species where they have been characterized, they are transcribed from thousands of different genomic loci. In humans, some four dozen lncRNAs have been studied in detail, and they have been shown to play important roles in transcriptional regulation, acting in conjunction with transcription factors and epigenetic marks to modulate the tissue-type specific programs of transcriptional gene activation and repression. In Schistosoma mansoni, around 10,000 lncRNAs have been identified in previous works. However, the limited number of RNA-sequencing (RNA-seq) libraries that had been previously assessed, together with the use of old and incomplete versions of the S. mansoni genome and protein-coding transcriptome annotations, have hampered the identification of all lncRNAs expressed in the parasite. Here we have used 633 publicly available S. mansoni RNA-seq libraries from whole worms at different stages (n = 121), from isolated tissues (n = 24), from cell-populations (n = 81), and from single-cells (n = 407). We have assembled a set of 16,583 lncRNA transcripts originated from 10,024 genes, of which 11,022 are novel S. mansoni lncRNA transcripts, whereas the remaining 5,561 transcripts comprise 120 lncRNAs that are identical to and 5,441 lncRNAs that have gene overlap with S. mansoni lncRNAs already reported in previous works. Most importantly, our more stringent assembly and filtering pipeline has identified and removed a set of 4,293 lncRNA transcripts from previous publications that were in fact derived from partially processed mRNAs with intron retention. We have used weighted gene co-expression network analyses and identified 15 different gene co-expression modules. Each parasite life-cycle stage has at least one highly correlated gene co-expression module, and each module is comprised of hundreds to thousands lncRNAs and mRNAs having correlated co-expression patterns at different stages. Inspection of the top most highly connected genes within the modules’ networks has shown that different lncRNAs are hub genes at different life-cycle stages, being among the most promising candidate lncRNAs to be further explored for functional characterization.
format Online
Article
Text
id pubmed-6752179
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-67521792019-09-30 Weighted Gene Co-Expression Analyses Point to Long Non-Coding RNA Hub Genes at Different Schistosoma mansoni Life-Cycle Stages Maciel, Lucas F. Morales-Vicente, David A. Silveira, Gilbert O. Ribeiro, Raphael O. Olberg, Giovanna G. O. Pires, David S. Amaral, Murilo S. Verjovski-Almeida, Sergio Front Genet Genetics Long non-coding RNAs (lncRNAs) (>200 nt) are expressed at levels lower than those of the protein-coding mRNAs, and in all eukaryotic model species where they have been characterized, they are transcribed from thousands of different genomic loci. In humans, some four dozen lncRNAs have been studied in detail, and they have been shown to play important roles in transcriptional regulation, acting in conjunction with transcription factors and epigenetic marks to modulate the tissue-type specific programs of transcriptional gene activation and repression. In Schistosoma mansoni, around 10,000 lncRNAs have been identified in previous works. However, the limited number of RNA-sequencing (RNA-seq) libraries that had been previously assessed, together with the use of old and incomplete versions of the S. mansoni genome and protein-coding transcriptome annotations, have hampered the identification of all lncRNAs expressed in the parasite. Here we have used 633 publicly available S. mansoni RNA-seq libraries from whole worms at different stages (n = 121), from isolated tissues (n = 24), from cell-populations (n = 81), and from single-cells (n = 407). We have assembled a set of 16,583 lncRNA transcripts originated from 10,024 genes, of which 11,022 are novel S. mansoni lncRNA transcripts, whereas the remaining 5,561 transcripts comprise 120 lncRNAs that are identical to and 5,441 lncRNAs that have gene overlap with S. mansoni lncRNAs already reported in previous works. Most importantly, our more stringent assembly and filtering pipeline has identified and removed a set of 4,293 lncRNA transcripts from previous publications that were in fact derived from partially processed mRNAs with intron retention. We have used weighted gene co-expression network analyses and identified 15 different gene co-expression modules. Each parasite life-cycle stage has at least one highly correlated gene co-expression module, and each module is comprised of hundreds to thousands lncRNAs and mRNAs having correlated co-expression patterns at different stages. Inspection of the top most highly connected genes within the modules’ networks has shown that different lncRNAs are hub genes at different life-cycle stages, being among the most promising candidate lncRNAs to be further explored for functional characterization. Frontiers Media S.A. 2019-09-12 /pmc/articles/PMC6752179/ /pubmed/31572441 http://dx.doi.org/10.3389/fgene.2019.00823 Text en Copyright © 2019 Maciel, Morales-Vicente, Silveira, Ribeiro, Olberg, Pires, Amaral and Verjovski-Almeida http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Maciel, Lucas F.
Morales-Vicente, David A.
Silveira, Gilbert O.
Ribeiro, Raphael O.
Olberg, Giovanna G. O.
Pires, David S.
Amaral, Murilo S.
Verjovski-Almeida, Sergio
Weighted Gene Co-Expression Analyses Point to Long Non-Coding RNA Hub Genes at Different Schistosoma mansoni Life-Cycle Stages
title Weighted Gene Co-Expression Analyses Point to Long Non-Coding RNA Hub Genes at Different Schistosoma mansoni Life-Cycle Stages
title_full Weighted Gene Co-Expression Analyses Point to Long Non-Coding RNA Hub Genes at Different Schistosoma mansoni Life-Cycle Stages
title_fullStr Weighted Gene Co-Expression Analyses Point to Long Non-Coding RNA Hub Genes at Different Schistosoma mansoni Life-Cycle Stages
title_full_unstemmed Weighted Gene Co-Expression Analyses Point to Long Non-Coding RNA Hub Genes at Different Schistosoma mansoni Life-Cycle Stages
title_short Weighted Gene Co-Expression Analyses Point to Long Non-Coding RNA Hub Genes at Different Schistosoma mansoni Life-Cycle Stages
title_sort weighted gene co-expression analyses point to long non-coding rna hub genes at different schistosoma mansoni life-cycle stages
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6752179/
https://www.ncbi.nlm.nih.gov/pubmed/31572441
http://dx.doi.org/10.3389/fgene.2019.00823
work_keys_str_mv AT maciellucasf weightedgenecoexpressionanalysespointtolongnoncodingrnahubgenesatdifferentschistosomamansonilifecyclestages
AT moralesvicentedavida weightedgenecoexpressionanalysespointtolongnoncodingrnahubgenesatdifferentschistosomamansonilifecyclestages
AT silveiragilberto weightedgenecoexpressionanalysespointtolongnoncodingrnahubgenesatdifferentschistosomamansonilifecyclestages
AT ribeiroraphaelo weightedgenecoexpressionanalysespointtolongnoncodingrnahubgenesatdifferentschistosomamansonilifecyclestages
AT olberggiovannago weightedgenecoexpressionanalysespointtolongnoncodingrnahubgenesatdifferentschistosomamansonilifecyclestages
AT piresdavids weightedgenecoexpressionanalysespointtolongnoncodingrnahubgenesatdifferentschistosomamansonilifecyclestages
AT amaralmurilos weightedgenecoexpressionanalysespointtolongnoncodingrnahubgenesatdifferentschistosomamansonilifecyclestages
AT verjovskialmeidasergio weightedgenecoexpressionanalysespointtolongnoncodingrnahubgenesatdifferentschistosomamansonilifecyclestages