Cargando…

Conserved regions in long non-coding RNAs contain abundant translation and protein–RNA interaction signatures

The mammalian transcriptome includes thousands of transcripts that do not correspond to annotated protein-coding genes and that are known as long non-coding RNAs (lncRNAs). A handful of lncRNAs have well-characterized regulatory functions but the biological significance of the majority of them is no...

Descripción completa

Detalles Bibliográficos
Autores principales: Ruiz-Orera, Jorge, Albà, M Mar
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7671363/
https://www.ncbi.nlm.nih.gov/pubmed/33575549
http://dx.doi.org/10.1093/nargab/lqz002
_version_ 1783610914861219840
author Ruiz-Orera, Jorge
Albà, M Mar
author_facet Ruiz-Orera, Jorge
Albà, M Mar
author_sort Ruiz-Orera, Jorge
collection PubMed
description The mammalian transcriptome includes thousands of transcripts that do not correspond to annotated protein-coding genes and that are known as long non-coding RNAs (lncRNAs). A handful of lncRNAs have well-characterized regulatory functions but the biological significance of the majority of them is not well understood. LncRNAs that are conserved between mice and humans are likely to be enriched in functional sequences. Here, we investigate the presence of different types of ribosome profiling signatures in lncRNAs and how they relate to sequence conservation. We find that lncRNA-conserved regions contain three times more ORFs with translation evidence than non-conserved ones, and identify nine cases that display significant sequence constraints at the amino acid sequence level. The study also reveals that conserved regions in intergenic lncRNAs are significantly enriched in protein–RNA interaction signatures when compared to non-conserved ones; this includes sites in well-characterized lncRNAs, such as Cyrano, Malat1, Neat1 and Meg3, as well as in tens of lncRNAs of unknown function. This work illustrates how the analysis of ribosome profiling data coupled with evolutionary analysis provides new opportunities to explore the lncRNA functional landscape.
format Online
Article
Text
id pubmed-7671363
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-76713632021-02-10 Conserved regions in long non-coding RNAs contain abundant translation and protein–RNA interaction signatures Ruiz-Orera, Jorge Albà, M Mar NAR Genom Bioinform Standard Article The mammalian transcriptome includes thousands of transcripts that do not correspond to annotated protein-coding genes and that are known as long non-coding RNAs (lncRNAs). A handful of lncRNAs have well-characterized regulatory functions but the biological significance of the majority of them is not well understood. LncRNAs that are conserved between mice and humans are likely to be enriched in functional sequences. Here, we investigate the presence of different types of ribosome profiling signatures in lncRNAs and how they relate to sequence conservation. We find that lncRNA-conserved regions contain three times more ORFs with translation evidence than non-conserved ones, and identify nine cases that display significant sequence constraints at the amino acid sequence level. The study also reveals that conserved regions in intergenic lncRNAs are significantly enriched in protein–RNA interaction signatures when compared to non-conserved ones; this includes sites in well-characterized lncRNAs, such as Cyrano, Malat1, Neat1 and Meg3, as well as in tens of lncRNAs of unknown function. This work illustrates how the analysis of ribosome profiling data coupled with evolutionary analysis provides new opportunities to explore the lncRNA functional landscape. Oxford University Press 2019-07-05 /pmc/articles/PMC7671363/ /pubmed/33575549 http://dx.doi.org/10.1093/nargab/lqz002 Text en © The Author(s) 2019. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Standard Article
Ruiz-Orera, Jorge
Albà, M Mar
Conserved regions in long non-coding RNAs contain abundant translation and protein–RNA interaction signatures
title Conserved regions in long non-coding RNAs contain abundant translation and protein–RNA interaction signatures
title_full Conserved regions in long non-coding RNAs contain abundant translation and protein–RNA interaction signatures
title_fullStr Conserved regions in long non-coding RNAs contain abundant translation and protein–RNA interaction signatures
title_full_unstemmed Conserved regions in long non-coding RNAs contain abundant translation and protein–RNA interaction signatures
title_short Conserved regions in long non-coding RNAs contain abundant translation and protein–RNA interaction signatures
title_sort conserved regions in long non-coding rnas contain abundant translation and protein–rna interaction signatures
topic Standard Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7671363/
https://www.ncbi.nlm.nih.gov/pubmed/33575549
http://dx.doi.org/10.1093/nargab/lqz002
work_keys_str_mv AT ruizorerajorge conservedregionsinlongnoncodingrnascontainabundanttranslationandproteinrnainteractionsignatures
AT albammar conservedregionsinlongnoncodingrnascontainabundanttranslationandproteinrnainteractionsignatures