Cargando…

Open reading frame dominance indicates protein‐coding potential of RNAs

Recent studies have identified numerous RNAs with both coding and noncoding functions. However, the sequence characteristics that determine this bifunctionality remain largely unknown. In the present study, we develop and test the open reading frame (ORF) dominance score, which we define as the frac...

Descripción completa

Detalles Bibliográficos
Autores principales: Suenaga, Yusuke, Kato, Mamoru, Nagai, Momoko, Nakatani, Kazuma, Kogashi, Hiroyuki, Kobatake, Miho, Makino, Takashi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9171421/
https://www.ncbi.nlm.nih.gov/pubmed/35438231
http://dx.doi.org/10.15252/embr.202154321
_version_ 1784721662041653248
author Suenaga, Yusuke
Kato, Mamoru
Nagai, Momoko
Nakatani, Kazuma
Kogashi, Hiroyuki
Kobatake, Miho
Makino, Takashi
author_facet Suenaga, Yusuke
Kato, Mamoru
Nagai, Momoko
Nakatani, Kazuma
Kogashi, Hiroyuki
Kobatake, Miho
Makino, Takashi
author_sort Suenaga, Yusuke
collection PubMed
description Recent studies have identified numerous RNAs with both coding and noncoding functions. However, the sequence characteristics that determine this bifunctionality remain largely unknown. In the present study, we develop and test the open reading frame (ORF) dominance score, which we define as the fraction of the longest ORF in the sum of all putative ORF lengths. This score correlates with translation efficiency in coding transcripts and with translation of noncoding RNAs. In bacteria and archaea, coding and noncoding transcripts have narrow distributions of high and low ORF dominance, respectively, whereas those of eukaryotes show relatively broader ORF dominance distributions, with considerable overlap between coding and noncoding transcripts. The extent of overlap positively and negatively correlates with the mutation rate of genomes and the effective population size of species, respectively. Tissue‐specific transcripts show higher ORF dominance than ubiquitously expressed transcripts, and the majority of tissue‐specific transcripts are expressed in mature testes. These data suggest that the decrease in population size and the emergence of testes in eukaryotic organisms allowed for the evolution of potentially bifunctional RNAs.
format Online
Article
Text
id pubmed-9171421
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-91714212022-06-16 Open reading frame dominance indicates protein‐coding potential of RNAs Suenaga, Yusuke Kato, Mamoru Nagai, Momoko Nakatani, Kazuma Kogashi, Hiroyuki Kobatake, Miho Makino, Takashi EMBO Rep Articles Recent studies have identified numerous RNAs with both coding and noncoding functions. However, the sequence characteristics that determine this bifunctionality remain largely unknown. In the present study, we develop and test the open reading frame (ORF) dominance score, which we define as the fraction of the longest ORF in the sum of all putative ORF lengths. This score correlates with translation efficiency in coding transcripts and with translation of noncoding RNAs. In bacteria and archaea, coding and noncoding transcripts have narrow distributions of high and low ORF dominance, respectively, whereas those of eukaryotes show relatively broader ORF dominance distributions, with considerable overlap between coding and noncoding transcripts. The extent of overlap positively and negatively correlates with the mutation rate of genomes and the effective population size of species, respectively. Tissue‐specific transcripts show higher ORF dominance than ubiquitously expressed transcripts, and the majority of tissue‐specific transcripts are expressed in mature testes. These data suggest that the decrease in population size and the emergence of testes in eukaryotic organisms allowed for the evolution of potentially bifunctional RNAs. John Wiley and Sons Inc. 2022-04-19 /pmc/articles/PMC9171421/ /pubmed/35438231 http://dx.doi.org/10.15252/embr.202154321 Text en © 2022 The Authors. Published under the terms of the CC BY 4.0 license https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Articles
Suenaga, Yusuke
Kato, Mamoru
Nagai, Momoko
Nakatani, Kazuma
Kogashi, Hiroyuki
Kobatake, Miho
Makino, Takashi
Open reading frame dominance indicates protein‐coding potential of RNAs
title Open reading frame dominance indicates protein‐coding potential of RNAs
title_full Open reading frame dominance indicates protein‐coding potential of RNAs
title_fullStr Open reading frame dominance indicates protein‐coding potential of RNAs
title_full_unstemmed Open reading frame dominance indicates protein‐coding potential of RNAs
title_short Open reading frame dominance indicates protein‐coding potential of RNAs
title_sort open reading frame dominance indicates protein‐coding potential of rnas
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9171421/
https://www.ncbi.nlm.nih.gov/pubmed/35438231
http://dx.doi.org/10.15252/embr.202154321
work_keys_str_mv AT suenagayusuke openreadingframedominanceindicatesproteincodingpotentialofrnas
AT katomamoru openreadingframedominanceindicatesproteincodingpotentialofrnas
AT nagaimomoko openreadingframedominanceindicatesproteincodingpotentialofrnas
AT nakatanikazuma openreadingframedominanceindicatesproteincodingpotentialofrnas
AT kogashihiroyuki openreadingframedominanceindicatesproteincodingpotentialofrnas
AT kobatakemiho openreadingframedominanceindicatesproteincodingpotentialofrnas
AT makinotakashi openreadingframedominanceindicatesproteincodingpotentialofrnas