Cargando…

A case for a negative-strand coding sequence in a group of positive-sense RNA viruses

Positive-sense single-stranded RNA viruses form the largest and most diverse group of eukaryote-infecting viruses. Their genomes comprise one or more segments of coding-sense RNA that function directly as messenger RNAs upon release into the cytoplasm of infected cells. Positive-sense RNA viruses ar...

Descripción completa

Detalles Bibliográficos
Autores principales: Dinan, Adam M, Lukhovitskaya, Nina I, Olendraite, Ingrida, Firth, Andrew E
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7010960/
https://www.ncbi.nlm.nih.gov/pubmed/32064120
http://dx.doi.org/10.1093/ve/veaa007
_version_ 1783495976367947776
author Dinan, Adam M
Lukhovitskaya, Nina I
Olendraite, Ingrida
Firth, Andrew E
author_facet Dinan, Adam M
Lukhovitskaya, Nina I
Olendraite, Ingrida
Firth, Andrew E
author_sort Dinan, Adam M
collection PubMed
description Positive-sense single-stranded RNA viruses form the largest and most diverse group of eukaryote-infecting viruses. Their genomes comprise one or more segments of coding-sense RNA that function directly as messenger RNAs upon release into the cytoplasm of infected cells. Positive-sense RNA viruses are generally accepted to encode proteins solely on the positive strand. However, we previously identified a surprisingly long (∼1,000-codon) open reading frame (ORF) on the negative strand of some members of the family Narnaviridae which, together with RNA bacteriophages of the family Leviviridae, form a sister group to all other positive-sense RNA viruses. Here, we completed the genomes of three mosquito-associated narnaviruses, all of which have the long reverse-frame ORF. We systematically identified narnaviral sequences in public data sets from a wide range of sources, including arthropod, fungal, and plant transcriptomic data sets. Long reverse-frame ORFs are widespread in one clade of narnaviruses, where they frequently occupy >95 per cent of the genome. The reverse-frame ORFs correspond to a specific avoidance of CUA, UUA, and UCA codons (i.e. stop codon reverse complements) in the forward-frame RNA-dependent RNA polymerase ORF. However, absence of these codons cannot be explained by other factors such as inability to decode these codons or GC3 bias. Together with other analyses, we provide the strongest evidence yet of coding capacity on the negative strand of a positive-sense RNA virus. As these ORFs comprise some of the longest known overlapping genes, their study may be of broad relevance to understanding overlapping gene evolution and de novo origin of genes.
format Online
Article
Text
id pubmed-7010960
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-70109602020-02-14 A case for a negative-strand coding sequence in a group of positive-sense RNA viruses Dinan, Adam M Lukhovitskaya, Nina I Olendraite, Ingrida Firth, Andrew E Virus Evol Research Article Positive-sense single-stranded RNA viruses form the largest and most diverse group of eukaryote-infecting viruses. Their genomes comprise one or more segments of coding-sense RNA that function directly as messenger RNAs upon release into the cytoplasm of infected cells. Positive-sense RNA viruses are generally accepted to encode proteins solely on the positive strand. However, we previously identified a surprisingly long (∼1,000-codon) open reading frame (ORF) on the negative strand of some members of the family Narnaviridae which, together with RNA bacteriophages of the family Leviviridae, form a sister group to all other positive-sense RNA viruses. Here, we completed the genomes of three mosquito-associated narnaviruses, all of which have the long reverse-frame ORF. We systematically identified narnaviral sequences in public data sets from a wide range of sources, including arthropod, fungal, and plant transcriptomic data sets. Long reverse-frame ORFs are widespread in one clade of narnaviruses, where they frequently occupy >95 per cent of the genome. The reverse-frame ORFs correspond to a specific avoidance of CUA, UUA, and UCA codons (i.e. stop codon reverse complements) in the forward-frame RNA-dependent RNA polymerase ORF. However, absence of these codons cannot be explained by other factors such as inability to decode these codons or GC3 bias. Together with other analyses, we provide the strongest evidence yet of coding capacity on the negative strand of a positive-sense RNA virus. As these ORFs comprise some of the longest known overlapping genes, their study may be of broad relevance to understanding overlapping gene evolution and de novo origin of genes. Oxford University Press 2020-02-10 /pmc/articles/PMC7010960/ /pubmed/32064120 http://dx.doi.org/10.1093/ve/veaa007 Text en © The Author(s) 2020. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Dinan, Adam M
Lukhovitskaya, Nina I
Olendraite, Ingrida
Firth, Andrew E
A case for a negative-strand coding sequence in a group of positive-sense RNA viruses
title A case for a negative-strand coding sequence in a group of positive-sense RNA viruses
title_full A case for a negative-strand coding sequence in a group of positive-sense RNA viruses
title_fullStr A case for a negative-strand coding sequence in a group of positive-sense RNA viruses
title_full_unstemmed A case for a negative-strand coding sequence in a group of positive-sense RNA viruses
title_short A case for a negative-strand coding sequence in a group of positive-sense RNA viruses
title_sort case for a negative-strand coding sequence in a group of positive-sense rna viruses
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7010960/
https://www.ncbi.nlm.nih.gov/pubmed/32064120
http://dx.doi.org/10.1093/ve/veaa007
work_keys_str_mv AT dinanadamm acaseforanegativestrandcodingsequenceinagroupofpositivesensernaviruses
AT lukhovitskayaninai acaseforanegativestrandcodingsequenceinagroupofpositivesensernaviruses
AT olendraiteingrida acaseforanegativestrandcodingsequenceinagroupofpositivesensernaviruses
AT firthandrewe acaseforanegativestrandcodingsequenceinagroupofpositivesensernaviruses
AT dinanadamm caseforanegativestrandcodingsequenceinagroupofpositivesensernaviruses
AT lukhovitskayaninai caseforanegativestrandcodingsequenceinagroupofpositivesensernaviruses
AT olendraiteingrida caseforanegativestrandcodingsequenceinagroupofpositivesensernaviruses
AT firthandrewe caseforanegativestrandcodingsequenceinagroupofpositivesensernaviruses