Cargando…

Systematizing the genomic order and relatedness in the open reading frames (ORFs) of the coronaviruses

The coronaviruses (CoVs), including SARS-CoV-2, the agent of the ongoing deadly CoVID-19 pandemic (Coronavirus disease-2019), represent a highly complex and diverse class of RNA viruses with large genomes, complex gene repertoire, and intricate transcriptional and translational mechanisms. The 3′-te...

Descripción completa

Detalles Bibliográficos
Autor principal: Barik, Sailen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Author(s). Published by Elsevier B.V. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8053407/
https://www.ncbi.nlm.nih.gov/pubmed/33848683
http://dx.doi.org/10.1016/j.meegid.2021.104858
_version_ 1783680115345981440
author Barik, Sailen
author_facet Barik, Sailen
author_sort Barik, Sailen
collection PubMed
description The coronaviruses (CoVs), including SARS-CoV-2, the agent of the ongoing deadly CoVID-19 pandemic (Coronavirus disease-2019), represent a highly complex and diverse class of RNA viruses with large genomes, complex gene repertoire, and intricate transcriptional and translational mechanisms. The 3′-terminal one-third of the genome encodes four structural proteins, namely spike, envelope, membrane, and nucleocapsid, interspersed with genes for accessory proteins that are largely nonstructural and called ‘open reading frame’ (ORF) proteins with alphanumerical designations, but not in a consistent or sequential order. Here, I report a comparative study of these ORF proteins, mainly encoded in two gene clusters, i.e. between the Spike and the Envelope genes, and between the Membrane and the Nucleocapsid genes. For brevity and focus, a greater emphasis was placed on the first cluster, collectively designated as the ‘orf3 region’ for ease of referral. Overall, an apparently diverse set of ORFs, such as ORF3a, ORF3b, ORF3c, ORF3d, ORF4 and ORF5, but not necessarily numbered in that order on all CoV genomes, were analyzed along with other ORFs. Unexpectedly, the gene order or naming of the ORFs were never fully conserved even within the members of one Genus. These studies also unraveled hitherto unrecognized orf genes in alternative translational frames, encoding potentially novel polypeptides as well as some that are highly similar to known ORFs. Finally, several options of an inclusive and systematic numbering are proposed not only for the orf3 region but also for the other orf genes in the viral genome in an effort to regularize the apparently confusing names and orders. Regardless of the ultimate acceptability of one system over the others, this treatise is hoped to initiate an informed discourse in this area.
format Online
Article
Text
id pubmed-8053407
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher The Author(s). Published by Elsevier B.V.
record_format MEDLINE/PubMed
spelling pubmed-80534072021-04-19 Systematizing the genomic order and relatedness in the open reading frames (ORFs) of the coronaviruses Barik, Sailen Infect Genet Evol Research Paper The coronaviruses (CoVs), including SARS-CoV-2, the agent of the ongoing deadly CoVID-19 pandemic (Coronavirus disease-2019), represent a highly complex and diverse class of RNA viruses with large genomes, complex gene repertoire, and intricate transcriptional and translational mechanisms. The 3′-terminal one-third of the genome encodes four structural proteins, namely spike, envelope, membrane, and nucleocapsid, interspersed with genes for accessory proteins that are largely nonstructural and called ‘open reading frame’ (ORF) proteins with alphanumerical designations, but not in a consistent or sequential order. Here, I report a comparative study of these ORF proteins, mainly encoded in two gene clusters, i.e. between the Spike and the Envelope genes, and between the Membrane and the Nucleocapsid genes. For brevity and focus, a greater emphasis was placed on the first cluster, collectively designated as the ‘orf3 region’ for ease of referral. Overall, an apparently diverse set of ORFs, such as ORF3a, ORF3b, ORF3c, ORF3d, ORF4 and ORF5, but not necessarily numbered in that order on all CoV genomes, were analyzed along with other ORFs. Unexpectedly, the gene order or naming of the ORFs were never fully conserved even within the members of one Genus. These studies also unraveled hitherto unrecognized orf genes in alternative translational frames, encoding potentially novel polypeptides as well as some that are highly similar to known ORFs. Finally, several options of an inclusive and systematic numbering are proposed not only for the orf3 region but also for the other orf genes in the viral genome in an effort to regularize the apparently confusing names and orders. Regardless of the ultimate acceptability of one system over the others, this treatise is hoped to initiate an informed discourse in this area. The Author(s). Published by Elsevier B.V. 2021-08 2021-04-18 /pmc/articles/PMC8053407/ /pubmed/33848683 http://dx.doi.org/10.1016/j.meegid.2021.104858 Text en © 2021 The Author(s) Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle Research Paper
Barik, Sailen
Systematizing the genomic order and relatedness in the open reading frames (ORFs) of the coronaviruses
title Systematizing the genomic order and relatedness in the open reading frames (ORFs) of the coronaviruses
title_full Systematizing the genomic order and relatedness in the open reading frames (ORFs) of the coronaviruses
title_fullStr Systematizing the genomic order and relatedness in the open reading frames (ORFs) of the coronaviruses
title_full_unstemmed Systematizing the genomic order and relatedness in the open reading frames (ORFs) of the coronaviruses
title_short Systematizing the genomic order and relatedness in the open reading frames (ORFs) of the coronaviruses
title_sort systematizing the genomic order and relatedness in the open reading frames (orfs) of the coronaviruses
topic Research Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8053407/
https://www.ncbi.nlm.nih.gov/pubmed/33848683
http://dx.doi.org/10.1016/j.meegid.2021.104858
work_keys_str_mv AT bariksailen systematizingthegenomicorderandrelatednessintheopenreadingframesorfsofthecoronaviruses