Cargando…
Systematizing the genomic order and relatedness in the open reading frames (ORFs) of the coronaviruses
The coronaviruses (CoVs), including SARS-CoV-2, the agent of the ongoing deadly CoVID-19 pandemic (Coronavirus disease-2019), represent a highly complex and diverse class of RNA viruses with large genomes, complex gene repertoire, and intricate transcriptional and translational mechanisms. The 3′-te...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
The Author(s). Published by Elsevier B.V.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8053407/ https://www.ncbi.nlm.nih.gov/pubmed/33848683 http://dx.doi.org/10.1016/j.meegid.2021.104858 |
_version_ | 1783680115345981440 |
---|---|
author | Barik, Sailen |
author_facet | Barik, Sailen |
author_sort | Barik, Sailen |
collection | PubMed |
description | The coronaviruses (CoVs), including SARS-CoV-2, the agent of the ongoing deadly CoVID-19 pandemic (Coronavirus disease-2019), represent a highly complex and diverse class of RNA viruses with large genomes, complex gene repertoire, and intricate transcriptional and translational mechanisms. The 3′-terminal one-third of the genome encodes four structural proteins, namely spike, envelope, membrane, and nucleocapsid, interspersed with genes for accessory proteins that are largely nonstructural and called ‘open reading frame’ (ORF) proteins with alphanumerical designations, but not in a consistent or sequential order. Here, I report a comparative study of these ORF proteins, mainly encoded in two gene clusters, i.e. between the Spike and the Envelope genes, and between the Membrane and the Nucleocapsid genes. For brevity and focus, a greater emphasis was placed on the first cluster, collectively designated as the ‘orf3 region’ for ease of referral. Overall, an apparently diverse set of ORFs, such as ORF3a, ORF3b, ORF3c, ORF3d, ORF4 and ORF5, but not necessarily numbered in that order on all CoV genomes, were analyzed along with other ORFs. Unexpectedly, the gene order or naming of the ORFs were never fully conserved even within the members of one Genus. These studies also unraveled hitherto unrecognized orf genes in alternative translational frames, encoding potentially novel polypeptides as well as some that are highly similar to known ORFs. Finally, several options of an inclusive and systematic numbering are proposed not only for the orf3 region but also for the other orf genes in the viral genome in an effort to regularize the apparently confusing names and orders. Regardless of the ultimate acceptability of one system over the others, this treatise is hoped to initiate an informed discourse in this area. |
format | Online Article Text |
id | pubmed-8053407 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | The Author(s). Published by Elsevier B.V. |
record_format | MEDLINE/PubMed |
spelling | pubmed-80534072021-04-19 Systematizing the genomic order and relatedness in the open reading frames (ORFs) of the coronaviruses Barik, Sailen Infect Genet Evol Research Paper The coronaviruses (CoVs), including SARS-CoV-2, the agent of the ongoing deadly CoVID-19 pandemic (Coronavirus disease-2019), represent a highly complex and diverse class of RNA viruses with large genomes, complex gene repertoire, and intricate transcriptional and translational mechanisms. The 3′-terminal one-third of the genome encodes four structural proteins, namely spike, envelope, membrane, and nucleocapsid, interspersed with genes for accessory proteins that are largely nonstructural and called ‘open reading frame’ (ORF) proteins with alphanumerical designations, but not in a consistent or sequential order. Here, I report a comparative study of these ORF proteins, mainly encoded in two gene clusters, i.e. between the Spike and the Envelope genes, and between the Membrane and the Nucleocapsid genes. For brevity and focus, a greater emphasis was placed on the first cluster, collectively designated as the ‘orf3 region’ for ease of referral. Overall, an apparently diverse set of ORFs, such as ORF3a, ORF3b, ORF3c, ORF3d, ORF4 and ORF5, but not necessarily numbered in that order on all CoV genomes, were analyzed along with other ORFs. Unexpectedly, the gene order or naming of the ORFs were never fully conserved even within the members of one Genus. These studies also unraveled hitherto unrecognized orf genes in alternative translational frames, encoding potentially novel polypeptides as well as some that are highly similar to known ORFs. Finally, several options of an inclusive and systematic numbering are proposed not only for the orf3 region but also for the other orf genes in the viral genome in an effort to regularize the apparently confusing names and orders. Regardless of the ultimate acceptability of one system over the others, this treatise is hoped to initiate an informed discourse in this area. The Author(s). Published by Elsevier B.V. 2021-08 2021-04-18 /pmc/articles/PMC8053407/ /pubmed/33848683 http://dx.doi.org/10.1016/j.meegid.2021.104858 Text en © 2021 The Author(s) Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active. |
spellingShingle | Research Paper Barik, Sailen Systematizing the genomic order and relatedness in the open reading frames (ORFs) of the coronaviruses |
title | Systematizing the genomic order and relatedness in the open reading frames (ORFs) of the coronaviruses |
title_full | Systematizing the genomic order and relatedness in the open reading frames (ORFs) of the coronaviruses |
title_fullStr | Systematizing the genomic order and relatedness in the open reading frames (ORFs) of the coronaviruses |
title_full_unstemmed | Systematizing the genomic order and relatedness in the open reading frames (ORFs) of the coronaviruses |
title_short | Systematizing the genomic order and relatedness in the open reading frames (ORFs) of the coronaviruses |
title_sort | systematizing the genomic order and relatedness in the open reading frames (orfs) of the coronaviruses |
topic | Research Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8053407/ https://www.ncbi.nlm.nih.gov/pubmed/33848683 http://dx.doi.org/10.1016/j.meegid.2021.104858 |
work_keys_str_mv | AT bariksailen systematizingthegenomicorderandrelatednessintheopenreadingframesorfsofthecoronaviruses |