Comparative analysis of human coronaviruses focusing on nucleotide variability and synonymous codon usage patterns

The prevailing COVID-19 pandemic has drawn the attention of the scientific community to study the evolutionary origin of Severe Acute Respiratory Syndrome Corona Virus 2 (SARS-CoV-2). This study is a comprehensive quantitative analysis of the protein-coding sequences of seven human coronaviruses (HC...

Bibliographic Details
Main Authors: Das, Jayanta Kumar, Roy, Swarup
Format: Online Article Text
Language: English
Published: Elsevier Inc. 2021
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8131179/
https://www.ncbi.nlm.nih.gov/pubmed/34019999
http://dx.doi.org/10.1016/j.ygeno.2021.05.008
author Das, Jayanta Kumar
Roy, Swarup
collection PubMed
description The prevailing COVID-19 pandemic has drawn the attention of the scientific community to the evolutionary origin of Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2). This study is a comprehensive quantitative analysis of the protein-coding sequences of seven human coronaviruses (HCoVs) to decipher the nucleotide sequence variability and codon usage patterns. Such analysis is essential for understanding the survival ability of the viruses, their adaptation to hosts, and their evolution. The current analysis revealed a high relative abundance (odds ratio) of the GC and CT dinucleotide pairs in the first two and last two codon positions, respectively, as well as a low abundance of the CG pair in the last two positions of the codon, which might be related to the evolution of the viruses. A remarkable level of variability in GC content at the third codon position was observed among the seven coronaviruses. Codons with high relative synonymous codon usage (RSCU) values are primarily from the aliphatic and hydroxyl amino acid groups, and codons with low RSCU values belong to the aliphatic, cyclic, positively charged, and sulfur-containing amino acid groups. To elucidate the evolutionary processes of the seven coronaviruses, a phylogenetic tree (dendrogram) was constructed based on the RSCU scores of the codons. CoVs in the severe and mild categories were positioned in different clades. A comparative phylogenetic study with other coronaviruses showed that SARS-CoV-2 is close to the CoVs isolated from pangolins (Manis javanica, Pangolin-CoV) and cats (Felis catus, SARS(r)-CoV). Further analysis of codon usage bias using the effective number of codons (ENC) showed a relatively higher bias for SARS-CoV and MERS-CoV compared to SARS-CoV-2. The plot of ENC against GC3 suggested that mutational bias might play a role in determining the codon usage variation among the candidate viruses. A codon adaptability study on a few parasites of the human host (from different kingdoms), including CoVs, showed a diverse adaptability pattern. SARS-CoV-2 and SARS-CoV exhibit relatively lower but similar codon adaptability compared to MERS-CoV.
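The abstract does not spell out how RSCU and GC3 are computed. As a rough illustration only, the sketch below uses the commonly cited definitions: RSCU is a codon's observed count divided by the count expected if all synonymous codons for that amino acid were used equally, and GC3 is the fraction of codons with G or C at the third position. The function names (codons, rscu, gc3) and the toy sequence are hypothetical and are not taken from the article; the authors' actual pipeline may differ, for example in how Met and Trp codons or ambiguous bases are handled.

```python
from collections import Counter
from itertools import product

# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(triplet): aa
    for triplet, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def codons(seq):
    """Yield in-frame codons, skipping any triplet with non-ACGT symbols."""
    seq = seq.upper().replace("U", "T")
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if codon in CODON_TABLE:
            yield codon


def rscu(seq):
    """Return {codon: RSCU} for amino acids that occur in seq (stops excluded)."""
    counts = Counter(codons(seq))
    families = {}
    for codon, aa in CODON_TABLE.items():
        families.setdefault(aa, []).append(codon)
    values = {}
    for aa, family in families.items():
        total = sum(counts[c] for c in family)
        if aa == "*" or total == 0:
            continue  # assumption: ignore stop codons and absent amino acids
        for c in family:
            # observed count / (total / number of synonymous codons)
            values[c] = counts[c] * len(family) / total
    return values


def gc3(seq):
    """Fraction of codons whose third base is G or C."""
    third = [c[2] for c in codons(seq)]
    return sum(b in "GC" for b in third) / len(third) if third else 0.0


if __name__ == "__main__":
    toy = "GCTGCTGCCGCA"  # toy sequence: four alanine codons, not real viral data
    print(rscu(toy)["GCT"])  # 2.0
    print(gc3(toy))          # 0.25
```

In the toy sequence, GCT appears twice among four alanine codons, so its RSCU is 2.0 (twice the count expected under equal usage), and one of the four third-position bases is G or C, giving a GC3 of 0.25. A vector of RSCU values per genome is the kind of input that a distance-based dendrogram could be built from, though the clustering method used in the article is not stated in this record.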
format Online
Article
Text
id pubmed-8131179
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Elsevier Inc.
record_format MEDLINE/PubMed
spelling pubmed-8131179 2021-05-19 Comparative analysis of human coronaviruses focusing on nucleotide variability and synonymous codon usage patterns Das, Jayanta Kumar Roy, Swarup Genomics Original Article Elsevier Inc. 2021-07 2021-05-19 /pmc/articles/PMC8131179/ /pubmed/34019999 http://dx.doi.org/10.1016/j.ygeno.2021.05.008 Text en © 2021 Elsevier Inc. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
title Comparative analysis of human coronaviruses focusing on nucleotide variability and synonymous codon usage patterns
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8131179/
https://www.ncbi.nlm.nih.gov/pubmed/34019999
http://dx.doi.org/10.1016/j.ygeno.2021.05.008