Cargando…
Comparative analysis of human coronaviruses focusing on nucleotide variability and synonymous codon usage patterns
The prevailing COVID-19 pandemic has drawn the attention of the scientific community to study the evolutionary origin of Severe Acute Respiratory Syndrome Corona Virus 2 (SARS-CoV-2). This study is a comprehensive quantitative analysis of the protein-coding sequences of seven human coronaviruses (HC...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier Inc.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8131179/ https://www.ncbi.nlm.nih.gov/pubmed/34019999 http://dx.doi.org/10.1016/j.ygeno.2021.05.008 |
_version_ | 1783694664608514048 |
---|---|
author | Das, Jayanta Kumar Roy, Swarup |
author_facet | Das, Jayanta Kumar Roy, Swarup |
author_sort | Das, Jayanta Kumar |
collection | PubMed |
description | The prevailing COVID-19 pandemic has drawn the attention of the scientific community to study the evolutionary origin of Severe Acute Respiratory Syndrome Corona Virus 2 (SARS-CoV-2). This study is a comprehensive quantitative analysis of the protein-coding sequences of seven human coronaviruses (HCoVs) to decipher the nucleotide sequence variability and codon usage patterns. It is essential to understand the survival ability of the viruses, their adaptation to hosts, and their evolution. The current analysis revealed a high abundance of the relative dinucleotide (odds ratio), GC and CT pairs in the first and last two codon positions, respectively, as well as a low abundance of the CG pair in the last two positions of the codon, which might be related to the evolution of the viruses. A remarkable level of variability of GC content in the third position of the codon among the seven coronaviruses was observed. Codons with high RSCU values are primarily from the aliphatic and hydroxyl amino acid groups, and codons with low RSCU values belong to the aliphatic, cyclic, positively charged, and sulfur-containing amino acid groups. In order to elucidate the evolutionary processes of the seven coronaviruses, a phylogenetic tree (dendrogram) was constructed based on the RSCU scores of the codons. The severe and mild categories CoVs were positioned in different clades. A comparative phylogenetic study with other coronaviruses depicted that SARS-CoV-2 is close to the CoV isolated from pangolins (Manis javanica, Pangolin-CoV) and cats (Felis catus, SARS(r)-CoV). Further analysis of the effective number of codon (ENC) usage bias showed a relatively higher bias for SARS-CoV and MERS-CoV compared to SARS-CoV-2. The ENC plot against GC3 suggested that the mutational bias might have a role in determining the codon usage variation among candidate viruses. A codon adaptability study on a few human host parasites (from different kingdoms), including CoVs, showed a diverse adaptability pattern. SARS-CoV-2 and SARS-CoV exhibit relatively lower but similar codon adaptability compared to MERS-CoV. |
format | Online Article Text |
id | pubmed-8131179 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Elsevier Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-81311792021-05-19 Comparative analysis of human coronaviruses focusing on nucleotide variability and synonymous codon usage patterns Das, Jayanta Kumar Roy, Swarup Genomics Original Article The prevailing COVID-19 pandemic has drawn the attention of the scientific community to study the evolutionary origin of Severe Acute Respiratory Syndrome Corona Virus 2 (SARS-CoV-2). This study is a comprehensive quantitative analysis of the protein-coding sequences of seven human coronaviruses (HCoVs) to decipher the nucleotide sequence variability and codon usage patterns. It is essential to understand the survival ability of the viruses, their adaptation to hosts, and their evolution. The current analysis revealed a high abundance of the relative dinucleotide (odds ratio), GC and CT pairs in the first and last two codon positions, respectively, as well as a low abundance of the CG pair in the last two positions of the codon, which might be related to the evolution of the viruses. A remarkable level of variability of GC content in the third position of the codon among the seven coronaviruses was observed. Codons with high RSCU values are primarily from the aliphatic and hydroxyl amino acid groups, and codons with low RSCU values belong to the aliphatic, cyclic, positively charged, and sulfur-containing amino acid groups. In order to elucidate the evolutionary processes of the seven coronaviruses, a phylogenetic tree (dendrogram) was constructed based on the RSCU scores of the codons. The severe and mild categories CoVs were positioned in different clades. A comparative phylogenetic study with other coronaviruses depicted that SARS-CoV-2 is close to the CoV isolated from pangolins (Manis javanica, Pangolin-CoV) and cats (Felis catus, SARS(r)-CoV). Further analysis of the effective number of codon (ENC) usage bias showed a relatively higher bias for SARS-CoV and MERS-CoV compared to SARS-CoV-2. The ENC plot against GC3 suggested that the mutational bias might have a role in determining the codon usage variation among candidate viruses. A codon adaptability study on a few human host parasites (from different kingdoms), including CoVs, showed a diverse adaptability pattern. SARS-CoV-2 and SARS-CoV exhibit relatively lower but similar codon adaptability compared to MERS-CoV. Elsevier Inc. 2021-07 2021-05-19 /pmc/articles/PMC8131179/ /pubmed/34019999 http://dx.doi.org/10.1016/j.ygeno.2021.05.008 Text en © 2021 Elsevier Inc. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active. |
spellingShingle | Original Article Das, Jayanta Kumar Roy, Swarup Comparative analysis of human coronaviruses focusing on nucleotide variability and synonymous codon usage patterns |
title | Comparative analysis of human coronaviruses focusing on nucleotide variability and synonymous codon usage patterns |
title_full | Comparative analysis of human coronaviruses focusing on nucleotide variability and synonymous codon usage patterns |
title_fullStr | Comparative analysis of human coronaviruses focusing on nucleotide variability and synonymous codon usage patterns |
title_full_unstemmed | Comparative analysis of human coronaviruses focusing on nucleotide variability and synonymous codon usage patterns |
title_short | Comparative analysis of human coronaviruses focusing on nucleotide variability and synonymous codon usage patterns |
title_sort | comparative analysis of human coronaviruses focusing on nucleotide variability and synonymous codon usage patterns |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8131179/ https://www.ncbi.nlm.nih.gov/pubmed/34019999 http://dx.doi.org/10.1016/j.ygeno.2021.05.008 |
work_keys_str_mv | AT dasjayantakumar comparativeanalysisofhumancoronavirusesfocusingonnucleotidevariabilityandsynonymouscodonusagepatterns AT royswarup comparativeanalysisofhumancoronavirusesfocusingonnucleotidevariabilityandsynonymouscodonusagepatterns |