Cargando…

Massive‐scale genomic analysis reveals SARS‐CoV‐2 mutation characteristics and evolutionary trends

The severe acute respiratory syndrome coronavirus 2 (SARS‐CoV‐2) pandemic resulted in significant societal costs. Hence, an in‐depth understanding of SARS‐CoV‐2 virus mutation and its evolution will help determine the direction of the COVID‐19 pandemic. In this study, we identified 296,728 de novo m...

Descripción completa

Detalles Bibliográficos
Autores principales: Sun, Yamin, Wang, Min, Lin, Wenchao, Dong, Wei, Xu, Jianguo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9538474/
https://www.ncbi.nlm.nih.gov/pubmed/37732331
http://dx.doi.org/10.1002/mlf2.12040
_version_ 1784803354366443520
author Sun, Yamin
Wang, Min
Lin, Wenchao
Dong, Wei
Xu, Jianguo
author_facet Sun, Yamin
Wang, Min
Lin, Wenchao
Dong, Wei
Xu, Jianguo
author_sort Sun, Yamin
collection PubMed
description The severe acute respiratory syndrome coronavirus 2 (SARS‐CoV‐2) pandemic resulted in significant societal costs. Hence, an in‐depth understanding of SARS‐CoV‐2 virus mutation and its evolution will help determine the direction of the COVID‐19 pandemic. In this study, we identified 296,728 de novo mutations in more than 2,800,000 high‐quality SARS‐CoV‐2 genomes. All possible factors affecting the mutation frequency of SARS‐CoV‐2 in human hosts were analyzed, including zinc finger antiviral proteins, sequence context, amino acid change, and translation efficiency. As a result, we proposed that when adenine (A) and tyrosine (T) bases are in the context of AM (M stands for adenine or cytosine) or TA motif, A or T base has lower mutation frequency. Furthermore, we hypothesized that translation efficiency can affect the mutation frequency of the third position of the codon by the selection, which explains why SARS‐CoV‐2 prefers AT3 codons usage. In addition, we found a host‐specific asymmetric dinucleotide mutation frequency in the SARS‐CoV‐2 genome, which provides a new basis for determining the origin of the SARS‐CoV‐2. Finally, we summarize all possible factors affecting mutation frequency and provide insights into the mutation characteristics and evolutionary trends of SARS‐CoV‐2.
format Online
Article
Text
id pubmed-9538474
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-95384742022-10-11 Massive‐scale genomic analysis reveals SARS‐CoV‐2 mutation characteristics and evolutionary trends Sun, Yamin Wang, Min Lin, Wenchao Dong, Wei Xu, Jianguo mLife Original Research The severe acute respiratory syndrome coronavirus 2 (SARS‐CoV‐2) pandemic resulted in significant societal costs. Hence, an in‐depth understanding of SARS‐CoV‐2 virus mutation and its evolution will help determine the direction of the COVID‐19 pandemic. In this study, we identified 296,728 de novo mutations in more than 2,800,000 high‐quality SARS‐CoV‐2 genomes. All possible factors affecting the mutation frequency of SARS‐CoV‐2 in human hosts were analyzed, including zinc finger antiviral proteins, sequence context, amino acid change, and translation efficiency. As a result, we proposed that when adenine (A) and tyrosine (T) bases are in the context of AM (M stands for adenine or cytosine) or TA motif, A or T base has lower mutation frequency. Furthermore, we hypothesized that translation efficiency can affect the mutation frequency of the third position of the codon by the selection, which explains why SARS‐CoV‐2 prefers AT3 codons usage. In addition, we found a host‐specific asymmetric dinucleotide mutation frequency in the SARS‐CoV‐2 genome, which provides a new basis for determining the origin of the SARS‐CoV‐2. Finally, we summarize all possible factors affecting mutation frequency and provide insights into the mutation characteristics and evolutionary trends of SARS‐CoV‐2. John Wiley and Sons Inc. 2022-09-26 2022-09 /pmc/articles/PMC9538474/ /pubmed/37732331 http://dx.doi.org/10.1002/mlf2.12040 Text en © 2022 The Authors. mLife published by John Wiley & Sons Australia, Ltd. on behalf of Institute of Microbiology, Chinese Academy of Sciences. https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Research
Sun, Yamin
Wang, Min
Lin, Wenchao
Dong, Wei
Xu, Jianguo
Massive‐scale genomic analysis reveals SARS‐CoV‐2 mutation characteristics and evolutionary trends
title Massive‐scale genomic analysis reveals SARS‐CoV‐2 mutation characteristics and evolutionary trends
title_full Massive‐scale genomic analysis reveals SARS‐CoV‐2 mutation characteristics and evolutionary trends
title_fullStr Massive‐scale genomic analysis reveals SARS‐CoV‐2 mutation characteristics and evolutionary trends
title_full_unstemmed Massive‐scale genomic analysis reveals SARS‐CoV‐2 mutation characteristics and evolutionary trends
title_short Massive‐scale genomic analysis reveals SARS‐CoV‐2 mutation characteristics and evolutionary trends
title_sort massive‐scale genomic analysis reveals sars‐cov‐2 mutation characteristics and evolutionary trends
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9538474/
https://www.ncbi.nlm.nih.gov/pubmed/37732331
http://dx.doi.org/10.1002/mlf2.12040
work_keys_str_mv AT sunyamin massivescalegenomicanalysisrevealssarscov2mutationcharacteristicsandevolutionarytrends
AT wangmin massivescalegenomicanalysisrevealssarscov2mutationcharacteristicsandevolutionarytrends
AT linwenchao massivescalegenomicanalysisrevealssarscov2mutationcharacteristicsandevolutionarytrends
AT dongwei massivescalegenomicanalysisrevealssarscov2mutationcharacteristicsandevolutionarytrends
AT xujianguo massivescalegenomicanalysisrevealssarscov2mutationcharacteristicsandevolutionarytrends