AI-based search for convergently expanding, advantageous mutations in SARS-CoV-2 by focusing on oligonucleotide frequencies

Among mutations that occur in SARS-CoV-2, efficient identification of mutations advantageous for viral replication and transmission is important to characterize and defeat this rampant virus. Mutations that rapidly expand in frequency in a viral population are candidates for advantageous mutations, but n...

Bibliographic Details
Main Authors: Ikemura, Toshimichi, Iwasaki, Yuki, Wada, Kennosuke, Wada, Yoshiko, Abe, Takashi
Format: Online Article Text
Language: English
Published: Public Library of Science 2022
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9432735/
https://www.ncbi.nlm.nih.gov/pubmed/36044525
http://dx.doi.org/10.1371/journal.pone.0273860
author Ikemura, Toshimichi
Iwasaki, Yuki
Wada, Kennosuke
Wada, Yoshiko
Abe, Takashi
collection PubMed
description Among mutations that occur in SARS-CoV-2, efficient identification of mutations advantageous for viral replication and transmission is important to characterize and defeat this rampant virus. Mutations that rapidly expand in frequency in a viral population are candidates for advantageous mutations, but neutral mutations hitchhiking with advantageous mutations are also likely to be included. To distinguish these, we focus on mutations that appear to occur independently in different lineages and expand in frequency in a convergent evolutionary manner. Batch-learning SOM (BLSOM) can separate SARS-CoV-2 genome sequences by lineage given only their oligonucleotide composition. Focusing on remarkably expanding 20-mers, each of which is represented by only one copy in the viral genome, allows us to correlate the expanding 20-mers to mutations. Using visualization functions in BLSOM, we can efficiently identify mutations that have expanded remarkably both in the Omicron lineage, which is phylogenetically distinct from other lineages, and in other lineages. Most of these mutations involved changes in amino acids, but there were a few that did not, such as an intergenic mutation.
format Online
Article
Text
id pubmed-9432735
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-9432735 2022-09-01 AI-based search for convergently expanding, advantageous mutations in SARS-CoV-2 by focusing on oligonucleotide frequencies Ikemura, Toshimichi Iwasaki, Yuki Wada, Kennosuke Wada, Yoshiko Abe, Takashi PLoS One Research Article Among mutations that occur in SARS-CoV-2, efficient identification of mutations advantageous for viral replication and transmission is important to characterize and defeat this rampant virus. Mutations that rapidly expand in frequency in a viral population are candidates for advantageous mutations, but neutral mutations hitchhiking with advantageous mutations are also likely to be included. To distinguish these, we focus on mutations that appear to occur independently in different lineages and expand in frequency in a convergent evolutionary manner. Batch-learning SOM (BLSOM) can separate SARS-CoV-2 genome sequences by lineage given only their oligonucleotide composition. Focusing on remarkably expanding 20-mers, each of which is represented by only one copy in the viral genome, allows us to correlate the expanding 20-mers to mutations. Using visualization functions in BLSOM, we can efficiently identify mutations that have expanded remarkably both in the Omicron lineage, which is phylogenetically distinct from other lineages, and in other lineages. Most of these mutations involved changes in amino acids, but there were a few that did not, such as an intergenic mutation. Public Library of Science 2022-08-31 /pmc/articles/PMC9432735/ /pubmed/36044525 http://dx.doi.org/10.1371/journal.pone.0273860 Text en © 2022 Ikemura et al https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
title AI-based search for convergently expanding, advantageous mutations in SARS-CoV-2 by focusing on oligonucleotide frequencies
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9432735/
https://www.ncbi.nlm.nih.gov/pubmed/36044525
http://dx.doi.org/10.1371/journal.pone.0273860
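
The abstract links expanding 20-mers, each present as a single copy in the viral genome, to individual mutations. The following Python sketch illustrates that idea on toy sequences only. It is not the authors' BLSOM pipeline, and the function names (`kmer_counts`, `single_copy_kmers`, `novel_kmers`) and the example sequences are hypothetical, chosen for illustration. It counts overlapping k-mers, keeps those that occur exactly once, and reports the single-copy k-mers of a variant that are absent from a reference.

```python
from collections import Counter


def kmer_counts(seq: str, k: int) -> Counter:
    """Count every overlapping k-mer in seq, skipping k-mers with non-ACGT symbols."""
    seq = seq.upper()
    counts = Counter()
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        if set(kmer) <= set("ACGT"):
            counts[kmer] += 1
    return counts


def single_copy_kmers(seq: str, k: int) -> set:
    """Return the k-mers that occur exactly once in seq."""
    return {kmer for kmer, n in kmer_counts(seq, k).items() if n == 1}


def novel_kmers(reference: str, variant: str, k: int) -> list:
    """Return, sorted, the single-copy k-mers of variant that never occur in reference."""
    reference_kmers = set(kmer_counts(reference, k))
    return sorted(single_copy_kmers(variant, k) - reference_kmers)


if __name__ == "__main__":
    # Toy sequences (assumption): the variant differs from the reference
    # by one substitution. Real use would take k = 20 and full genomes.
    reference = "AAACCCGGGTTT"
    variant = "AAACCCTGGTTT"
    for kmer in novel_kmers(reference, variant, k=4):
        print(kmer)
```

A single substitution away from the sequence ends changes every k-mer that overlaps it, so it can produce up to k new k-mers. Restricting attention to single-copy k-mers is what lets each new k-mer be traced back to one genomic position.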