Cargando…

Sequence Similarity Network Analysis Provides Insight into the Temporal and Geographical Distribution of Mutations in SARS-CoV-2 Spike Protein

Severe acute respiratory syndrome-related coronavirus (SARS-CoV-2), which still infects hundreds of thousands of people globally each day despite various countermeasures, has been mutating rapidly. Mutations in the spike (S) protein seem to play a vital role in viral stability, transmission, and ada...

Descripción completa

Detalles Bibliográficos
Autores principales: Patil, Shruti S., Catanese, Helen N., Brayton, Kelly A., Lofgren, Eric T., Gebremedhin, Assefaw H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9413517/
https://www.ncbi.nlm.nih.gov/pubmed/36016294
http://dx.doi.org/10.3390/v14081672
_version_ 1784775771211956224
author Patil, Shruti S.
Catanese, Helen N.
Brayton, Kelly A.
Lofgren, Eric T.
Gebremedhin, Assefaw H.
author_facet Patil, Shruti S.
Catanese, Helen N.
Brayton, Kelly A.
Lofgren, Eric T.
Gebremedhin, Assefaw H.
author_sort Patil, Shruti S.
collection PubMed
description Severe acute respiratory syndrome-related coronavirus (SARS-CoV-2), which still infects hundreds of thousands of people globally each day despite various countermeasures, has been mutating rapidly. Mutations in the spike (S) protein seem to play a vital role in viral stability, transmission, and adaptability. Therefore, to control the spread of the virus, it is important to gain insight into the evolution and transmission of the S protein. This study deals with the temporal and geographical distribution of mutant S proteins from sequences gathered across the US over a period of 19 months in 2020 and 2021. The S protein sequences are studied using two approaches: (i) multiple sequence alignment is used to identify prominent mutations and highly mutable regions and (ii) sequence similarity networks are subsequently employed to gain further insight and study mutation profiles of concerning variants across the defined time periods and states. Additionally, we tracked the variants using visualizations on geographical maps. The visualizations produced using the Directed Weighted All Nearest Neighbors (DiWANN) networks and maps provided insights into the transmission of the virus that reflect well the statistics reported for the time periods studied. We found that the networks created using DiWANN are superior to commonly used approximate distance networks created using BLAST bitscores. The study offers a richer computational approach to analyze the transmission profile of the prominent S protein mutations in SARS-CoV-2 and can be extended to other proteins and viruses.
format Online
Article
Text
id pubmed-9413517
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-94135172022-08-27 Sequence Similarity Network Analysis Provides Insight into the Temporal and Geographical Distribution of Mutations in SARS-CoV-2 Spike Protein Patil, Shruti S. Catanese, Helen N. Brayton, Kelly A. Lofgren, Eric T. Gebremedhin, Assefaw H. Viruses Article Severe acute respiratory syndrome-related coronavirus (SARS-CoV-2), which still infects hundreds of thousands of people globally each day despite various countermeasures, has been mutating rapidly. Mutations in the spike (S) protein seem to play a vital role in viral stability, transmission, and adaptability. Therefore, to control the spread of the virus, it is important to gain insight into the evolution and transmission of the S protein. This study deals with the temporal and geographical distribution of mutant S proteins from sequences gathered across the US over a period of 19 months in 2020 and 2021. The S protein sequences are studied using two approaches: (i) multiple sequence alignment is used to identify prominent mutations and highly mutable regions and (ii) sequence similarity networks are subsequently employed to gain further insight and study mutation profiles of concerning variants across the defined time periods and states. Additionally, we tracked the variants using visualizations on geographical maps. The visualizations produced using the Directed Weighted All Nearest Neighbors (DiWANN) networks and maps provided insights into the transmission of the virus that reflect well the statistics reported for the time periods studied. We found that the networks created using DiWANN are superior to commonly used approximate distance networks created using BLAST bitscores. The study offers a richer computational approach to analyze the transmission profile of the prominent S protein mutations in SARS-CoV-2 and can be extended to other proteins and viruses. MDPI 2022-07-29 /pmc/articles/PMC9413517/ /pubmed/36016294 http://dx.doi.org/10.3390/v14081672 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Patil, Shruti S.
Catanese, Helen N.
Brayton, Kelly A.
Lofgren, Eric T.
Gebremedhin, Assefaw H.
Sequence Similarity Network Analysis Provides Insight into the Temporal and Geographical Distribution of Mutations in SARS-CoV-2 Spike Protein
title Sequence Similarity Network Analysis Provides Insight into the Temporal and Geographical Distribution of Mutations in SARS-CoV-2 Spike Protein
title_full Sequence Similarity Network Analysis Provides Insight into the Temporal and Geographical Distribution of Mutations in SARS-CoV-2 Spike Protein
title_fullStr Sequence Similarity Network Analysis Provides Insight into the Temporal and Geographical Distribution of Mutations in SARS-CoV-2 Spike Protein
title_full_unstemmed Sequence Similarity Network Analysis Provides Insight into the Temporal and Geographical Distribution of Mutations in SARS-CoV-2 Spike Protein
title_short Sequence Similarity Network Analysis Provides Insight into the Temporal and Geographical Distribution of Mutations in SARS-CoV-2 Spike Protein
title_sort sequence similarity network analysis provides insight into the temporal and geographical distribution of mutations in sars-cov-2 spike protein
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9413517/
https://www.ncbi.nlm.nih.gov/pubmed/36016294
http://dx.doi.org/10.3390/v14081672
work_keys_str_mv AT patilshrutis sequencesimilaritynetworkanalysisprovidesinsightintothetemporalandgeographicaldistributionofmutationsinsarscov2spikeprotein
AT catanesehelenn sequencesimilaritynetworkanalysisprovidesinsightintothetemporalandgeographicaldistributionofmutationsinsarscov2spikeprotein
AT braytonkellya sequencesimilaritynetworkanalysisprovidesinsightintothetemporalandgeographicaldistributionofmutationsinsarscov2spikeprotein
AT lofgrenerict sequencesimilaritynetworkanalysisprovidesinsightintothetemporalandgeographicaldistributionofmutationsinsarscov2spikeprotein
AT gebremedhinassefawh sequencesimilaritynetworkanalysisprovidesinsightintothetemporalandgeographicaldistributionofmutationsinsarscov2spikeprotein