Cargando…
Sequence Similarity Network Analysis Provides Insight into the Temporal and Geographical Distribution of Mutations in SARS-CoV-2 Spike Protein
Severe acute respiratory syndrome-related coronavirus (SARS-CoV-2), which still infects hundreds of thousands of people globally each day despite various countermeasures, has been mutating rapidly. Mutations in the spike (S) protein seem to play a vital role in viral stability, transmission, and ada...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9413517/ https://www.ncbi.nlm.nih.gov/pubmed/36016294 http://dx.doi.org/10.3390/v14081672 |
_version_ | 1784775771211956224 |
---|---|
author | Patil, Shruti S. Catanese, Helen N. Brayton, Kelly A. Lofgren, Eric T. Gebremedhin, Assefaw H. |
author_facet | Patil, Shruti S. Catanese, Helen N. Brayton, Kelly A. Lofgren, Eric T. Gebremedhin, Assefaw H. |
author_sort | Patil, Shruti S. |
collection | PubMed |
description | Severe acute respiratory syndrome-related coronavirus (SARS-CoV-2), which still infects hundreds of thousands of people globally each day despite various countermeasures, has been mutating rapidly. Mutations in the spike (S) protein seem to play a vital role in viral stability, transmission, and adaptability. Therefore, to control the spread of the virus, it is important to gain insight into the evolution and transmission of the S protein. This study deals with the temporal and geographical distribution of mutant S proteins from sequences gathered across the US over a period of 19 months in 2020 and 2021. The S protein sequences are studied using two approaches: (i) multiple sequence alignment is used to identify prominent mutations and highly mutable regions and (ii) sequence similarity networks are subsequently employed to gain further insight and study mutation profiles of concerning variants across the defined time periods and states. Additionally, we tracked the variants using visualizations on geographical maps. The visualizations produced using the Directed Weighted All Nearest Neighbors (DiWANN) networks and maps provided insights into the transmission of the virus that reflect well the statistics reported for the time periods studied. We found that the networks created using DiWANN are superior to commonly used approximate distance networks created using BLAST bitscores. The study offers a richer computational approach to analyze the transmission profile of the prominent S protein mutations in SARS-CoV-2 and can be extended to other proteins and viruses. |
format | Online Article Text |
id | pubmed-9413517 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-94135172022-08-27 Sequence Similarity Network Analysis Provides Insight into the Temporal and Geographical Distribution of Mutations in SARS-CoV-2 Spike Protein Patil, Shruti S. Catanese, Helen N. Brayton, Kelly A. Lofgren, Eric T. Gebremedhin, Assefaw H. Viruses Article Severe acute respiratory syndrome-related coronavirus (SARS-CoV-2), which still infects hundreds of thousands of people globally each day despite various countermeasures, has been mutating rapidly. Mutations in the spike (S) protein seem to play a vital role in viral stability, transmission, and adaptability. Therefore, to control the spread of the virus, it is important to gain insight into the evolution and transmission of the S protein. This study deals with the temporal and geographical distribution of mutant S proteins from sequences gathered across the US over a period of 19 months in 2020 and 2021. The S protein sequences are studied using two approaches: (i) multiple sequence alignment is used to identify prominent mutations and highly mutable regions and (ii) sequence similarity networks are subsequently employed to gain further insight and study mutation profiles of concerning variants across the defined time periods and states. Additionally, we tracked the variants using visualizations on geographical maps. The visualizations produced using the Directed Weighted All Nearest Neighbors (DiWANN) networks and maps provided insights into the transmission of the virus that reflect well the statistics reported for the time periods studied. We found that the networks created using DiWANN are superior to commonly used approximate distance networks created using BLAST bitscores. The study offers a richer computational approach to analyze the transmission profile of the prominent S protein mutations in SARS-CoV-2 and can be extended to other proteins and viruses. MDPI 2022-07-29 /pmc/articles/PMC9413517/ /pubmed/36016294 http://dx.doi.org/10.3390/v14081672 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Patil, Shruti S. Catanese, Helen N. Brayton, Kelly A. Lofgren, Eric T. Gebremedhin, Assefaw H. Sequence Similarity Network Analysis Provides Insight into the Temporal and Geographical Distribution of Mutations in SARS-CoV-2 Spike Protein |
title | Sequence Similarity Network Analysis Provides Insight into the Temporal and Geographical Distribution of Mutations in SARS-CoV-2 Spike Protein |
title_full | Sequence Similarity Network Analysis Provides Insight into the Temporal and Geographical Distribution of Mutations in SARS-CoV-2 Spike Protein |
title_fullStr | Sequence Similarity Network Analysis Provides Insight into the Temporal and Geographical Distribution of Mutations in SARS-CoV-2 Spike Protein |
title_full_unstemmed | Sequence Similarity Network Analysis Provides Insight into the Temporal and Geographical Distribution of Mutations in SARS-CoV-2 Spike Protein |
title_short | Sequence Similarity Network Analysis Provides Insight into the Temporal and Geographical Distribution of Mutations in SARS-CoV-2 Spike Protein |
title_sort | sequence similarity network analysis provides insight into the temporal and geographical distribution of mutations in sars-cov-2 spike protein |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9413517/ https://www.ncbi.nlm.nih.gov/pubmed/36016294 http://dx.doi.org/10.3390/v14081672 |
work_keys_str_mv | AT patilshrutis sequencesimilaritynetworkanalysisprovidesinsightintothetemporalandgeographicaldistributionofmutationsinsarscov2spikeprotein AT catanesehelenn sequencesimilaritynetworkanalysisprovidesinsightintothetemporalandgeographicaldistributionofmutationsinsarscov2spikeprotein AT braytonkellya sequencesimilaritynetworkanalysisprovidesinsightintothetemporalandgeographicaldistributionofmutationsinsarscov2spikeprotein AT lofgrenerict sequencesimilaritynetworkanalysisprovidesinsightintothetemporalandgeographicaldistributionofmutationsinsarscov2spikeprotein AT gebremedhinassefawh sequencesimilaritynetworkanalysisprovidesinsightintothetemporalandgeographicaldistributionofmutationsinsarscov2spikeprotein |