Cargando…

Persistent topological Laplacian analysis of SARS-CoV-2 variants

Topological data analysis (TDA) is an emerging field in mathematics and data science. Its central technique, persistent homology, has had tremendous success in many science and engineering disciplines. However, persistent homology has limitations, including its inability to handle heterogeneous info...

Descripción completa

Detalles Bibliográficos
Autores principales: Wei, Xiaoqi, Chen, Jiahui, Guo-Wei, Wei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cornell University 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9900960/
https://www.ncbi.nlm.nih.gov/pubmed/36748007
_version_ 1784882945052377088
author Wei, Xiaoqi
Chen, Jiahui
Guo-Wei, Wei
author_facet Wei, Xiaoqi
Chen, Jiahui
Guo-Wei, Wei
author_sort Wei, Xiaoqi
collection PubMed
description Topological data analysis (TDA) is an emerging field in mathematics and data science. Its central technique, persistent homology, has had tremendous success in many science and engineering disciplines. However, persistent homology has limitations, including its inability to handle heterogeneous information, such as multiple types of geometric objects; being qualitative rather than quantitative, e.g., counting a 5-member ring the same as a 6-member ring, and a failure to describe non-topological changes, such as homotopic changes in protein-protein binding. Persistent topological Laplacians (PTLs), such as persistent Laplacian and persistent sheaf Laplacian, were proposed to overcome the limitations of persistent homology. In this work, we examine the modeling and analysis power of PTLs in the study of the protein structures of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) spike receptor binding domain (RBD). First, we employ PTLs to study how the RBD mutation-induced structural changes of RBD-angiotensin-converting enzyme 2 (ACE2) binding complexes are captured in the changes of spectra of the PTLs among SARS-CoV-2 variants. Additionally, we use PTLs to analyze the binding of RBD and ACE2-induced structural changes of various SARS-CoV-2 variants. Finally, we explore the impacts of computationally generated RBD structures on a topological deep learning paradigm and predictions of deep mutational scanning datasets for the SARS-CoV-2 Omicron BA.2 variant. Our results indicate that PTLs have advantages over persistent homology in analyzing protein structural changes and provide a powerful new TDA tool for data science.
format Online
Article
Text
id pubmed-9900960
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cornell University
record_format MEDLINE/PubMed
spelling pubmed-99009602023-02-07 Persistent topological Laplacian analysis of SARS-CoV-2 variants Wei, Xiaoqi Chen, Jiahui Guo-Wei, Wei ArXiv Article Topological data analysis (TDA) is an emerging field in mathematics and data science. Its central technique, persistent homology, has had tremendous success in many science and engineering disciplines. However, persistent homology has limitations, including its inability to handle heterogeneous information, such as multiple types of geometric objects; being qualitative rather than quantitative, e.g., counting a 5-member ring the same as a 6-member ring, and a failure to describe non-topological changes, such as homotopic changes in protein-protein binding. Persistent topological Laplacians (PTLs), such as persistent Laplacian and persistent sheaf Laplacian, were proposed to overcome the limitations of persistent homology. In this work, we examine the modeling and analysis power of PTLs in the study of the protein structures of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) spike receptor binding domain (RBD). First, we employ PTLs to study how the RBD mutation-induced structural changes of RBD-angiotensin-converting enzyme 2 (ACE2) binding complexes are captured in the changes of spectra of the PTLs among SARS-CoV-2 variants. Additionally, we use PTLs to analyze the binding of RBD and ACE2-induced structural changes of various SARS-CoV-2 variants. Finally, we explore the impacts of computationally generated RBD structures on a topological deep learning paradigm and predictions of deep mutational scanning datasets for the SARS-CoV-2 Omicron BA.2 variant. Our results indicate that PTLs have advantages over persistent homology in analyzing protein structural changes and provide a powerful new TDA tool for data science. Cornell University 2023-04-06 /pmc/articles/PMC9900960/ /pubmed/36748007 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use.
spellingShingle Article
Wei, Xiaoqi
Chen, Jiahui
Guo-Wei, Wei
Persistent topological Laplacian analysis of SARS-CoV-2 variants
title Persistent topological Laplacian analysis of SARS-CoV-2 variants
title_full Persistent topological Laplacian analysis of SARS-CoV-2 variants
title_fullStr Persistent topological Laplacian analysis of SARS-CoV-2 variants
title_full_unstemmed Persistent topological Laplacian analysis of SARS-CoV-2 variants
title_short Persistent topological Laplacian analysis of SARS-CoV-2 variants
title_sort persistent topological laplacian analysis of sars-cov-2 variants
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9900960/
https://www.ncbi.nlm.nih.gov/pubmed/36748007
work_keys_str_mv AT weixiaoqi persistenttopologicallaplaciananalysisofsarscov2variants
AT chenjiahui persistenttopologicallaplaciananalysisofsarscov2variants
AT guoweiwei persistenttopologicallaplaciananalysisofsarscov2variants