Cargando…

Weighted persistent homology for biomolecular data analysis

In this paper, we systematically review weighted persistent homology (WPH) models and their applications in biomolecular data analysis. Essentially, the weight value, which reflects physical, chemical and biological properties, can be assigned to vertices (atom centers), edges (bonds), or higher ord...

Descripción completa

Detalles Bibliográficos
Autores principales: Meng, Zhenyu, Anand, D. Vijay, Lu, Yunpeng, Wu, Jie, Xia, Kelin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7005716/
https://www.ncbi.nlm.nih.gov/pubmed/32034168
http://dx.doi.org/10.1038/s41598-019-55660-3
_version_ 1783494996425441280
author Meng, Zhenyu
Anand, D. Vijay
Lu, Yunpeng
Wu, Jie
Xia, Kelin
author_facet Meng, Zhenyu
Anand, D. Vijay
Lu, Yunpeng
Wu, Jie
Xia, Kelin
author_sort Meng, Zhenyu
collection PubMed
description In this paper, we systematically review weighted persistent homology (WPH) models and their applications in biomolecular data analysis. Essentially, the weight value, which reflects physical, chemical and biological properties, can be assigned to vertices (atom centers), edges (bonds), or higher order simplexes (cluster of atoms), depending on the biomolecular structure, function, and dynamics properties. Further, we propose the first localized weighted persistent homology (LWPH). Inspired by the great success of element specific persistent homology (ESPH), we do not treat biomolecules as an inseparable system like all previous weighted models, instead we decompose them into a series of local domains, which may be overlapped with each other. The general persistent homology or weighted persistent homology analysis is then applied on each of these local domains. In this way, functional properties, that are embedded in local structures, can be revealed. Our model has been applied to systematically study DNA structures. It has been found that our LWPH based features can be used to successfully discriminate the A-, B-, and Z-types of DNA. More importantly, our LWPH based principal component analysis (PCA) model can identify two configurational states of DNA structures in ion liquid environment, which can be revealed only by the complicated helical coordinate system. The great consistence with the helical-coordinate model demonstrates that our model captures local structure variations so well that it is comparable with geometric models. Moreover, geometric measurements are usually defined in local regions. For instance, the helical-coordinate system is limited to one or two basepairs. However, our LWPH can quantitatively characterize structure information in regions or domains with arbitrary sizes and shapes, where traditional geometrical measurements fail.
format Online
Article
Text
id pubmed-7005716
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-70057162020-02-18 Weighted persistent homology for biomolecular data analysis Meng, Zhenyu Anand, D. Vijay Lu, Yunpeng Wu, Jie Xia, Kelin Sci Rep Article In this paper, we systematically review weighted persistent homology (WPH) models and their applications in biomolecular data analysis. Essentially, the weight value, which reflects physical, chemical and biological properties, can be assigned to vertices (atom centers), edges (bonds), or higher order simplexes (cluster of atoms), depending on the biomolecular structure, function, and dynamics properties. Further, we propose the first localized weighted persistent homology (LWPH). Inspired by the great success of element specific persistent homology (ESPH), we do not treat biomolecules as an inseparable system like all previous weighted models, instead we decompose them into a series of local domains, which may be overlapped with each other. The general persistent homology or weighted persistent homology analysis is then applied on each of these local domains. In this way, functional properties, that are embedded in local structures, can be revealed. Our model has been applied to systematically study DNA structures. It has been found that our LWPH based features can be used to successfully discriminate the A-, B-, and Z-types of DNA. More importantly, our LWPH based principal component analysis (PCA) model can identify two configurational states of DNA structures in ion liquid environment, which can be revealed only by the complicated helical coordinate system. The great consistence with the helical-coordinate model demonstrates that our model captures local structure variations so well that it is comparable with geometric models. Moreover, geometric measurements are usually defined in local regions. For instance, the helical-coordinate system is limited to one or two basepairs. However, our LWPH can quantitatively characterize structure information in regions or domains with arbitrary sizes and shapes, where traditional geometrical measurements fail. Nature Publishing Group UK 2020-02-07 /pmc/articles/PMC7005716/ /pubmed/32034168 http://dx.doi.org/10.1038/s41598-019-55660-3 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Meng, Zhenyu
Anand, D. Vijay
Lu, Yunpeng
Wu, Jie
Xia, Kelin
Weighted persistent homology for biomolecular data analysis
title Weighted persistent homology for biomolecular data analysis
title_full Weighted persistent homology for biomolecular data analysis
title_fullStr Weighted persistent homology for biomolecular data analysis
title_full_unstemmed Weighted persistent homology for biomolecular data analysis
title_short Weighted persistent homology for biomolecular data analysis
title_sort weighted persistent homology for biomolecular data analysis
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7005716/
https://www.ncbi.nlm.nih.gov/pubmed/32034168
http://dx.doi.org/10.1038/s41598-019-55660-3
work_keys_str_mv AT mengzhenyu weightedpersistenthomologyforbiomoleculardataanalysis
AT ananddvijay weightedpersistenthomologyforbiomoleculardataanalysis
AT luyunpeng weightedpersistenthomologyforbiomoleculardataanalysis
AT wujie weightedpersistenthomologyforbiomoleculardataanalysis
AT xiakelin weightedpersistenthomologyforbiomoleculardataanalysis