Cargando…

Weighted-persistent-homology-based machine learning for RNA flexibility analysis

With the great significance of biomolecular flexibility in biomolecular dynamics and functional analysis, various experimental and theoretical models are developed. Experimentally, Debye-Waller factor, also known as B-factor, measures atomic mean-square displacement and is usually considered as an i...

Descripción completa

Detalles Bibliográficos
Autores principales: Pun, Chi Seng, Yong, Brandon Yung Sin, Xia, Kelin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7446851/
https://www.ncbi.nlm.nih.gov/pubmed/32822369
http://dx.doi.org/10.1371/journal.pone.0237747
_version_ 1783574201539493888
author Pun, Chi Seng
Yong, Brandon Yung Sin
Xia, Kelin
author_facet Pun, Chi Seng
Yong, Brandon Yung Sin
Xia, Kelin
author_sort Pun, Chi Seng
collection PubMed
description With the great significance of biomolecular flexibility in biomolecular dynamics and functional analysis, various experimental and theoretical models are developed. Experimentally, Debye-Waller factor, also known as B-factor, measures atomic mean-square displacement and is usually considered as an important measurement for flexibility. Theoretically, elastic network models, Gaussian network model, flexibility-rigidity model, and other computational models have been proposed for flexibility analysis by shedding light on the biomolecular inner topological structures. Recently, a topology-based machine learning model has been proposed. By using the features from persistent homology, this model achieves a remarkable high Pearson correlation coefficient (PCC) in protein B-factor prediction. Motivated by its success, we propose weighted-persistent-homology (WPH)-based machine learning (WPHML) models for RNA flexibility analysis. Our WPH is a newly-proposed model, which incorporate physical, chemical and biological information into topological measurements using a weight function. In particular, we use local persistent homology (LPH) to focus on the topological information of local regions. Our WPHML model is validated on a well-established RNA dataset, and numerical experiments show that our model can achieve a PCC of up to 0.5822. The comparison with the previous sequence-information-based learning models shows that a consistent improvement in performance by at least 10% is achieved in our current model.
format Online
Article
Text
id pubmed-7446851
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-74468512020-08-26 Weighted-persistent-homology-based machine learning for RNA flexibility analysis Pun, Chi Seng Yong, Brandon Yung Sin Xia, Kelin PLoS One Research Article With the great significance of biomolecular flexibility in biomolecular dynamics and functional analysis, various experimental and theoretical models are developed. Experimentally, Debye-Waller factor, also known as B-factor, measures atomic mean-square displacement and is usually considered as an important measurement for flexibility. Theoretically, elastic network models, Gaussian network model, flexibility-rigidity model, and other computational models have been proposed for flexibility analysis by shedding light on the biomolecular inner topological structures. Recently, a topology-based machine learning model has been proposed. By using the features from persistent homology, this model achieves a remarkable high Pearson correlation coefficient (PCC) in protein B-factor prediction. Motivated by its success, we propose weighted-persistent-homology (WPH)-based machine learning (WPHML) models for RNA flexibility analysis. Our WPH is a newly-proposed model, which incorporate physical, chemical and biological information into topological measurements using a weight function. In particular, we use local persistent homology (LPH) to focus on the topological information of local regions. Our WPHML model is validated on a well-established RNA dataset, and numerical experiments show that our model can achieve a PCC of up to 0.5822. The comparison with the previous sequence-information-based learning models shows that a consistent improvement in performance by at least 10% is achieved in our current model. Public Library of Science 2020-08-21 /pmc/articles/PMC7446851/ /pubmed/32822369 http://dx.doi.org/10.1371/journal.pone.0237747 Text en © 2020 Pun et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Pun, Chi Seng
Yong, Brandon Yung Sin
Xia, Kelin
Weighted-persistent-homology-based machine learning for RNA flexibility analysis
title Weighted-persistent-homology-based machine learning for RNA flexibility analysis
title_full Weighted-persistent-homology-based machine learning for RNA flexibility analysis
title_fullStr Weighted-persistent-homology-based machine learning for RNA flexibility analysis
title_full_unstemmed Weighted-persistent-homology-based machine learning for RNA flexibility analysis
title_short Weighted-persistent-homology-based machine learning for RNA flexibility analysis
title_sort weighted-persistent-homology-based machine learning for rna flexibility analysis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7446851/
https://www.ncbi.nlm.nih.gov/pubmed/32822369
http://dx.doi.org/10.1371/journal.pone.0237747
work_keys_str_mv AT punchiseng weightedpersistenthomologybasedmachinelearningforrnaflexibilityanalysis
AT yongbrandonyungsin weightedpersistenthomologybasedmachinelearningforrnaflexibilityanalysis
AT xiakelin weightedpersistenthomologybasedmachinelearningforrnaflexibilityanalysis