Cargando…
Weighted-persistent-homology-based machine learning for RNA flexibility analysis
With the great significance of biomolecular flexibility in biomolecular dynamics and functional analysis, various experimental and theoretical models are developed. Experimentally, Debye-Waller factor, also known as B-factor, measures atomic mean-square displacement and is usually considered as an i...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7446851/ https://www.ncbi.nlm.nih.gov/pubmed/32822369 http://dx.doi.org/10.1371/journal.pone.0237747 |
_version_ | 1783574201539493888 |
---|---|
author | Pun, Chi Seng Yong, Brandon Yung Sin Xia, Kelin |
author_facet | Pun, Chi Seng Yong, Brandon Yung Sin Xia, Kelin |
author_sort | Pun, Chi Seng |
collection | PubMed |
description | With the great significance of biomolecular flexibility in biomolecular dynamics and functional analysis, various experimental and theoretical models are developed. Experimentally, Debye-Waller factor, also known as B-factor, measures atomic mean-square displacement and is usually considered as an important measurement for flexibility. Theoretically, elastic network models, Gaussian network model, flexibility-rigidity model, and other computational models have been proposed for flexibility analysis by shedding light on the biomolecular inner topological structures. Recently, a topology-based machine learning model has been proposed. By using the features from persistent homology, this model achieves a remarkable high Pearson correlation coefficient (PCC) in protein B-factor prediction. Motivated by its success, we propose weighted-persistent-homology (WPH)-based machine learning (WPHML) models for RNA flexibility analysis. Our WPH is a newly-proposed model, which incorporate physical, chemical and biological information into topological measurements using a weight function. In particular, we use local persistent homology (LPH) to focus on the topological information of local regions. Our WPHML model is validated on a well-established RNA dataset, and numerical experiments show that our model can achieve a PCC of up to 0.5822. The comparison with the previous sequence-information-based learning models shows that a consistent improvement in performance by at least 10% is achieved in our current model. |
format | Online Article Text |
id | pubmed-7446851 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-74468512020-08-26 Weighted-persistent-homology-based machine learning for RNA flexibility analysis Pun, Chi Seng Yong, Brandon Yung Sin Xia, Kelin PLoS One Research Article With the great significance of biomolecular flexibility in biomolecular dynamics and functional analysis, various experimental and theoretical models are developed. Experimentally, Debye-Waller factor, also known as B-factor, measures atomic mean-square displacement and is usually considered as an important measurement for flexibility. Theoretically, elastic network models, Gaussian network model, flexibility-rigidity model, and other computational models have been proposed for flexibility analysis by shedding light on the biomolecular inner topological structures. Recently, a topology-based machine learning model has been proposed. By using the features from persistent homology, this model achieves a remarkable high Pearson correlation coefficient (PCC) in protein B-factor prediction. Motivated by its success, we propose weighted-persistent-homology (WPH)-based machine learning (WPHML) models for RNA flexibility analysis. Our WPH is a newly-proposed model, which incorporate physical, chemical and biological information into topological measurements using a weight function. In particular, we use local persistent homology (LPH) to focus on the topological information of local regions. Our WPHML model is validated on a well-established RNA dataset, and numerical experiments show that our model can achieve a PCC of up to 0.5822. The comparison with the previous sequence-information-based learning models shows that a consistent improvement in performance by at least 10% is achieved in our current model. Public Library of Science 2020-08-21 /pmc/articles/PMC7446851/ /pubmed/32822369 http://dx.doi.org/10.1371/journal.pone.0237747 Text en © 2020 Pun et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Pun, Chi Seng Yong, Brandon Yung Sin Xia, Kelin Weighted-persistent-homology-based machine learning for RNA flexibility analysis |
title | Weighted-persistent-homology-based machine learning for RNA flexibility analysis |
title_full | Weighted-persistent-homology-based machine learning for RNA flexibility analysis |
title_fullStr | Weighted-persistent-homology-based machine learning for RNA flexibility analysis |
title_full_unstemmed | Weighted-persistent-homology-based machine learning for RNA flexibility analysis |
title_short | Weighted-persistent-homology-based machine learning for RNA flexibility analysis |
title_sort | weighted-persistent-homology-based machine learning for rna flexibility analysis |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7446851/ https://www.ncbi.nlm.nih.gov/pubmed/32822369 http://dx.doi.org/10.1371/journal.pone.0237747 |
work_keys_str_mv | AT punchiseng weightedpersistenthomologybasedmachinelearningforrnaflexibilityanalysis AT yongbrandonyungsin weightedpersistenthomologybasedmachinelearningforrnaflexibilityanalysis AT xiakelin weightedpersistenthomologybasedmachinelearningforrnaflexibilityanalysis |