Cargando…

Molecular Cavity Topological Representation for Pattern Analysis: A NLP Analogy-Based Word2Vec Method

Cavity analysis in molecular dynamics is important for understanding molecular function. However, analyzing the dynamic pattern of molecular cavities remains a difficult task. In this paper, we propose a novel method to topologically represent molecular cavities by vectorization. First, a characteri...

Descripción completa

Detalles Bibliográficos
Autores principales: Guo, Dongliang, Wang, Qiaoqiao, Liang, Meng, Liu, Wei, Nie, Junlan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6928730/
https://www.ncbi.nlm.nih.gov/pubmed/31795343
http://dx.doi.org/10.3390/ijms20236019
_version_ 1783482539765137408
author Guo, Dongliang
Wang, Qiaoqiao
Liang, Meng
Liu, Wei
Nie, Junlan
author_facet Guo, Dongliang
Wang, Qiaoqiao
Liang, Meng
Liu, Wei
Nie, Junlan
author_sort Guo, Dongliang
collection PubMed
description Cavity analysis in molecular dynamics is important for understanding molecular function. However, analyzing the dynamic pattern of molecular cavities remains a difficult task. In this paper, we propose a novel method to topologically represent molecular cavities by vectorization. First, a characterization of cavities is established through Word2Vec model, based on an analogy between the cavities and natural language processing (NLP) terms. Then, we use some techniques such as dimension reduction and clustering to conduct an exploratory analysis of the vectorized molecular cavity. On a real data set, we demonstrate that our approach is applicable to maintain the topological characteristics of the cavity and can find the change patterns from a large number of cavities.
format Online
Article
Text
id pubmed-6928730
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-69287302019-12-26 Molecular Cavity Topological Representation for Pattern Analysis: A NLP Analogy-Based Word2Vec Method Guo, Dongliang Wang, Qiaoqiao Liang, Meng Liu, Wei Nie, Junlan Int J Mol Sci Article Cavity analysis in molecular dynamics is important for understanding molecular function. However, analyzing the dynamic pattern of molecular cavities remains a difficult task. In this paper, we propose a novel method to topologically represent molecular cavities by vectorization. First, a characterization of cavities is established through Word2Vec model, based on an analogy between the cavities and natural language processing (NLP) terms. Then, we use some techniques such as dimension reduction and clustering to conduct an exploratory analysis of the vectorized molecular cavity. On a real data set, we demonstrate that our approach is applicable to maintain the topological characteristics of the cavity and can find the change patterns from a large number of cavities. MDPI 2019-11-29 /pmc/articles/PMC6928730/ /pubmed/31795343 http://dx.doi.org/10.3390/ijms20236019 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Guo, Dongliang
Wang, Qiaoqiao
Liang, Meng
Liu, Wei
Nie, Junlan
Molecular Cavity Topological Representation for Pattern Analysis: A NLP Analogy-Based Word2Vec Method
title Molecular Cavity Topological Representation for Pattern Analysis: A NLP Analogy-Based Word2Vec Method
title_full Molecular Cavity Topological Representation for Pattern Analysis: A NLP Analogy-Based Word2Vec Method
title_fullStr Molecular Cavity Topological Representation for Pattern Analysis: A NLP Analogy-Based Word2Vec Method
title_full_unstemmed Molecular Cavity Topological Representation for Pattern Analysis: A NLP Analogy-Based Word2Vec Method
title_short Molecular Cavity Topological Representation for Pattern Analysis: A NLP Analogy-Based Word2Vec Method
title_sort molecular cavity topological representation for pattern analysis: a nlp analogy-based word2vec method
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6928730/
https://www.ncbi.nlm.nih.gov/pubmed/31795343
http://dx.doi.org/10.3390/ijms20236019
work_keys_str_mv AT guodongliang molecularcavitytopologicalrepresentationforpatternanalysisanlpanalogybasedword2vecmethod
AT wangqiaoqiao molecularcavitytopologicalrepresentationforpatternanalysisanlpanalogybasedword2vecmethod
AT liangmeng molecularcavitytopologicalrepresentationforpatternanalysisanlpanalogybasedword2vecmethod
AT liuwei molecularcavitytopologicalrepresentationforpatternanalysisanlpanalogybasedword2vecmethod
AT niejunlan molecularcavitytopologicalrepresentationforpatternanalysisanlpanalogybasedword2vecmethod