Cargando…

LPATH: A semi-automated Python tool for clustering molecular pathways

The pathways by which a molecular process transitions to a target state are highly sought-after as direct views of a transition mechanism. While great strides have been made in the physics-based simulation of such pathways, the analysis of these pathways can be a major challenge due to their diversi...

Descripción completa

Detalles Bibliográficos
Autores principales: Bogetti, Anthony T., Leung, Jeremy M. G., Chong, Lillian T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10462149/
https://www.ncbi.nlm.nih.gov/pubmed/37645995
http://dx.doi.org/10.1101/2023.08.17.553774
_version_ 1785097997824032768
author Bogetti, Anthony T.
Leung, Jeremy M. G.
Chong, Lillian T.
author_facet Bogetti, Anthony T.
Leung, Jeremy M. G.
Chong, Lillian T.
author_sort Bogetti, Anthony T.
collection PubMed
description The pathways by which a molecular process transitions to a target state are highly sought-after as direct views of a transition mechanism. While great strides have been made in the physics-based simulation of such pathways, the analysis of these pathways can be a major challenge due to their diversity and variable lengths. Here we present the LPATH Python tool, which implements a semi-automated method for linguistics-assisted clustering of pathways into distinct classes (or routes). This method involves three steps: 1) discretizing the configurational space into key states, 2) extracting a text-string sequence of key visited states for each pathway, and 3) pairwise matching of pathways based on a text-string similarity score. To circumvent the prohibitive memory requirements of the first step, we have implemented a general two-stage method for clustering conformational states that exploits machine learning. LPATH is primarily designed for use with the WESTPA software for weighted ensemble simulations; however, the tool can also be applied to conventional simulations. As demonstrated for the C7(eq) to C7(ax) conformational transition of alanine dipeptide, LPATH provides physically reasonable classes of pathways and corresponding probabilities.
format Online
Article
Text
id pubmed-10462149
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-104621492023-08-29 LPATH: A semi-automated Python tool for clustering molecular pathways Bogetti, Anthony T. Leung, Jeremy M. G. Chong, Lillian T. bioRxiv Article The pathways by which a molecular process transitions to a target state are highly sought-after as direct views of a transition mechanism. While great strides have been made in the physics-based simulation of such pathways, the analysis of these pathways can be a major challenge due to their diversity and variable lengths. Here we present the LPATH Python tool, which implements a semi-automated method for linguistics-assisted clustering of pathways into distinct classes (or routes). This method involves three steps: 1) discretizing the configurational space into key states, 2) extracting a text-string sequence of key visited states for each pathway, and 3) pairwise matching of pathways based on a text-string similarity score. To circumvent the prohibitive memory requirements of the first step, we have implemented a general two-stage method for clustering conformational states that exploits machine learning. LPATH is primarily designed for use with the WESTPA software for weighted ensemble simulations; however, the tool can also be applied to conventional simulations. As demonstrated for the C7(eq) to C7(ax) conformational transition of alanine dipeptide, LPATH provides physically reasonable classes of pathways and corresponding probabilities. Cold Spring Harbor Laboratory 2023-10-19 /pmc/articles/PMC10462149/ /pubmed/37645995 http://dx.doi.org/10.1101/2023.08.17.553774 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator.
spellingShingle Article
Bogetti, Anthony T.
Leung, Jeremy M. G.
Chong, Lillian T.
LPATH: A semi-automated Python tool for clustering molecular pathways
title LPATH: A semi-automated Python tool for clustering molecular pathways
title_full LPATH: A semi-automated Python tool for clustering molecular pathways
title_fullStr LPATH: A semi-automated Python tool for clustering molecular pathways
title_full_unstemmed LPATH: A semi-automated Python tool for clustering molecular pathways
title_short LPATH: A semi-automated Python tool for clustering molecular pathways
title_sort lpath: a semi-automated python tool for clustering molecular pathways
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10462149/
https://www.ncbi.nlm.nih.gov/pubmed/37645995
http://dx.doi.org/10.1101/2023.08.17.553774
work_keys_str_mv AT bogettianthonyt lpathasemiautomatedpythontoolforclusteringmolecularpathways
AT leungjeremymg lpathasemiautomatedpythontoolforclusteringmolecularpathways
AT chonglilliant lpathasemiautomatedpythontoolforclusteringmolecularpathways