Cargando…
LPATH: A semi-automated Python tool for clustering molecular pathways
The pathways by which a molecular process transitions to a target state are highly sought-after as direct views of a transition mechanism. While great strides have been made in the physics-based simulation of such pathways, the analysis of these pathways can be a major challenge due to their diversi...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10462149/ https://www.ncbi.nlm.nih.gov/pubmed/37645995 http://dx.doi.org/10.1101/2023.08.17.553774 |
_version_ | 1785097997824032768 |
---|---|
author | Bogetti, Anthony T. Leung, Jeremy M. G. Chong, Lillian T. |
author_facet | Bogetti, Anthony T. Leung, Jeremy M. G. Chong, Lillian T. |
author_sort | Bogetti, Anthony T. |
collection | PubMed |
description | The pathways by which a molecular process transitions to a target state are highly sought-after as direct views of a transition mechanism. While great strides have been made in the physics-based simulation of such pathways, the analysis of these pathways can be a major challenge due to their diversity and variable lengths. Here we present the LPATH Python tool, which implements a semi-automated method for linguistics-assisted clustering of pathways into distinct classes (or routes). This method involves three steps: 1) discretizing the configurational space into key states, 2) extracting a text-string sequence of key visited states for each pathway, and 3) pairwise matching of pathways based on a text-string similarity score. To circumvent the prohibitive memory requirements of the first step, we have implemented a general two-stage method for clustering conformational states that exploits machine learning. LPATH is primarily designed for use with the WESTPA software for weighted ensemble simulations; however, the tool can also be applied to conventional simulations. As demonstrated for the C7(eq) to C7(ax) conformational transition of alanine dipeptide, LPATH provides physically reasonable classes of pathways and corresponding probabilities. |
format | Online Article Text |
id | pubmed-10462149 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Cold Spring Harbor Laboratory |
record_format | MEDLINE/PubMed |
spelling | pubmed-104621492023-08-29 LPATH: A semi-automated Python tool for clustering molecular pathways Bogetti, Anthony T. Leung, Jeremy M. G. Chong, Lillian T. bioRxiv Article The pathways by which a molecular process transitions to a target state are highly sought-after as direct views of a transition mechanism. While great strides have been made in the physics-based simulation of such pathways, the analysis of these pathways can be a major challenge due to their diversity and variable lengths. Here we present the LPATH Python tool, which implements a semi-automated method for linguistics-assisted clustering of pathways into distinct classes (or routes). This method involves three steps: 1) discretizing the configurational space into key states, 2) extracting a text-string sequence of key visited states for each pathway, and 3) pairwise matching of pathways based on a text-string similarity score. To circumvent the prohibitive memory requirements of the first step, we have implemented a general two-stage method for clustering conformational states that exploits machine learning. LPATH is primarily designed for use with the WESTPA software for weighted ensemble simulations; however, the tool can also be applied to conventional simulations. As demonstrated for the C7(eq) to C7(ax) conformational transition of alanine dipeptide, LPATH provides physically reasonable classes of pathways and corresponding probabilities. Cold Spring Harbor Laboratory 2023-10-19 /pmc/articles/PMC10462149/ /pubmed/37645995 http://dx.doi.org/10.1101/2023.08.17.553774 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator. |
spellingShingle | Article Bogetti, Anthony T. Leung, Jeremy M. G. Chong, Lillian T. LPATH: A semi-automated Python tool for clustering molecular pathways |
title | LPATH: A semi-automated Python tool for clustering molecular pathways |
title_full | LPATH: A semi-automated Python tool for clustering molecular pathways |
title_fullStr | LPATH: A semi-automated Python tool for clustering molecular pathways |
title_full_unstemmed | LPATH: A semi-automated Python tool for clustering molecular pathways |
title_short | LPATH: A semi-automated Python tool for clustering molecular pathways |
title_sort | lpath: a semi-automated python tool for clustering molecular pathways |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10462149/ https://www.ncbi.nlm.nih.gov/pubmed/37645995 http://dx.doi.org/10.1101/2023.08.17.553774 |
work_keys_str_mv | AT bogettianthonyt lpathasemiautomatedpythontoolforclusteringmolecularpathways AT leungjeremymg lpathasemiautomatedpythontoolforclusteringmolecularpathways AT chonglilliant lpathasemiautomatedpythontoolforclusteringmolecularpathways |