Cargando…

Time-Lagged t-Distributed Stochastic Neighbor Embedding (t-SNE) of Molecular Simulation Trajectories

Molecular simulation trajectories represent high-dimensional data. Such data can be visualized by methods of dimensionality reduction. Non-linear dimensionality reduction methods are likely to be more efficient than linear ones due to the fact that motions of atoms are non-linear. Here we test a pop...

Descripción completa

Detalles Bibliográficos
Autores principales: Spiwok, Vojtěch, Kříž, Pavel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7344294/
https://www.ncbi.nlm.nih.gov/pubmed/32714941
http://dx.doi.org/10.3389/fmolb.2020.00132
_version_ 1783555915579916288
author Spiwok, Vojtěch
Kříž, Pavel
author_facet Spiwok, Vojtěch
Kříž, Pavel
author_sort Spiwok, Vojtěch
collection PubMed
description Molecular simulation trajectories represent high-dimensional data. Such data can be visualized by methods of dimensionality reduction. Non-linear dimensionality reduction methods are likely to be more efficient than linear ones due to the fact that motions of atoms are non-linear. Here we test a popular non-linear t-distributed Stochastic Neighbor Embedding (t-SNE) method on analysis of trajectories of 200 ns alanine dipeptide dynamics and 208 μs Trp-cage folding and unfolding. Furthermore, we introduce a time-lagged variant of t-SNE in order to focus on rarely occurring transitions in the molecular system. This time-lagged t-SNE efficiently separates states according to distance in time. Using this method it is possible to visualize key states of studied systems (e.g., unfolded and folded protein) as well as possible kinetic traps using a two-dimensional plot. Time-lagged t-SNE is a visualization method and other applications, such as clustering and free energy modeling, must be done with caution.
format Online
Article
Text
id pubmed-7344294
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-73442942020-07-25 Time-Lagged t-Distributed Stochastic Neighbor Embedding (t-SNE) of Molecular Simulation Trajectories Spiwok, Vojtěch Kříž, Pavel Front Mol Biosci Molecular Biosciences Molecular simulation trajectories represent high-dimensional data. Such data can be visualized by methods of dimensionality reduction. Non-linear dimensionality reduction methods are likely to be more efficient than linear ones due to the fact that motions of atoms are non-linear. Here we test a popular non-linear t-distributed Stochastic Neighbor Embedding (t-SNE) method on analysis of trajectories of 200 ns alanine dipeptide dynamics and 208 μs Trp-cage folding and unfolding. Furthermore, we introduce a time-lagged variant of t-SNE in order to focus on rarely occurring transitions in the molecular system. This time-lagged t-SNE efficiently separates states according to distance in time. Using this method it is possible to visualize key states of studied systems (e.g., unfolded and folded protein) as well as possible kinetic traps using a two-dimensional plot. Time-lagged t-SNE is a visualization method and other applications, such as clustering and free energy modeling, must be done with caution. Frontiers Media S.A. 2020-06-30 /pmc/articles/PMC7344294/ /pubmed/32714941 http://dx.doi.org/10.3389/fmolb.2020.00132 Text en Copyright © 2020 Spiwok and Kříž. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Molecular Biosciences
Spiwok, Vojtěch
Kříž, Pavel
Time-Lagged t-Distributed Stochastic Neighbor Embedding (t-SNE) of Molecular Simulation Trajectories
title Time-Lagged t-Distributed Stochastic Neighbor Embedding (t-SNE) of Molecular Simulation Trajectories
title_full Time-Lagged t-Distributed Stochastic Neighbor Embedding (t-SNE) of Molecular Simulation Trajectories
title_fullStr Time-Lagged t-Distributed Stochastic Neighbor Embedding (t-SNE) of Molecular Simulation Trajectories
title_full_unstemmed Time-Lagged t-Distributed Stochastic Neighbor Embedding (t-SNE) of Molecular Simulation Trajectories
title_short Time-Lagged t-Distributed Stochastic Neighbor Embedding (t-SNE) of Molecular Simulation Trajectories
title_sort time-lagged t-distributed stochastic neighbor embedding (t-sne) of molecular simulation trajectories
topic Molecular Biosciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7344294/
https://www.ncbi.nlm.nih.gov/pubmed/32714941
http://dx.doi.org/10.3389/fmolb.2020.00132
work_keys_str_mv AT spiwokvojtech timelaggedtdistributedstochasticneighborembeddingtsneofmolecularsimulationtrajectories
AT krizpavel timelaggedtdistributedstochasticneighborembeddingtsneofmolecularsimulationtrajectories