Cargando…

Principal component and clustering analysis on molecular dynamics data of the ribosomal L11·23S subdomain

With improvements in computer speed and algorithm efficiency, MD simulations are sampling larger amounts of molecular and biomolecular conformations. Being able to qualitatively and quantitatively sift these conformations into meaningful groups is a difficult and important task, especially when cons...

Descripción completa

Detalles Bibliográficos
Autores principales: Wolf, Antje, Kirschner, Karl N.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer-Verlag 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3592554/
https://www.ncbi.nlm.nih.gov/pubmed/22961589
http://dx.doi.org/10.1007/s00894-012-1563-4
_version_ 1782262133552054272
author Wolf, Antje
Kirschner, Karl N.
author_facet Wolf, Antje
Kirschner, Karl N.
author_sort Wolf, Antje
collection PubMed
description With improvements in computer speed and algorithm efficiency, MD simulations are sampling larger amounts of molecular and biomolecular conformations. Being able to qualitatively and quantitatively sift these conformations into meaningful groups is a difficult and important task, especially when considering the structure-activity paradigm. Here we present a study that combines two popular techniques, principal component (PC) analysis and clustering, for revealing major conformational changes that occur in molecular dynamics (MD) simulations. Specifically, we explored how clustering different PC subspaces effects the resulting clusters versus clustering the complete trajectory data. As a case example, we used the trajectory data from an explicitly solvated simulation of a bacteria’s L11·23S ribosomal subdomain, which is a target of thiopeptide antibiotics. Clustering was performed, using K-means and average-linkage algorithms, on data involving the first two to the first five PC subspace dimensions. For the average-linkage algorithm we found that data-point membership, cluster shape, and cluster size depended on the selected PC subspace data. In contrast, K-means provided very consistent results regardless of the selected subspace. Since we present results on a single model system, generalization concerning the clustering of different PC subspaces of other molecular systems is currently premature. However, our hope is that this study illustrates a) the complexities in selecting the appropriate clustering algorithm, b) the complexities in interpreting and validating their results, and c) by combining PC analysis with subsequent clustering valuable dynamic and conformational information can be obtained. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s00894-012-1563-4) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-3592554
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Springer-Verlag
record_format MEDLINE/PubMed
spelling pubmed-35925542013-03-11 Principal component and clustering analysis on molecular dynamics data of the ribosomal L11·23S subdomain Wolf, Antje Kirschner, Karl N. J Mol Model Original Paper With improvements in computer speed and algorithm efficiency, MD simulations are sampling larger amounts of molecular and biomolecular conformations. Being able to qualitatively and quantitatively sift these conformations into meaningful groups is a difficult and important task, especially when considering the structure-activity paradigm. Here we present a study that combines two popular techniques, principal component (PC) analysis and clustering, for revealing major conformational changes that occur in molecular dynamics (MD) simulations. Specifically, we explored how clustering different PC subspaces effects the resulting clusters versus clustering the complete trajectory data. As a case example, we used the trajectory data from an explicitly solvated simulation of a bacteria’s L11·23S ribosomal subdomain, which is a target of thiopeptide antibiotics. Clustering was performed, using K-means and average-linkage algorithms, on data involving the first two to the first five PC subspace dimensions. For the average-linkage algorithm we found that data-point membership, cluster shape, and cluster size depended on the selected PC subspace data. In contrast, K-means provided very consistent results regardless of the selected subspace. Since we present results on a single model system, generalization concerning the clustering of different PC subspaces of other molecular systems is currently premature. However, our hope is that this study illustrates a) the complexities in selecting the appropriate clustering algorithm, b) the complexities in interpreting and validating their results, and c) by combining PC analysis with subsequent clustering valuable dynamic and conformational information can be obtained. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s00894-012-1563-4) contains supplementary material, which is available to authorized users. Springer-Verlag 2012-09-08 2013 /pmc/articles/PMC3592554/ /pubmed/22961589 http://dx.doi.org/10.1007/s00894-012-1563-4 Text en © The Author(s) 2012 https://creativecommons.org/licenses/by/4.0/ This article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited.
spellingShingle Original Paper
Wolf, Antje
Kirschner, Karl N.
Principal component and clustering analysis on molecular dynamics data of the ribosomal L11·23S subdomain
title Principal component and clustering analysis on molecular dynamics data of the ribosomal L11·23S subdomain
title_full Principal component and clustering analysis on molecular dynamics data of the ribosomal L11·23S subdomain
title_fullStr Principal component and clustering analysis on molecular dynamics data of the ribosomal L11·23S subdomain
title_full_unstemmed Principal component and clustering analysis on molecular dynamics data of the ribosomal L11·23S subdomain
title_short Principal component and clustering analysis on molecular dynamics data of the ribosomal L11·23S subdomain
title_sort principal component and clustering analysis on molecular dynamics data of the ribosomal l11·23s subdomain
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3592554/
https://www.ncbi.nlm.nih.gov/pubmed/22961589
http://dx.doi.org/10.1007/s00894-012-1563-4
work_keys_str_mv AT wolfantje principalcomponentandclusteringanalysisonmoleculardynamicsdataoftheribosomall1123ssubdomain
AT kirschnerkarln principalcomponentandclusteringanalysisonmoleculardynamicsdataoftheribosomall1123ssubdomain