Cargando…

Validating clustering of molecular dynamics simulations using polymer models

BACKGROUND: Molecular dynamics (MD) simulation is a powerful technique for sampling the meta-stable and transitional conformations of proteins and other biomolecules. Computational data clustering has emerged as a useful, automated technique for extracting conformational states from MD simulation da...

Descripción completa

Detalles Bibliográficos
Autores principales: Phillips, Joshua L, Colvin, Michael E, Newsam, Shawn
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3284309/
https://www.ncbi.nlm.nih.gov/pubmed/22082218
http://dx.doi.org/10.1186/1471-2105-12-445
_version_ 1782224348469264384
author Phillips, Joshua L
Colvin, Michael E
Newsam, Shawn
author_facet Phillips, Joshua L
Colvin, Michael E
Newsam, Shawn
author_sort Phillips, Joshua L
collection PubMed
description BACKGROUND: Molecular dynamics (MD) simulation is a powerful technique for sampling the meta-stable and transitional conformations of proteins and other biomolecules. Computational data clustering has emerged as a useful, automated technique for extracting conformational states from MD simulation data. Despite extensive application, relatively little work has been done to determine if the clustering algorithms are actually extracting useful information. A primary goal of this paper therefore is to provide such an understanding through a detailed analysis of data clustering applied to a series of increasingly complex biopolymer models. RESULTS: We develop a novel series of models using basic polymer theory that have intuitive, clearly-defined dynamics and exhibit the essential properties that we are seeking to identify in MD simulations of real biomolecules. We then apply spectral clustering, an algorithm particularly well-suited for clustering polymer structures, to our models and MD simulations of several intrinsically disordered proteins. Clustering results for the polymer models provide clear evidence that the meta-stable and transitional conformations are detected by the algorithm. The results for the polymer models also help guide the analysis of the disordered protein simulations by comparing and contrasting the statistical properties of the extracted clusters. CONCLUSIONS: We have developed a framework for validating the performance and utility of clustering algorithms for studying molecular biopolymer simulations that utilizes several analytic and dynamic polymer models which exhibit well-behaved dynamics including: meta-stable states, transition states, helical structures, and stochastic dynamics. We show that spectral clustering is robust to anomalies introduced by structural alignment and that different structural classes of intrinsically disordered proteins can be reliably discriminated from the clustering results. To our knowledge, our framework is the first to utilize model polymers to rigorously test the utility of clustering algorithms for studying biopolymers.
format Online
Article
Text
id pubmed-3284309
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32843092012-02-23 Validating clustering of molecular dynamics simulations using polymer models Phillips, Joshua L Colvin, Michael E Newsam, Shawn BMC Bioinformatics Methodology Article BACKGROUND: Molecular dynamics (MD) simulation is a powerful technique for sampling the meta-stable and transitional conformations of proteins and other biomolecules. Computational data clustering has emerged as a useful, automated technique for extracting conformational states from MD simulation data. Despite extensive application, relatively little work has been done to determine if the clustering algorithms are actually extracting useful information. A primary goal of this paper therefore is to provide such an understanding through a detailed analysis of data clustering applied to a series of increasingly complex biopolymer models. RESULTS: We develop a novel series of models using basic polymer theory that have intuitive, clearly-defined dynamics and exhibit the essential properties that we are seeking to identify in MD simulations of real biomolecules. We then apply spectral clustering, an algorithm particularly well-suited for clustering polymer structures, to our models and MD simulations of several intrinsically disordered proteins. Clustering results for the polymer models provide clear evidence that the meta-stable and transitional conformations are detected by the algorithm. The results for the polymer models also help guide the analysis of the disordered protein simulations by comparing and contrasting the statistical properties of the extracted clusters. CONCLUSIONS: We have developed a framework for validating the performance and utility of clustering algorithms for studying molecular biopolymer simulations that utilizes several analytic and dynamic polymer models which exhibit well-behaved dynamics including: meta-stable states, transition states, helical structures, and stochastic dynamics. We show that spectral clustering is robust to anomalies introduced by structural alignment and that different structural classes of intrinsically disordered proteins can be reliably discriminated from the clustering results. To our knowledge, our framework is the first to utilize model polymers to rigorously test the utility of clustering algorithms for studying biopolymers. BioMed Central 2011-11-14 /pmc/articles/PMC3284309/ /pubmed/22082218 http://dx.doi.org/10.1186/1471-2105-12-445 Text en Copyright ©2011 Phillips et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Phillips, Joshua L
Colvin, Michael E
Newsam, Shawn
Validating clustering of molecular dynamics simulations using polymer models
title Validating clustering of molecular dynamics simulations using polymer models
title_full Validating clustering of molecular dynamics simulations using polymer models
title_fullStr Validating clustering of molecular dynamics simulations using polymer models
title_full_unstemmed Validating clustering of molecular dynamics simulations using polymer models
title_short Validating clustering of molecular dynamics simulations using polymer models
title_sort validating clustering of molecular dynamics simulations using polymer models
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3284309/
https://www.ncbi.nlm.nih.gov/pubmed/22082218
http://dx.doi.org/10.1186/1471-2105-12-445
work_keys_str_mv AT phillipsjoshual validatingclusteringofmoleculardynamicssimulationsusingpolymermodels
AT colvinmichaele validatingclusteringofmoleculardynamicssimulationsusingpolymermodels
AT newsamshawn validatingclusteringofmoleculardynamicssimulationsusingpolymermodels