Cargando…

JED: a Java Essential Dynamics Program for comparative analysis of protein trajectories

BACKGROUND: Essential Dynamics (ED) is a common application of principal component analysis (PCA) to extract biologically relevant motions from atomic trajectories of proteins. Covariance and correlation based PCA are two common approaches to determine PCA modes (eigenvectors) and their eigenvalues....

Descripción completa

Detalles Bibliográficos
Autores principales: David, Charles C., Singam, Ettayapuram Ramaprasad Azhagiya, Jacobs, Donald J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5445469/
https://www.ncbi.nlm.nih.gov/pubmed/28545397
http://dx.doi.org/10.1186/s12859-017-1676-y
_version_ 1783238898322767872
author David, Charles C.
Singam, Ettayapuram Ramaprasad Azhagiya
Jacobs, Donald J.
author_facet David, Charles C.
Singam, Ettayapuram Ramaprasad Azhagiya
Jacobs, Donald J.
author_sort David, Charles C.
collection PubMed
description BACKGROUND: Essential Dynamics (ED) is a common application of principal component analysis (PCA) to extract biologically relevant motions from atomic trajectories of proteins. Covariance and correlation based PCA are two common approaches to determine PCA modes (eigenvectors) and their eigenvalues. Protein dynamics can be characterized in terms of Cartesian coordinates or internal distance pairs. In understanding protein dynamics, a comparison of trajectories taken from a set of proteins for similarity assessment provides insight into conserved mechanisms. Comprehensive software is needed to facilitate comparative-analysis with user-friendly features that are rooted in best practices from multivariate statistics. RESULTS: We developed a Java based Essential Dynamics toolkit called JED to compare the ED from multiple protein trajectories. Trajectories from different simulations and different proteins can be pooled for comparative studies. JED implements Cartesian-based coordinates (cPCA) and internal distance pair coordinates (dpPCA) as options to construct covariance (Q) or correlation (R) matrices. Statistical methods are implemented for treating outliers, benchmarking sampling adequacy, characterizing the precision of Q and R, and reporting partial correlations. JED output results as text files that include transformed coordinates for aligned structures, several metrics that quantify protein mobility, PCA modes with their eigenvalues, and displacement vector (DV) projections onto the top principal modes. Pymol scripts together with PDB files allow movies of individual Q- and R-cPCA modes to be visualized, and the essential dynamics occurring within user-selected time scales. Subspaces defined by the top eigenvectors are compared using several statistical metrics to quantify similarity/overlap of high dimensional vector spaces. Free energy landscapes can be generated for both cPCA and dpPCA. CONCLUSIONS: JED offers a convenient toolkit that encourages best practices in applying multivariate statistics methods to perform comparative studies of essential dynamics over multiple proteins. For each protein, Cartesian coordinates or internal distance pairs can be employed over the entire structure or user-selected parts to quantify similarity/differences in mobility and correlations in dynamics to develop insight into protein structure/function relationships. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1676-y) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5445469
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-54454692017-05-30 JED: a Java Essential Dynamics Program for comparative analysis of protein trajectories David, Charles C. Singam, Ettayapuram Ramaprasad Azhagiya Jacobs, Donald J. BMC Bioinformatics Software BACKGROUND: Essential Dynamics (ED) is a common application of principal component analysis (PCA) to extract biologically relevant motions from atomic trajectories of proteins. Covariance and correlation based PCA are two common approaches to determine PCA modes (eigenvectors) and their eigenvalues. Protein dynamics can be characterized in terms of Cartesian coordinates or internal distance pairs. In understanding protein dynamics, a comparison of trajectories taken from a set of proteins for similarity assessment provides insight into conserved mechanisms. Comprehensive software is needed to facilitate comparative-analysis with user-friendly features that are rooted in best practices from multivariate statistics. RESULTS: We developed a Java based Essential Dynamics toolkit called JED to compare the ED from multiple protein trajectories. Trajectories from different simulations and different proteins can be pooled for comparative studies. JED implements Cartesian-based coordinates (cPCA) and internal distance pair coordinates (dpPCA) as options to construct covariance (Q) or correlation (R) matrices. Statistical methods are implemented for treating outliers, benchmarking sampling adequacy, characterizing the precision of Q and R, and reporting partial correlations. JED output results as text files that include transformed coordinates for aligned structures, several metrics that quantify protein mobility, PCA modes with their eigenvalues, and displacement vector (DV) projections onto the top principal modes. Pymol scripts together with PDB files allow movies of individual Q- and R-cPCA modes to be visualized, and the essential dynamics occurring within user-selected time scales. Subspaces defined by the top eigenvectors are compared using several statistical metrics to quantify similarity/overlap of high dimensional vector spaces. Free energy landscapes can be generated for both cPCA and dpPCA. CONCLUSIONS: JED offers a convenient toolkit that encourages best practices in applying multivariate statistics methods to perform comparative studies of essential dynamics over multiple proteins. For each protein, Cartesian coordinates or internal distance pairs can be employed over the entire structure or user-selected parts to quantify similarity/differences in mobility and correlations in dynamics to develop insight into protein structure/function relationships. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1676-y) contains supplementary material, which is available to authorized users. BioMed Central 2017-05-25 /pmc/articles/PMC5445469/ /pubmed/28545397 http://dx.doi.org/10.1186/s12859-017-1676-y Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
David, Charles C.
Singam, Ettayapuram Ramaprasad Azhagiya
Jacobs, Donald J.
JED: a Java Essential Dynamics Program for comparative analysis of protein trajectories
title JED: a Java Essential Dynamics Program for comparative analysis of protein trajectories
title_full JED: a Java Essential Dynamics Program for comparative analysis of protein trajectories
title_fullStr JED: a Java Essential Dynamics Program for comparative analysis of protein trajectories
title_full_unstemmed JED: a Java Essential Dynamics Program for comparative analysis of protein trajectories
title_short JED: a Java Essential Dynamics Program for comparative analysis of protein trajectories
title_sort jed: a java essential dynamics program for comparative analysis of protein trajectories
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5445469/
https://www.ncbi.nlm.nih.gov/pubmed/28545397
http://dx.doi.org/10.1186/s12859-017-1676-y
work_keys_str_mv AT davidcharlesc jedajavaessentialdynamicsprogramforcomparativeanalysisofproteintrajectories
AT singamettayapuramramaprasadazhagiya jedajavaessentialdynamicsprogramforcomparativeanalysisofproteintrajectories
AT jacobsdonaldj jedajavaessentialdynamicsprogramforcomparativeanalysisofproteintrajectories