Cargando…

CLoNe: automated clustering based on local density neighborhoods for application to biomolecular structural ensembles

MOTIVATION: Proteins are intrinsically dynamic entities. Flexibility sampling methods, such as molecular dynamics or those arising from integrative modeling strategies, are now commonplace and enable the study of molecular conformational landscapes in many contexts. Resulting structural ensembles in...

Descripción completa

Detalles Bibliográficos
Autores principales: Träger, Sylvain, Tamò, Giorgio, Aydin, Deniz, Fonti, Giulia, Audagnotto, Martina, Dal Peraro, Matteo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8128458/
https://www.ncbi.nlm.nih.gov/pubmed/32821900
http://dx.doi.org/10.1093/bioinformatics/btaa742
Descripción
Sumario:MOTIVATION: Proteins are intrinsically dynamic entities. Flexibility sampling methods, such as molecular dynamics or those arising from integrative modeling strategies, are now commonplace and enable the study of molecular conformational landscapes in many contexts. Resulting structural ensembles increase in size as technological and algorithmic advancements take place, making their analysis increasingly demanding. In this regard, cluster analysis remains a go-to approach for their classification. However, many state-of-the-art algorithms are restricted to specific cluster properties. Combined with tedious parameter fine-tuning, cluster analysis of protein structural ensembles suffers from the lack of a generally applicable and easy to use clustering scheme. RESULTS: We present CLoNe, an original Python-based clustering scheme that builds on the Density Peaks algorithm of Rodriguez and Laio. CLoNe relies on a probabilistic analysis of local density distributions derived from nearest neighbors to find relevant clusters regardless of cluster shape, size, distribution and amount. We show its capabilities on many toy datasets with properties otherwise dividing state-of-the-art approaches and improves on the original algorithm in key aspects. Applied to structural ensembles, CLoNe was able to extract meaningful conformations from membrane binding events and ligand-binding pocket opening as well as identify dominant dimerization motifs or inter-domain organization. CLoNe additionally saves clusters as individual trajectories for further analysis and provides scripts for automated use with molecular visualization software. AVAILABILITY AND IMPLEMENTATION: www.epfl.ch/labs/lbm/resources, github.com/LBM-EPFL/CLoNe. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.