Cargando…

SiteFerret: Beyond Simple Pocket Identification in Proteins

[Image: see text] We present a novel method for the automatic detection of pockets on protein molecular surfaces. The algorithm is based on an ad hoc hierarchical clustering of virtual probe spheres obtained from the geometrical primitives used by the NanoShaper software to build the solvent-exclude...

Descripción completa

Detalles Bibliográficos
Autores principales: Gagliardi, Luca, Rocchia, Walter
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2023
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10413863/
https://www.ncbi.nlm.nih.gov/pubmed/37470784
http://dx.doi.org/10.1021/acs.jctc.2c01306
_version_ 1785087222203023360
author Gagliardi, Luca
Rocchia, Walter
author_facet Gagliardi, Luca
Rocchia, Walter
author_sort Gagliardi, Luca
collection PubMed
description [Image: see text] We present a novel method for the automatic detection of pockets on protein molecular surfaces. The algorithm is based on an ad hoc hierarchical clustering of virtual probe spheres obtained from the geometrical primitives used by the NanoShaper software to build the solvent-excluded molecular surface. The final ranking of putative pockets is based on the Isolation Forest method, an unsupervised learning approach originally developed for anomaly detection. A detailed importance analysis of pocket features provides insight into which geometrical (clustering) and chemical (amino acidic composition) properties characterize a good binding site. The method also provides a segmentation of pockets into smaller subpockets. We prove that subpockets are a convenient representation to pinpoint the binding site with great precision. SiteFerret is outstanding in its versatility, accurately predicting a wide range of binding sites, from those binding small molecules to those binding peptides, including difficult shallow sites.
format Online
Article
Text
id pubmed-10413863
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-104138632023-08-11 SiteFerret: Beyond Simple Pocket Identification in Proteins Gagliardi, Luca Rocchia, Walter J Chem Theory Comput [Image: see text] We present a novel method for the automatic detection of pockets on protein molecular surfaces. The algorithm is based on an ad hoc hierarchical clustering of virtual probe spheres obtained from the geometrical primitives used by the NanoShaper software to build the solvent-excluded molecular surface. The final ranking of putative pockets is based on the Isolation Forest method, an unsupervised learning approach originally developed for anomaly detection. A detailed importance analysis of pocket features provides insight into which geometrical (clustering) and chemical (amino acidic composition) properties characterize a good binding site. The method also provides a segmentation of pockets into smaller subpockets. We prove that subpockets are a convenient representation to pinpoint the binding site with great precision. SiteFerret is outstanding in its versatility, accurately predicting a wide range of binding sites, from those binding small molecules to those binding peptides, including difficult shallow sites. American Chemical Society 2023-07-20 /pmc/articles/PMC10413863/ /pubmed/37470784 http://dx.doi.org/10.1021/acs.jctc.2c01306 Text en © 2023 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by/4.0/Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Gagliardi, Luca
Rocchia, Walter
SiteFerret: Beyond Simple Pocket Identification in Proteins
title SiteFerret: Beyond Simple Pocket Identification in Proteins
title_full SiteFerret: Beyond Simple Pocket Identification in Proteins
title_fullStr SiteFerret: Beyond Simple Pocket Identification in Proteins
title_full_unstemmed SiteFerret: Beyond Simple Pocket Identification in Proteins
title_short SiteFerret: Beyond Simple Pocket Identification in Proteins
title_sort siteferret: beyond simple pocket identification in proteins
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10413863/
https://www.ncbi.nlm.nih.gov/pubmed/37470784
http://dx.doi.org/10.1021/acs.jctc.2c01306
work_keys_str_mv AT gagliardiluca siteferretbeyondsimplepocketidentificationinproteins
AT rocchiawalter siteferretbeyondsimplepocketidentificationinproteins