Cargando…
SiteFerret: Beyond Simple Pocket Identification in Proteins
[Image: see text] We present a novel method for the automatic detection of pockets on protein molecular surfaces. The algorithm is based on an ad hoc hierarchical clustering of virtual probe spheres obtained from the geometrical primitives used by the NanoShaper software to build the solvent-exclude...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Chemical Society
2023
|
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10413863/ https://www.ncbi.nlm.nih.gov/pubmed/37470784 http://dx.doi.org/10.1021/acs.jctc.2c01306 |
_version_ | 1785087222203023360 |
---|---|
author | Gagliardi, Luca Rocchia, Walter |
author_facet | Gagliardi, Luca Rocchia, Walter |
author_sort | Gagliardi, Luca |
collection | PubMed |
description | [Image: see text] We present a novel method for the automatic detection of pockets on protein molecular surfaces. The algorithm is based on an ad hoc hierarchical clustering of virtual probe spheres obtained from the geometrical primitives used by the NanoShaper software to build the solvent-excluded molecular surface. The final ranking of putative pockets is based on the Isolation Forest method, an unsupervised learning approach originally developed for anomaly detection. A detailed importance analysis of pocket features provides insight into which geometrical (clustering) and chemical (amino acidic composition) properties characterize a good binding site. The method also provides a segmentation of pockets into smaller subpockets. We prove that subpockets are a convenient representation to pinpoint the binding site with great precision. SiteFerret is outstanding in its versatility, accurately predicting a wide range of binding sites, from those binding small molecules to those binding peptides, including difficult shallow sites. |
format | Online Article Text |
id | pubmed-10413863 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | American Chemical Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-104138632023-08-11 SiteFerret: Beyond Simple Pocket Identification in Proteins Gagliardi, Luca Rocchia, Walter J Chem Theory Comput [Image: see text] We present a novel method for the automatic detection of pockets on protein molecular surfaces. The algorithm is based on an ad hoc hierarchical clustering of virtual probe spheres obtained from the geometrical primitives used by the NanoShaper software to build the solvent-excluded molecular surface. The final ranking of putative pockets is based on the Isolation Forest method, an unsupervised learning approach originally developed for anomaly detection. A detailed importance analysis of pocket features provides insight into which geometrical (clustering) and chemical (amino acidic composition) properties characterize a good binding site. The method also provides a segmentation of pockets into smaller subpockets. We prove that subpockets are a convenient representation to pinpoint the binding site with great precision. SiteFerret is outstanding in its versatility, accurately predicting a wide range of binding sites, from those binding small molecules to those binding peptides, including difficult shallow sites. American Chemical Society 2023-07-20 /pmc/articles/PMC10413863/ /pubmed/37470784 http://dx.doi.org/10.1021/acs.jctc.2c01306 Text en © 2023 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by/4.0/Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Gagliardi, Luca Rocchia, Walter SiteFerret: Beyond Simple Pocket Identification in Proteins |
title | SiteFerret: Beyond
Simple Pocket Identification in
Proteins |
title_full | SiteFerret: Beyond
Simple Pocket Identification in
Proteins |
title_fullStr | SiteFerret: Beyond
Simple Pocket Identification in
Proteins |
title_full_unstemmed | SiteFerret: Beyond
Simple Pocket Identification in
Proteins |
title_short | SiteFerret: Beyond
Simple Pocket Identification in
Proteins |
title_sort | siteferret: beyond
simple pocket identification in
proteins |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10413863/ https://www.ncbi.nlm.nih.gov/pubmed/37470784 http://dx.doi.org/10.1021/acs.jctc.2c01306 |
work_keys_str_mv | AT gagliardiluca siteferretbeyondsimplepocketidentificationinproteins AT rocchiawalter siteferretbeyondsimplepocketidentificationinproteins |