Cargando…

MetaClass, a Comprehensive Classification System for Predicting the Occurrence of Metabolic Reactions Based on the MetaQSAR Database

(1) Background: Machine learning algorithms are finding fruitful applications in predicting the ADME profile of new molecules, with a particular focus on metabolism predictions. However, the development of comprehensive metabolism predictors is hampered by the lack of highly accurate metabolic resou...

Descripción completa

Detalles Bibliográficos
Autores principales: Mazzolari, Angelica, Scaccabarozzi, Alice, Vistoli, Giulio, Pedretti, Alessandro
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8512547/
https://www.ncbi.nlm.nih.gov/pubmed/34641400
http://dx.doi.org/10.3390/molecules26195857
_version_ 1784583019094343680
author Mazzolari, Angelica
Scaccabarozzi, Alice
Vistoli, Giulio
Pedretti, Alessandro
author_facet Mazzolari, Angelica
Scaccabarozzi, Alice
Vistoli, Giulio
Pedretti, Alessandro
author_sort Mazzolari, Angelica
collection PubMed
description (1) Background: Machine learning algorithms are finding fruitful applications in predicting the ADME profile of new molecules, with a particular focus on metabolism predictions. However, the development of comprehensive metabolism predictors is hampered by the lack of highly accurate metabolic resources. Hence, we recently proposed a manually curated metabolic database (MetaQSAR), the level of accuracy of which is well suited to the development of predictive models. (2) Methods: MetaQSAR was used to extract datasets to predict the metabolic reactions subdivided into major classes, classes and subclasses. The collected datasets comprised a total of 3788 first-generation metabolic reactions. Predictive models were developed by using standard random forest algorithms and sets of physicochemical, stereo-electronic and constitutional descriptors. (3) Results: The developed models showed satisfactory performance, especially for hydrolyses and conjugations, while redox reactions were predicted with greater difficulty, which was reasonable as they depend on many complex features that are not properly encoded by the included descriptors. (4) Conclusions: The generated models allowed a precise comparison of the propensity of each metabolic reaction to be predicted and the factors affecting their predictability were discussed in detail. Overall, the study led to the development of a freely downloadable global predictor, MetaClass, which correctly predicts 80% of the reported reactions, as assessed by an explorative validation analysis on an external dataset, with an overall MCC = 0.44.
format Online
Article
Text
id pubmed-8512547
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-85125472021-10-14 MetaClass, a Comprehensive Classification System for Predicting the Occurrence of Metabolic Reactions Based on the MetaQSAR Database Mazzolari, Angelica Scaccabarozzi, Alice Vistoli, Giulio Pedretti, Alessandro Molecules Article (1) Background: Machine learning algorithms are finding fruitful applications in predicting the ADME profile of new molecules, with a particular focus on metabolism predictions. However, the development of comprehensive metabolism predictors is hampered by the lack of highly accurate metabolic resources. Hence, we recently proposed a manually curated metabolic database (MetaQSAR), the level of accuracy of which is well suited to the development of predictive models. (2) Methods: MetaQSAR was used to extract datasets to predict the metabolic reactions subdivided into major classes, classes and subclasses. The collected datasets comprised a total of 3788 first-generation metabolic reactions. Predictive models were developed by using standard random forest algorithms and sets of physicochemical, stereo-electronic and constitutional descriptors. (3) Results: The developed models showed satisfactory performance, especially for hydrolyses and conjugations, while redox reactions were predicted with greater difficulty, which was reasonable as they depend on many complex features that are not properly encoded by the included descriptors. (4) Conclusions: The generated models allowed a precise comparison of the propensity of each metabolic reaction to be predicted and the factors affecting their predictability were discussed in detail. Overall, the study led to the development of a freely downloadable global predictor, MetaClass, which correctly predicts 80% of the reported reactions, as assessed by an explorative validation analysis on an external dataset, with an overall MCC = 0.44. MDPI 2021-09-27 /pmc/articles/PMC8512547/ /pubmed/34641400 http://dx.doi.org/10.3390/molecules26195857 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Mazzolari, Angelica
Scaccabarozzi, Alice
Vistoli, Giulio
Pedretti, Alessandro
MetaClass, a Comprehensive Classification System for Predicting the Occurrence of Metabolic Reactions Based on the MetaQSAR Database
title MetaClass, a Comprehensive Classification System for Predicting the Occurrence of Metabolic Reactions Based on the MetaQSAR Database
title_full MetaClass, a Comprehensive Classification System for Predicting the Occurrence of Metabolic Reactions Based on the MetaQSAR Database
title_fullStr MetaClass, a Comprehensive Classification System for Predicting the Occurrence of Metabolic Reactions Based on the MetaQSAR Database
title_full_unstemmed MetaClass, a Comprehensive Classification System for Predicting the Occurrence of Metabolic Reactions Based on the MetaQSAR Database
title_short MetaClass, a Comprehensive Classification System for Predicting the Occurrence of Metabolic Reactions Based on the MetaQSAR Database
title_sort metaclass, a comprehensive classification system for predicting the occurrence of metabolic reactions based on the metaqsar database
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8512547/
https://www.ncbi.nlm.nih.gov/pubmed/34641400
http://dx.doi.org/10.3390/molecules26195857
work_keys_str_mv AT mazzolariangelica metaclassacomprehensiveclassificationsystemforpredictingtheoccurrenceofmetabolicreactionsbasedonthemetaqsardatabase
AT scaccabarozzialice metaclassacomprehensiveclassificationsystemforpredictingtheoccurrenceofmetabolicreactionsbasedonthemetaqsardatabase
AT vistoligiulio metaclassacomprehensiveclassificationsystemforpredictingtheoccurrenceofmetabolicreactionsbasedonthemetaqsardatabase
AT pedrettialessandro metaclassacomprehensiveclassificationsystemforpredictingtheoccurrenceofmetabolicreactionsbasedonthemetaqsardatabase