Cargando…

RegioSQM20: improved prediction of the regioselectivity of electrophilic aromatic substitutions

We present RegioSQM20, a new version of RegioSQM (Chem Sci 9:660, 2018), which predicts the regioselectivities of electrophilic aromatic substitution (EAS) reactions from the calculation of proton affinities. The following improvements have been made: The open source semiempirical tight binding prog...

Descripción completa

Detalles Bibliográficos
Autores principales: Ree, Nicolai, Göller, Andreas H., Jensen, Jan H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7881568/
https://www.ncbi.nlm.nih.gov/pubmed/33579374
http://dx.doi.org/10.1186/s13321-021-00490-7
_version_ 1783650904241602560
author Ree, Nicolai
Göller, Andreas H.
Jensen, Jan H.
author_facet Ree, Nicolai
Göller, Andreas H.
Jensen, Jan H.
author_sort Ree, Nicolai
collection PubMed
description We present RegioSQM20, a new version of RegioSQM (Chem Sci 9:660, 2018), which predicts the regioselectivities of electrophilic aromatic substitution (EAS) reactions from the calculation of proton affinities. The following improvements have been made: The open source semiempirical tight binding program xtb is used instead of the closed source MOPAC program. Any low energy tautomeric forms of the input molecule are identified and regioselectivity predictions are made for each form. Finally, RegioSQM20 offers a qualitative prediction of the reactivity of each tautomer (low, medium, or high) based on the reaction center with the highest proton affinity. The inclusion of tautomers increases the success rate from 90.7 to 92.7%. RegioSQM20 is compared to two machine learning based models: one developed by Struble et al. (React Chem Eng 5:896, 2020) specifically for regioselectivity predictions of EAS reactions (WLN) and a more generally applicable reactivity predictor (IBM RXN) developed by Schwaller et al. (ACS Cent Sci 5:1572, 2019). RegioSQM20 and WLN offers roughly the same success rates for the entire data sets (without considering tautomers), while WLN is many orders of magnitude faster. The accuracy of the more general IBM RXN approach is somewhat lower: 76.3–85.0%, depending on the data set. The code is freely available under the MIT open source license and will be made available as a webservice (regiosqm.org) in the near future. [Image: see text]
format Online
Article
Text
id pubmed-7881568
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-78815682021-02-17 RegioSQM20: improved prediction of the regioselectivity of electrophilic aromatic substitutions Ree, Nicolai Göller, Andreas H. Jensen, Jan H. J Cheminform Research Article We present RegioSQM20, a new version of RegioSQM (Chem Sci 9:660, 2018), which predicts the regioselectivities of electrophilic aromatic substitution (EAS) reactions from the calculation of proton affinities. The following improvements have been made: The open source semiempirical tight binding program xtb is used instead of the closed source MOPAC program. Any low energy tautomeric forms of the input molecule are identified and regioselectivity predictions are made for each form. Finally, RegioSQM20 offers a qualitative prediction of the reactivity of each tautomer (low, medium, or high) based on the reaction center with the highest proton affinity. The inclusion of tautomers increases the success rate from 90.7 to 92.7%. RegioSQM20 is compared to two machine learning based models: one developed by Struble et al. (React Chem Eng 5:896, 2020) specifically for regioselectivity predictions of EAS reactions (WLN) and a more generally applicable reactivity predictor (IBM RXN) developed by Schwaller et al. (ACS Cent Sci 5:1572, 2019). RegioSQM20 and WLN offers roughly the same success rates for the entire data sets (without considering tautomers), while WLN is many orders of magnitude faster. The accuracy of the more general IBM RXN approach is somewhat lower: 76.3–85.0%, depending on the data set. The code is freely available under the MIT open source license and will be made available as a webservice (regiosqm.org) in the near future. [Image: see text] Springer International Publishing 2021-02-12 /pmc/articles/PMC7881568/ /pubmed/33579374 http://dx.doi.org/10.1186/s13321-021-00490-7 Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Ree, Nicolai
Göller, Andreas H.
Jensen, Jan H.
RegioSQM20: improved prediction of the regioselectivity of electrophilic aromatic substitutions
title RegioSQM20: improved prediction of the regioselectivity of electrophilic aromatic substitutions
title_full RegioSQM20: improved prediction of the regioselectivity of electrophilic aromatic substitutions
title_fullStr RegioSQM20: improved prediction of the regioselectivity of electrophilic aromatic substitutions
title_full_unstemmed RegioSQM20: improved prediction of the regioselectivity of electrophilic aromatic substitutions
title_short RegioSQM20: improved prediction of the regioselectivity of electrophilic aromatic substitutions
title_sort regiosqm20: improved prediction of the regioselectivity of electrophilic aromatic substitutions
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7881568/
https://www.ncbi.nlm.nih.gov/pubmed/33579374
http://dx.doi.org/10.1186/s13321-021-00490-7
work_keys_str_mv AT reenicolai regiosqm20improvedpredictionoftheregioselectivityofelectrophilicaromaticsubstitutions
AT gollerandreash regiosqm20improvedpredictionoftheregioselectivityofelectrophilicaromaticsubstitutions
AT jensenjanh regiosqm20improvedpredictionoftheregioselectivityofelectrophilicaromaticsubstitutions