Cargando…
RegioSQM20: improved prediction of the regioselectivity of electrophilic aromatic substitutions
We present RegioSQM20, a new version of RegioSQM (Chem Sci 9:660, 2018), which predicts the regioselectivities of electrophilic aromatic substitution (EAS) reactions from the calculation of proton affinities. The following improvements have been made: The open source semiempirical tight binding prog...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer International Publishing
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7881568/ https://www.ncbi.nlm.nih.gov/pubmed/33579374 http://dx.doi.org/10.1186/s13321-021-00490-7 |
_version_ | 1783650904241602560 |
---|---|
author | Ree, Nicolai Göller, Andreas H. Jensen, Jan H. |
author_facet | Ree, Nicolai Göller, Andreas H. Jensen, Jan H. |
author_sort | Ree, Nicolai |
collection | PubMed |
description | We present RegioSQM20, a new version of RegioSQM (Chem Sci 9:660, 2018), which predicts the regioselectivities of electrophilic aromatic substitution (EAS) reactions from the calculation of proton affinities. The following improvements have been made: The open source semiempirical tight binding program xtb is used instead of the closed source MOPAC program. Any low energy tautomeric forms of the input molecule are identified and regioselectivity predictions are made for each form. Finally, RegioSQM20 offers a qualitative prediction of the reactivity of each tautomer (low, medium, or high) based on the reaction center with the highest proton affinity. The inclusion of tautomers increases the success rate from 90.7 to 92.7%. RegioSQM20 is compared to two machine learning based models: one developed by Struble et al. (React Chem Eng 5:896, 2020) specifically for regioselectivity predictions of EAS reactions (WLN) and a more generally applicable reactivity predictor (IBM RXN) developed by Schwaller et al. (ACS Cent Sci 5:1572, 2019). RegioSQM20 and WLN offers roughly the same success rates for the entire data sets (without considering tautomers), while WLN is many orders of magnitude faster. The accuracy of the more general IBM RXN approach is somewhat lower: 76.3–85.0%, depending on the data set. The code is freely available under the MIT open source license and will be made available as a webservice (regiosqm.org) in the near future. [Image: see text] |
format | Online Article Text |
id | pubmed-7881568 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Springer International Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-78815682021-02-17 RegioSQM20: improved prediction of the regioselectivity of electrophilic aromatic substitutions Ree, Nicolai Göller, Andreas H. Jensen, Jan H. J Cheminform Research Article We present RegioSQM20, a new version of RegioSQM (Chem Sci 9:660, 2018), which predicts the regioselectivities of electrophilic aromatic substitution (EAS) reactions from the calculation of proton affinities. The following improvements have been made: The open source semiempirical tight binding program xtb is used instead of the closed source MOPAC program. Any low energy tautomeric forms of the input molecule are identified and regioselectivity predictions are made for each form. Finally, RegioSQM20 offers a qualitative prediction of the reactivity of each tautomer (low, medium, or high) based on the reaction center with the highest proton affinity. The inclusion of tautomers increases the success rate from 90.7 to 92.7%. RegioSQM20 is compared to two machine learning based models: one developed by Struble et al. (React Chem Eng 5:896, 2020) specifically for regioselectivity predictions of EAS reactions (WLN) and a more generally applicable reactivity predictor (IBM RXN) developed by Schwaller et al. (ACS Cent Sci 5:1572, 2019). RegioSQM20 and WLN offers roughly the same success rates for the entire data sets (without considering tautomers), while WLN is many orders of magnitude faster. The accuracy of the more general IBM RXN approach is somewhat lower: 76.3–85.0%, depending on the data set. The code is freely available under the MIT open source license and will be made available as a webservice (regiosqm.org) in the near future. [Image: see text] Springer International Publishing 2021-02-12 /pmc/articles/PMC7881568/ /pubmed/33579374 http://dx.doi.org/10.1186/s13321-021-00490-7 Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Article Ree, Nicolai Göller, Andreas H. Jensen, Jan H. RegioSQM20: improved prediction of the regioselectivity of electrophilic aromatic substitutions |
title | RegioSQM20: improved prediction of the regioselectivity of electrophilic aromatic substitutions |
title_full | RegioSQM20: improved prediction of the regioselectivity of electrophilic aromatic substitutions |
title_fullStr | RegioSQM20: improved prediction of the regioselectivity of electrophilic aromatic substitutions |
title_full_unstemmed | RegioSQM20: improved prediction of the regioselectivity of electrophilic aromatic substitutions |
title_short | RegioSQM20: improved prediction of the regioselectivity of electrophilic aromatic substitutions |
title_sort | regiosqm20: improved prediction of the regioselectivity of electrophilic aromatic substitutions |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7881568/ https://www.ncbi.nlm.nih.gov/pubmed/33579374 http://dx.doi.org/10.1186/s13321-021-00490-7 |
work_keys_str_mv | AT reenicolai regiosqm20improvedpredictionoftheregioselectivityofelectrophilicaromaticsubstitutions AT gollerandreash regiosqm20improvedpredictionoftheregioselectivityofelectrophilicaromaticsubstitutions AT jensenjanh regiosqm20improvedpredictionoftheregioselectivityofelectrophilicaromaticsubstitutions |