Cargando…

Prediction of Chromatography Conditions for Purification in Organic Synthesis Using Deep Learning

In this research, a process for developing normal-phase liquid chromatography solvent systems has been proposed. In contrast to the development of conditions via thin-layer chromatography (TLC), this process is based on the architecture of two hierarchically connected neural network-based components...

Descripción completa

Detalles Bibliográficos
Autores principales: Vaškevičius, Mantas, Kapočiūtė-Dzikienė, Jurgita, Šlepikas, Liudas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8123027/
https://www.ncbi.nlm.nih.gov/pubmed/33922736
http://dx.doi.org/10.3390/molecules26092474
_version_ 1783692784521183232
author Vaškevičius, Mantas
Kapočiūtė-Dzikienė, Jurgita
Šlepikas, Liudas
author_facet Vaškevičius, Mantas
Kapočiūtė-Dzikienė, Jurgita
Šlepikas, Liudas
author_sort Vaškevičius, Mantas
collection PubMed
description In this research, a process for developing normal-phase liquid chromatography solvent systems has been proposed. In contrast to the development of conditions via thin-layer chromatography (TLC), this process is based on the architecture of two hierarchically connected neural network-based components. Using a large database of reaction procedures allows those two components to perform an essential role in the machine-learning-based prediction of chromatographic purification conditions, i.e., solvents and the ratio between solvents. In our paper, we build two datasets and test various molecular vectorization approaches, such as extended-connectivity fingerprints, learned embedding, and auto-encoders along with different types of deep neural networks to demonstrate a novel method for modeling chromatographic solvent systems employing two neural networks in sequence. Afterward, we present our findings and provide insights on the most effective methods for solving prediction tasks. Our approach results in a system of two neural networks with long short-term memory (LSTM)-based auto-encoders, where the first predicts solvent labels (by reaching the classification accuracy of 0.950 ± 0.001) and in the case of two solvents, the second one predicts the ratio between two solvents (R(2) metric equal to 0.982 ± 0.001). Our approach can be used as a guidance instrument in laboratories to accelerate scouting for suitable chromatography conditions.
format Online
Article
Text
id pubmed-8123027
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-81230272021-05-16 Prediction of Chromatography Conditions for Purification in Organic Synthesis Using Deep Learning Vaškevičius, Mantas Kapočiūtė-Dzikienė, Jurgita Šlepikas, Liudas Molecules Article In this research, a process for developing normal-phase liquid chromatography solvent systems has been proposed. In contrast to the development of conditions via thin-layer chromatography (TLC), this process is based on the architecture of two hierarchically connected neural network-based components. Using a large database of reaction procedures allows those two components to perform an essential role in the machine-learning-based prediction of chromatographic purification conditions, i.e., solvents and the ratio between solvents. In our paper, we build two datasets and test various molecular vectorization approaches, such as extended-connectivity fingerprints, learned embedding, and auto-encoders along with different types of deep neural networks to demonstrate a novel method for modeling chromatographic solvent systems employing two neural networks in sequence. Afterward, we present our findings and provide insights on the most effective methods for solving prediction tasks. Our approach results in a system of two neural networks with long short-term memory (LSTM)-based auto-encoders, where the first predicts solvent labels (by reaching the classification accuracy of 0.950 ± 0.001) and in the case of two solvents, the second one predicts the ratio between two solvents (R(2) metric equal to 0.982 ± 0.001). Our approach can be used as a guidance instrument in laboratories to accelerate scouting for suitable chromatography conditions. MDPI 2021-04-23 /pmc/articles/PMC8123027/ /pubmed/33922736 http://dx.doi.org/10.3390/molecules26092474 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Vaškevičius, Mantas
Kapočiūtė-Dzikienė, Jurgita
Šlepikas, Liudas
Prediction of Chromatography Conditions for Purification in Organic Synthesis Using Deep Learning
title Prediction of Chromatography Conditions for Purification in Organic Synthesis Using Deep Learning
title_full Prediction of Chromatography Conditions for Purification in Organic Synthesis Using Deep Learning
title_fullStr Prediction of Chromatography Conditions for Purification in Organic Synthesis Using Deep Learning
title_full_unstemmed Prediction of Chromatography Conditions for Purification in Organic Synthesis Using Deep Learning
title_short Prediction of Chromatography Conditions for Purification in Organic Synthesis Using Deep Learning
title_sort prediction of chromatography conditions for purification in organic synthesis using deep learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8123027/
https://www.ncbi.nlm.nih.gov/pubmed/33922736
http://dx.doi.org/10.3390/molecules26092474
work_keys_str_mv AT vaskeviciusmantas predictionofchromatographyconditionsforpurificationinorganicsynthesisusingdeeplearning
AT kapociutedzikienejurgita predictionofchromatographyconditionsforpurificationinorganicsynthesisusingdeeplearning
AT slepikasliudas predictionofchromatographyconditionsforpurificationinorganicsynthesisusingdeeplearning