Cargando…

Machine-learning-based predictions of imprinting quality using ensemble and non-linear regression algorithms

The molecularly imprinted polymers are artificial polymers that, during the synthesis, create specific sites for a definite purpose. These polymers due to their characteristics such as stability, easy of synthesis, reproducibility, reusability, high accuracy, and selectivity have many applications....

Descripción completa

Detalles Bibliográficos
Autores principales:	Yarahmadi, Bita, Hashemianzadeh, Seyed Majid, Milani Hosseini, Seyed Mohammad-Reza
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Nature Publishing Group UK 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10372080/ https://www.ncbi.nlm.nih.gov/pubmed/37495673 http://dx.doi.org/10.1038/s41598-023-39374-1

_version_	1785078291571408896
author	Yarahmadi, Bita Hashemianzadeh, Seyed Majid Milani Hosseini, Seyed Mohammad-Reza
author_facet	Yarahmadi, Bita Hashemianzadeh, Seyed Majid Milani Hosseini, Seyed Mohammad-Reza
author_sort	Yarahmadi, Bita
collection	PubMed
description	The molecularly imprinted polymers are artificial polymers that, during the synthesis, create specific sites for a definite purpose. These polymers due to their characteristics such as stability, easy of synthesis, reproducibility, reusability, high accuracy, and selectivity have many applications. However, the variety of the functional monomers, templates, solvents, and synthesis conditions like pH, temperature, the rate of stirring, and time, limit the selectivity of imprinting. The Practical optimization of the synthetic conditions has many drawbacks, including chemical compound usage, equipment requirements, and time costs. The use of machine learning (ML) for the prediction of the imprinting factor (IF), which indicates the quality of imprinting is a very interesting idea to overcome these problems. The ML has many advantages, for example a lack of human error, high accuracy, high repeatability, and prediction of a large amount of data in the minimum time. In this research, ML was used to predict the IF using non-linear regression algorithms, including classification and regression tree, support vector regression, and k-nearest neighbors, and ensemble algorithms, like gradient boosting (GB), random forest, and extra trees. The data sets were obtained practically in the laboratory, and inputs, included pH, the type of the template, the type of the monomer, solvent, the distribution coefficient of the MIP (K(MIP)), and the distribution coefficient of the non-imprinted polymer (K(NIP)). The mutual information feature selection method was used to select the important features affecting the IF. The results showed that the GB algorithm had the best performance in predicting the IF, and using this algorithm, the maximum R(2) value (R(2) = 0.871), and the minimum mean absolute error (MAE = − 0.982), and mean square error were obtained (MSE = − 2.303).
format	Online Article Text
id	pubmed-10372080
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Nature Publishing Group UK
record_format	MEDLINE/PubMed
spelling	pubmed-103720802023-07-28 Machine-learning-based predictions of imprinting quality using ensemble and non-linear regression algorithms Yarahmadi, Bita Hashemianzadeh, Seyed Majid Milani Hosseini, Seyed Mohammad-Reza Sci Rep Article The molecularly imprinted polymers are artificial polymers that, during the synthesis, create specific sites for a definite purpose. These polymers due to their characteristics such as stability, easy of synthesis, reproducibility, reusability, high accuracy, and selectivity have many applications. However, the variety of the functional monomers, templates, solvents, and synthesis conditions like pH, temperature, the rate of stirring, and time, limit the selectivity of imprinting. The Practical optimization of the synthetic conditions has many drawbacks, including chemical compound usage, equipment requirements, and time costs. The use of machine learning (ML) for the prediction of the imprinting factor (IF), which indicates the quality of imprinting is a very interesting idea to overcome these problems. The ML has many advantages, for example a lack of human error, high accuracy, high repeatability, and prediction of a large amount of data in the minimum time. In this research, ML was used to predict the IF using non-linear regression algorithms, including classification and regression tree, support vector regression, and k-nearest neighbors, and ensemble algorithms, like gradient boosting (GB), random forest, and extra trees. The data sets were obtained practically in the laboratory, and inputs, included pH, the type of the template, the type of the monomer, solvent, the distribution coefficient of the MIP (K(MIP)), and the distribution coefficient of the non-imprinted polymer (K(NIP)). The mutual information feature selection method was used to select the important features affecting the IF. The results showed that the GB algorithm had the best performance in predicting the IF, and using this algorithm, the maximum R(2) value (R(2) = 0.871), and the minimum mean absolute error (MAE = − 0.982), and mean square error were obtained (MSE = − 2.303). Nature Publishing Group UK 2023-07-26 /pmc/articles/PMC10372080/ /pubmed/37495673 http://dx.doi.org/10.1038/s41598-023-39374-1 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle	Article Yarahmadi, Bita Hashemianzadeh, Seyed Majid Milani Hosseini, Seyed Mohammad-Reza Machine-learning-based predictions of imprinting quality using ensemble and non-linear regression algorithms
title	Machine-learning-based predictions of imprinting quality using ensemble and non-linear regression algorithms
title_full	Machine-learning-based predictions of imprinting quality using ensemble and non-linear regression algorithms
title_fullStr	Machine-learning-based predictions of imprinting quality using ensemble and non-linear regression algorithms
title_full_unstemmed	Machine-learning-based predictions of imprinting quality using ensemble and non-linear regression algorithms
title_short	Machine-learning-based predictions of imprinting quality using ensemble and non-linear regression algorithms
title_sort	machine-learning-based predictions of imprinting quality using ensemble and non-linear regression algorithms
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10372080/ https://www.ncbi.nlm.nih.gov/pubmed/37495673 http://dx.doi.org/10.1038/s41598-023-39374-1
work_keys_str_mv	AT yarahmadibita machinelearningbasedpredictionsofimprintingqualityusingensembleandnonlinearregressionalgorithms AT hashemianzadehseyedmajid machinelearningbasedpredictionsofimprintingqualityusingensembleandnonlinearregressionalgorithms AT milanihosseiniseyedmohammadreza machinelearningbasedpredictionsofimprintingqualityusingensembleandnonlinearregressionalgorithms

Machine-learning-based predictions of imprinting quality using ensemble and non-linear regression algorithms

Ejemplares similares