Cargando…

Systematic Data-Driven Modeling of Bimetallic Catalyst Performance for the Hydrogenation of 5-Ethoxymethylfurfural with Variable Selection and Regularization

[Image: see text] Catalyst development for biorefining applications involves many challenges. Mathematical modeling can be seen as an essential tool in assisting to explain catalyst performance. This paper presents studies on several machine learning (ML) methods that can model the performance of he...

Descripción completa

Detalles Bibliográficos
Autores principales: Uusitalo, Pekka, Sorsa, Aki, Russo Abegão, Fernando, Ohenoja, Markku, Ruusunen, Mika
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2022
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9014324/
https://www.ncbi.nlm.nih.gov/pubmed/35450012
http://dx.doi.org/10.1021/acs.iecr.1c03995
_version_ 1784688183367172096
author Uusitalo, Pekka
Sorsa, Aki
Russo Abegão, Fernando
Ohenoja, Markku
Ruusunen, Mika
author_facet Uusitalo, Pekka
Sorsa, Aki
Russo Abegão, Fernando
Ohenoja, Markku
Ruusunen, Mika
author_sort Uusitalo, Pekka
collection PubMed
description [Image: see text] Catalyst development for biorefining applications involves many challenges. Mathematical modeling can be seen as an essential tool in assisting to explain catalyst performance. This paper presents studies on several machine learning (ML) methods that can model the performance of heterogeneous catalysts with relevant descriptors. A systematic approach for selecting the most appropriate ML method is taken with focus on the variable selection. Regularization algorithms were applied to variable selection. Several different candidate model structures were compared in modeling with interpretation of results. The systematic modeling approach presented aims to highlight the necessary tools and aspects to unexperienced users of ML. Literature datasets for the hydrogenation of 5-ethoxymethylfurfural with simple bimetal catalysts, including main metals and promoters, were studied with the addition of catalyst descriptors found in the literature. Good results were obtained with the best models for estimating conversion, selectivity, and yield with correlations between 0.90 and 0.98. The best identified model structures were support vector regression, Gaussian process regression, and decision tree methods. In general, the use of variable selection procedures was found to improve the performance of models. The modeling methods applied thus seem to exhibit a strong potential in aiding catalyst development based mainly on the information content of descriptor datasets.
format Online
Article
Text
id pubmed-9014324
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-90143242022-04-19 Systematic Data-Driven Modeling of Bimetallic Catalyst Performance for the Hydrogenation of 5-Ethoxymethylfurfural with Variable Selection and Regularization Uusitalo, Pekka Sorsa, Aki Russo Abegão, Fernando Ohenoja, Markku Ruusunen, Mika Ind Eng Chem Res [Image: see text] Catalyst development for biorefining applications involves many challenges. Mathematical modeling can be seen as an essential tool in assisting to explain catalyst performance. This paper presents studies on several machine learning (ML) methods that can model the performance of heterogeneous catalysts with relevant descriptors. A systematic approach for selecting the most appropriate ML method is taken with focus on the variable selection. Regularization algorithms were applied to variable selection. Several different candidate model structures were compared in modeling with interpretation of results. The systematic modeling approach presented aims to highlight the necessary tools and aspects to unexperienced users of ML. Literature datasets for the hydrogenation of 5-ethoxymethylfurfural with simple bimetal catalysts, including main metals and promoters, were studied with the addition of catalyst descriptors found in the literature. Good results were obtained with the best models for estimating conversion, selectivity, and yield with correlations between 0.90 and 0.98. The best identified model structures were support vector regression, Gaussian process regression, and decision tree methods. In general, the use of variable selection procedures was found to improve the performance of models. The modeling methods applied thus seem to exhibit a strong potential in aiding catalyst development based mainly on the information content of descriptor datasets. American Chemical Society 2022-03-31 2022-04-13 /pmc/articles/PMC9014324/ /pubmed/35450012 http://dx.doi.org/10.1021/acs.iecr.1c03995 Text en © 2022 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by/4.0/Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Uusitalo, Pekka
Sorsa, Aki
Russo Abegão, Fernando
Ohenoja, Markku
Ruusunen, Mika
Systematic Data-Driven Modeling of Bimetallic Catalyst Performance for the Hydrogenation of 5-Ethoxymethylfurfural with Variable Selection and Regularization
title Systematic Data-Driven Modeling of Bimetallic Catalyst Performance for the Hydrogenation of 5-Ethoxymethylfurfural with Variable Selection and Regularization
title_full Systematic Data-Driven Modeling of Bimetallic Catalyst Performance for the Hydrogenation of 5-Ethoxymethylfurfural with Variable Selection and Regularization
title_fullStr Systematic Data-Driven Modeling of Bimetallic Catalyst Performance for the Hydrogenation of 5-Ethoxymethylfurfural with Variable Selection and Regularization
title_full_unstemmed Systematic Data-Driven Modeling of Bimetallic Catalyst Performance for the Hydrogenation of 5-Ethoxymethylfurfural with Variable Selection and Regularization
title_short Systematic Data-Driven Modeling of Bimetallic Catalyst Performance for the Hydrogenation of 5-Ethoxymethylfurfural with Variable Selection and Regularization
title_sort systematic data-driven modeling of bimetallic catalyst performance for the hydrogenation of 5-ethoxymethylfurfural with variable selection and regularization
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9014324/
https://www.ncbi.nlm.nih.gov/pubmed/35450012
http://dx.doi.org/10.1021/acs.iecr.1c03995
work_keys_str_mv AT uusitalopekka systematicdatadrivenmodelingofbimetalliccatalystperformanceforthehydrogenationof5ethoxymethylfurfuralwithvariableselectionandregularization
AT sorsaaki systematicdatadrivenmodelingofbimetalliccatalystperformanceforthehydrogenationof5ethoxymethylfurfuralwithvariableselectionandregularization
AT russoabegaofernando systematicdatadrivenmodelingofbimetalliccatalystperformanceforthehydrogenationof5ethoxymethylfurfuralwithvariableselectionandregularization
AT ohenojamarkku systematicdatadrivenmodelingofbimetalliccatalystperformanceforthehydrogenationof5ethoxymethylfurfuralwithvariableselectionandregularization
AT ruusunenmika systematicdatadrivenmodelingofbimetalliccatalystperformanceforthehydrogenationof5ethoxymethylfurfuralwithvariableselectionandregularization