Cargando…

Predicting Primary Biodegradation of Petroleum Hydrocarbons in Aquatic Systems: Integrating System and Molecular Structure Parameters using a Novel Machine‐Learning Framework

Quantitative structure–property relationship (QSPR) models for predicting primary biodegradation of petroleum hydrocarbons have been previously developed. These models use experimental data generated under widely varied conditions, the effects of which are not captured adequately within model formal...

Descripción completa

Detalles Bibliográficos
Autores principales: Davis, Craig Warren, Camenzuli, Louise, Redman, Aaron D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9320815/
https://www.ncbi.nlm.nih.gov/pubmed/35262215
http://dx.doi.org/10.1002/etc.5328
_version_ 1784755884043272192
author Davis, Craig Warren
Camenzuli, Louise
Redman, Aaron D.
author_facet Davis, Craig Warren
Camenzuli, Louise
Redman, Aaron D.
author_sort Davis, Craig Warren
collection PubMed
description Quantitative structure–property relationship (QSPR) models for predicting primary biodegradation of petroleum hydrocarbons have been previously developed. These models use experimental data generated under widely varied conditions, the effects of which are not captured adequately within model formalisms. As a result, they exhibit variable predictive performance and are unable to incorporate the role of study design and test conditions on the assessment of environmental persistence. To address these limitations, a novel machine‐learning System‐Integrated Model (HC‐BioSIM) is presented, which integrates chemical structure and test system variability, leading to improved prediction of primary disappearance time (DT50) values for petroleum hydrocarbons in fresh and marine water. An expanded, highly curated database of 728 experimental DT50 values (181 unique hydrocarbon structures compiled from 13 primary sources) was used to develop and validate a supervised model tree machine‐learning model. Using relatively few parameters (6 system and 25 structural parameters), the model demonstrated significant improvement in predictive performance (root mean square error = 0.26, R (2) = 0.67) over existing QSPR models. The model also demonstrated improved accuracy of persistence (P) categorization (i.e., “Not P/P/vP”), with an accuracy of 96.8%, and false‐positive and ‐negative categorization rates of 0.4% and 2.7%, respectively. This significant improvement in DT50 prediction, and subsequent persistence categorization, validates the need for models that integrate experimental design and environmental system parameters into biodegradation and persistence assessment. Environ Toxicol Chem 2022;41:1359–1369. © 2022 ExxonMobil Biomedical Sciences, Inc. Environmental Toxicology and Chemistry published by Wiley Periodicals LLC on behalf of SETAC.
format Online
Article
Text
id pubmed-9320815
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-93208152022-07-30 Predicting Primary Biodegradation of Petroleum Hydrocarbons in Aquatic Systems: Integrating System and Molecular Structure Parameters using a Novel Machine‐Learning Framework Davis, Craig Warren Camenzuli, Louise Redman, Aaron D. Environ Toxicol Chem Environmental Chemistry Quantitative structure–property relationship (QSPR) models for predicting primary biodegradation of petroleum hydrocarbons have been previously developed. These models use experimental data generated under widely varied conditions, the effects of which are not captured adequately within model formalisms. As a result, they exhibit variable predictive performance and are unable to incorporate the role of study design and test conditions on the assessment of environmental persistence. To address these limitations, a novel machine‐learning System‐Integrated Model (HC‐BioSIM) is presented, which integrates chemical structure and test system variability, leading to improved prediction of primary disappearance time (DT50) values for petroleum hydrocarbons in fresh and marine water. An expanded, highly curated database of 728 experimental DT50 values (181 unique hydrocarbon structures compiled from 13 primary sources) was used to develop and validate a supervised model tree machine‐learning model. Using relatively few parameters (6 system and 25 structural parameters), the model demonstrated significant improvement in predictive performance (root mean square error = 0.26, R (2) = 0.67) over existing QSPR models. The model also demonstrated improved accuracy of persistence (P) categorization (i.e., “Not P/P/vP”), with an accuracy of 96.8%, and false‐positive and ‐negative categorization rates of 0.4% and 2.7%, respectively. This significant improvement in DT50 prediction, and subsequent persistence categorization, validates the need for models that integrate experimental design and environmental system parameters into biodegradation and persistence assessment. Environ Toxicol Chem 2022;41:1359–1369. © 2022 ExxonMobil Biomedical Sciences, Inc. Environmental Toxicology and Chemistry published by Wiley Periodicals LLC on behalf of SETAC. John Wiley and Sons Inc. 2022-04-29 2022-06 /pmc/articles/PMC9320815/ /pubmed/35262215 http://dx.doi.org/10.1002/etc.5328 Text en © 2022 ExxonMobil Biomedical Sciences, Inc. Environmental Toxicology and Chemistry published by Wiley Periodicals LLC on behalf of SETAC. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) License, which permits use and distribution in any medium, provided the original work is properly cited, the use is non‐commercial and no modifications or adaptations are made.
spellingShingle Environmental Chemistry
Davis, Craig Warren
Camenzuli, Louise
Redman, Aaron D.
Predicting Primary Biodegradation of Petroleum Hydrocarbons in Aquatic Systems: Integrating System and Molecular Structure Parameters using a Novel Machine‐Learning Framework
title Predicting Primary Biodegradation of Petroleum Hydrocarbons in Aquatic Systems: Integrating System and Molecular Structure Parameters using a Novel Machine‐Learning Framework
title_full Predicting Primary Biodegradation of Petroleum Hydrocarbons in Aquatic Systems: Integrating System and Molecular Structure Parameters using a Novel Machine‐Learning Framework
title_fullStr Predicting Primary Biodegradation of Petroleum Hydrocarbons in Aquatic Systems: Integrating System and Molecular Structure Parameters using a Novel Machine‐Learning Framework
title_full_unstemmed Predicting Primary Biodegradation of Petroleum Hydrocarbons in Aquatic Systems: Integrating System and Molecular Structure Parameters using a Novel Machine‐Learning Framework
title_short Predicting Primary Biodegradation of Petroleum Hydrocarbons in Aquatic Systems: Integrating System and Molecular Structure Parameters using a Novel Machine‐Learning Framework
title_sort predicting primary biodegradation of petroleum hydrocarbons in aquatic systems: integrating system and molecular structure parameters using a novel machine‐learning framework
topic Environmental Chemistry
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9320815/
https://www.ncbi.nlm.nih.gov/pubmed/35262215
http://dx.doi.org/10.1002/etc.5328
work_keys_str_mv AT daviscraigwarren predictingprimarybiodegradationofpetroleumhydrocarbonsinaquaticsystemsintegratingsystemandmolecularstructureparametersusinganovelmachinelearningframework
AT camenzulilouise predictingprimarybiodegradationofpetroleumhydrocarbonsinaquaticsystemsintegratingsystemandmolecularstructureparametersusinganovelmachinelearningframework
AT redmanaarond predictingprimarybiodegradationofpetroleumhydrocarbonsinaquaticsystemsintegratingsystemandmolecularstructureparametersusinganovelmachinelearningframework