Cargando…
Predicting Primary Biodegradation of Petroleum Hydrocarbons in Aquatic Systems: Integrating System and Molecular Structure Parameters using a Novel Machine‐Learning Framework
Quantitative structure–property relationship (QSPR) models for predicting primary biodegradation of petroleum hydrocarbons have been previously developed. These models use experimental data generated under widely varied conditions, the effects of which are not captured adequately within model formal...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9320815/ https://www.ncbi.nlm.nih.gov/pubmed/35262215 http://dx.doi.org/10.1002/etc.5328 |
_version_ | 1784755884043272192 |
---|---|
author | Davis, Craig Warren Camenzuli, Louise Redman, Aaron D. |
author_facet | Davis, Craig Warren Camenzuli, Louise Redman, Aaron D. |
author_sort | Davis, Craig Warren |
collection | PubMed |
description | Quantitative structure–property relationship (QSPR) models for predicting primary biodegradation of petroleum hydrocarbons have been previously developed. These models use experimental data generated under widely varied conditions, the effects of which are not captured adequately within model formalisms. As a result, they exhibit variable predictive performance and are unable to incorporate the role of study design and test conditions on the assessment of environmental persistence. To address these limitations, a novel machine‐learning System‐Integrated Model (HC‐BioSIM) is presented, which integrates chemical structure and test system variability, leading to improved prediction of primary disappearance time (DT50) values for petroleum hydrocarbons in fresh and marine water. An expanded, highly curated database of 728 experimental DT50 values (181 unique hydrocarbon structures compiled from 13 primary sources) was used to develop and validate a supervised model tree machine‐learning model. Using relatively few parameters (6 system and 25 structural parameters), the model demonstrated significant improvement in predictive performance (root mean square error = 0.26, R (2) = 0.67) over existing QSPR models. The model also demonstrated improved accuracy of persistence (P) categorization (i.e., “Not P/P/vP”), with an accuracy of 96.8%, and false‐positive and ‐negative categorization rates of 0.4% and 2.7%, respectively. This significant improvement in DT50 prediction, and subsequent persistence categorization, validates the need for models that integrate experimental design and environmental system parameters into biodegradation and persistence assessment. Environ Toxicol Chem 2022;41:1359–1369. © 2022 ExxonMobil Biomedical Sciences, Inc. Environmental Toxicology and Chemistry published by Wiley Periodicals LLC on behalf of SETAC. |
format | Online Article Text |
id | pubmed-9320815 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-93208152022-07-30 Predicting Primary Biodegradation of Petroleum Hydrocarbons in Aquatic Systems: Integrating System and Molecular Structure Parameters using a Novel Machine‐Learning Framework Davis, Craig Warren Camenzuli, Louise Redman, Aaron D. Environ Toxicol Chem Environmental Chemistry Quantitative structure–property relationship (QSPR) models for predicting primary biodegradation of petroleum hydrocarbons have been previously developed. These models use experimental data generated under widely varied conditions, the effects of which are not captured adequately within model formalisms. As a result, they exhibit variable predictive performance and are unable to incorporate the role of study design and test conditions on the assessment of environmental persistence. To address these limitations, a novel machine‐learning System‐Integrated Model (HC‐BioSIM) is presented, which integrates chemical structure and test system variability, leading to improved prediction of primary disappearance time (DT50) values for petroleum hydrocarbons in fresh and marine water. An expanded, highly curated database of 728 experimental DT50 values (181 unique hydrocarbon structures compiled from 13 primary sources) was used to develop and validate a supervised model tree machine‐learning model. Using relatively few parameters (6 system and 25 structural parameters), the model demonstrated significant improvement in predictive performance (root mean square error = 0.26, R (2) = 0.67) over existing QSPR models. The model also demonstrated improved accuracy of persistence (P) categorization (i.e., “Not P/P/vP”), with an accuracy of 96.8%, and false‐positive and ‐negative categorization rates of 0.4% and 2.7%, respectively. This significant improvement in DT50 prediction, and subsequent persistence categorization, validates the need for models that integrate experimental design and environmental system parameters into biodegradation and persistence assessment. Environ Toxicol Chem 2022;41:1359–1369. © 2022 ExxonMobil Biomedical Sciences, Inc. Environmental Toxicology and Chemistry published by Wiley Periodicals LLC on behalf of SETAC. John Wiley and Sons Inc. 2022-04-29 2022-06 /pmc/articles/PMC9320815/ /pubmed/35262215 http://dx.doi.org/10.1002/etc.5328 Text en © 2022 ExxonMobil Biomedical Sciences, Inc. Environmental Toxicology and Chemistry published by Wiley Periodicals LLC on behalf of SETAC. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) License, which permits use and distribution in any medium, provided the original work is properly cited, the use is non‐commercial and no modifications or adaptations are made. |
spellingShingle | Environmental Chemistry Davis, Craig Warren Camenzuli, Louise Redman, Aaron D. Predicting Primary Biodegradation of Petroleum Hydrocarbons in Aquatic Systems: Integrating System and Molecular Structure Parameters using a Novel Machine‐Learning Framework |
title | Predicting Primary Biodegradation of Petroleum Hydrocarbons in Aquatic Systems: Integrating System and Molecular Structure Parameters using a Novel Machine‐Learning Framework |
title_full | Predicting Primary Biodegradation of Petroleum Hydrocarbons in Aquatic Systems: Integrating System and Molecular Structure Parameters using a Novel Machine‐Learning Framework |
title_fullStr | Predicting Primary Biodegradation of Petroleum Hydrocarbons in Aquatic Systems: Integrating System and Molecular Structure Parameters using a Novel Machine‐Learning Framework |
title_full_unstemmed | Predicting Primary Biodegradation of Petroleum Hydrocarbons in Aquatic Systems: Integrating System and Molecular Structure Parameters using a Novel Machine‐Learning Framework |
title_short | Predicting Primary Biodegradation of Petroleum Hydrocarbons in Aquatic Systems: Integrating System and Molecular Structure Parameters using a Novel Machine‐Learning Framework |
title_sort | predicting primary biodegradation of petroleum hydrocarbons in aquatic systems: integrating system and molecular structure parameters using a novel machine‐learning framework |
topic | Environmental Chemistry |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9320815/ https://www.ncbi.nlm.nih.gov/pubmed/35262215 http://dx.doi.org/10.1002/etc.5328 |
work_keys_str_mv | AT daviscraigwarren predictingprimarybiodegradationofpetroleumhydrocarbonsinaquaticsystemsintegratingsystemandmolecularstructureparametersusinganovelmachinelearningframework AT camenzulilouise predictingprimarybiodegradationofpetroleumhydrocarbonsinaquaticsystemsintegratingsystemandmolecularstructureparametersusinganovelmachinelearningframework AT redmanaarond predictingprimarybiodegradationofpetroleumhydrocarbonsinaquaticsystemsintegratingsystemandmolecularstructureparametersusinganovelmachinelearningframework |