Cargando…
Chemical property prediction under experimental biases
Predicting the chemical properties of compounds is crucial in discovering novel materials and drugs with specific desired characteristics. Recent significant advances in machine learning technologies have enabled automatic predictive modeling from past experimental data reported in the literature. H...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9114131/ https://www.ncbi.nlm.nih.gov/pubmed/35581358 http://dx.doi.org/10.1038/s41598-022-12116-5 |
_version_ | 1784709716642889728 |
---|---|
author | Liu, Yang Kashima, Hisashi |
author_facet | Liu, Yang Kashima, Hisashi |
author_sort | Liu, Yang |
collection | PubMed |
description | Predicting the chemical properties of compounds is crucial in discovering novel materials and drugs with specific desired characteristics. Recent significant advances in machine learning technologies have enabled automatic predictive modeling from past experimental data reported in the literature. However, these datasets are often biased because of various reasons, such as experimental plans and publication decisions, and the prediction models trained using such biased datasets often suffer from over-fitting to the biased distributions and perform poorly on subsequent uses. Hence, this study focused on mitigating bias in the experimental datasets. We adopted two techniques from causal inference combined with graph neural networks that can represent molecular structures. The experimental results in four possible bias scenarios indicated that the inverse propensity scoring-based method and the counter-factual regression-based method made solid improvements. |
format | Online Article Text |
id | pubmed-9114131 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-91141312022-05-19 Chemical property prediction under experimental biases Liu, Yang Kashima, Hisashi Sci Rep Article Predicting the chemical properties of compounds is crucial in discovering novel materials and drugs with specific desired characteristics. Recent significant advances in machine learning technologies have enabled automatic predictive modeling from past experimental data reported in the literature. However, these datasets are often biased because of various reasons, such as experimental plans and publication decisions, and the prediction models trained using such biased datasets often suffer from over-fitting to the biased distributions and perform poorly on subsequent uses. Hence, this study focused on mitigating bias in the experimental datasets. We adopted two techniques from causal inference combined with graph neural networks that can represent molecular structures. The experimental results in four possible bias scenarios indicated that the inverse propensity scoring-based method and the counter-factual regression-based method made solid improvements. Nature Publishing Group UK 2022-05-17 /pmc/articles/PMC9114131/ /pubmed/35581358 http://dx.doi.org/10.1038/s41598-022-12116-5 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Liu, Yang Kashima, Hisashi Chemical property prediction under experimental biases |
title | Chemical property prediction under experimental biases |
title_full | Chemical property prediction under experimental biases |
title_fullStr | Chemical property prediction under experimental biases |
title_full_unstemmed | Chemical property prediction under experimental biases |
title_short | Chemical property prediction under experimental biases |
title_sort | chemical property prediction under experimental biases |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9114131/ https://www.ncbi.nlm.nih.gov/pubmed/35581358 http://dx.doi.org/10.1038/s41598-022-12116-5 |
work_keys_str_mv | AT liuyang chemicalpropertypredictionunderexperimentalbiases AT kashimahisashi chemicalpropertypredictionunderexperimentalbiases |