Cargando…

Chemical property prediction under experimental biases

Predicting the chemical properties of compounds is crucial in discovering novel materials and drugs with specific desired characteristics. Recent significant advances in machine learning technologies have enabled automatic predictive modeling from past experimental data reported in the literature. H...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Yang, Kashima, Hisashi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9114131/
https://www.ncbi.nlm.nih.gov/pubmed/35581358
http://dx.doi.org/10.1038/s41598-022-12116-5
_version_ 1784709716642889728
author Liu, Yang
Kashima, Hisashi
author_facet Liu, Yang
Kashima, Hisashi
author_sort Liu, Yang
collection PubMed
description Predicting the chemical properties of compounds is crucial in discovering novel materials and drugs with specific desired characteristics. Recent significant advances in machine learning technologies have enabled automatic predictive modeling from past experimental data reported in the literature. However, these datasets are often biased because of various reasons, such as experimental plans and publication decisions, and the prediction models trained using such biased datasets often suffer from over-fitting to the biased distributions and perform poorly on subsequent uses. Hence, this study focused on mitigating bias in the experimental datasets. We adopted two techniques from causal inference combined with graph neural networks that can represent molecular structures. The experimental results in four possible bias scenarios indicated that the inverse propensity scoring-based method and the counter-factual regression-based method made solid improvements.
format Online
Article
Text
id pubmed-9114131
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-91141312022-05-19 Chemical property prediction under experimental biases Liu, Yang Kashima, Hisashi Sci Rep Article Predicting the chemical properties of compounds is crucial in discovering novel materials and drugs with specific desired characteristics. Recent significant advances in machine learning technologies have enabled automatic predictive modeling from past experimental data reported in the literature. However, these datasets are often biased because of various reasons, such as experimental plans and publication decisions, and the prediction models trained using such biased datasets often suffer from over-fitting to the biased distributions and perform poorly on subsequent uses. Hence, this study focused on mitigating bias in the experimental datasets. We adopted two techniques from causal inference combined with graph neural networks that can represent molecular structures. The experimental results in four possible bias scenarios indicated that the inverse propensity scoring-based method and the counter-factual regression-based method made solid improvements. Nature Publishing Group UK 2022-05-17 /pmc/articles/PMC9114131/ /pubmed/35581358 http://dx.doi.org/10.1038/s41598-022-12116-5 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Liu, Yang
Kashima, Hisashi
Chemical property prediction under experimental biases
title Chemical property prediction under experimental biases
title_full Chemical property prediction under experimental biases
title_fullStr Chemical property prediction under experimental biases
title_full_unstemmed Chemical property prediction under experimental biases
title_short Chemical property prediction under experimental biases
title_sort chemical property prediction under experimental biases
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9114131/
https://www.ncbi.nlm.nih.gov/pubmed/35581358
http://dx.doi.org/10.1038/s41598-022-12116-5
work_keys_str_mv AT liuyang chemicalpropertypredictionunderexperimentalbiases
AT kashimahisashi chemicalpropertypredictionunderexperimentalbiases