Cargando…
Integrating Incompatible Assay Data Sets with Deep Preference Learning
[Image: see text] A large amount of bioactivity assay data is already accumulated in public databases, but the integration of these data sets for quantitative structure–activity relationship (QSAR) studies is not straightforward due to differences in experimental methods and settings. We present an...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Chemical Society
2021
|
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8762726/ https://www.ncbi.nlm.nih.gov/pubmed/35047110 http://dx.doi.org/10.1021/acsmedchemlett.1c00439 |
_version_ | 1784633824429211648 |
---|---|
author | Sun, Xiaolin Tamura, Ryo Sumita, Masato Mori, Kenichi Terayama, Kei Tsuda, Koji |
author_facet | Sun, Xiaolin Tamura, Ryo Sumita, Masato Mori, Kenichi Terayama, Kei Tsuda, Koji |
author_sort | Sun, Xiaolin |
collection | PubMed |
description | [Image: see text] A large amount of bioactivity assay data is already accumulated in public databases, but the integration of these data sets for quantitative structure–activity relationship (QSAR) studies is not straightforward due to differences in experimental methods and settings. We present an efficient deep-learning-based approach called Deep Preference Data Integration (DPDI). For integrating outcome variables of different assay types, a surrogate variable is introduced, and a neural network is trained such that the total order induced by the surrogate variable is maximally consistent with given data sets. In a task of predicting efficacy of factor Xa inhibitors, DPDI successfully integrated 2959 molecules distributed in 129 assay data sets. In most of our experiments, data integration improved prediction accuracy strongly in interpolation and extrapolation tasks, indicating that DPDI is an effective tool for QSAR studies. |
format | Online Article Text |
id | pubmed-8762726 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | American Chemical Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-87627262022-01-18 Integrating Incompatible Assay Data Sets with Deep Preference Learning Sun, Xiaolin Tamura, Ryo Sumita, Masato Mori, Kenichi Terayama, Kei Tsuda, Koji ACS Med Chem Lett [Image: see text] A large amount of bioactivity assay data is already accumulated in public databases, but the integration of these data sets for quantitative structure–activity relationship (QSAR) studies is not straightforward due to differences in experimental methods and settings. We present an efficient deep-learning-based approach called Deep Preference Data Integration (DPDI). For integrating outcome variables of different assay types, a surrogate variable is introduced, and a neural network is trained such that the total order induced by the surrogate variable is maximally consistent with given data sets. In a task of predicting efficacy of factor Xa inhibitors, DPDI successfully integrated 2959 molecules distributed in 129 assay data sets. In most of our experiments, data integration improved prediction accuracy strongly in interpolation and extrapolation tasks, indicating that DPDI is an effective tool for QSAR studies. American Chemical Society 2021-12-29 /pmc/articles/PMC8762726/ /pubmed/35047110 http://dx.doi.org/10.1021/acsmedchemlett.1c00439 Text en © 2021 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by-nc-nd/4.0/Permits non-commercial access and re-use, provided that author attribution and integrity are maintained; but does not permit creation of adaptations or other derivative works (https://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Sun, Xiaolin Tamura, Ryo Sumita, Masato Mori, Kenichi Terayama, Kei Tsuda, Koji Integrating Incompatible Assay Data Sets with Deep Preference Learning |
title | Integrating Incompatible Assay Data Sets with Deep
Preference Learning |
title_full | Integrating Incompatible Assay Data Sets with Deep
Preference Learning |
title_fullStr | Integrating Incompatible Assay Data Sets with Deep
Preference Learning |
title_full_unstemmed | Integrating Incompatible Assay Data Sets with Deep
Preference Learning |
title_short | Integrating Incompatible Assay Data Sets with Deep
Preference Learning |
title_sort | integrating incompatible assay data sets with deep
preference learning |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8762726/ https://www.ncbi.nlm.nih.gov/pubmed/35047110 http://dx.doi.org/10.1021/acsmedchemlett.1c00439 |
work_keys_str_mv | AT sunxiaolin integratingincompatibleassaydatasetswithdeeppreferencelearning AT tamuraryo integratingincompatibleassaydatasetswithdeeppreferencelearning AT sumitamasato integratingincompatibleassaydatasetswithdeeppreferencelearning AT morikenichi integratingincompatibleassaydatasetswithdeeppreferencelearning AT terayamakei integratingincompatibleassaydatasetswithdeeppreferencelearning AT tsudakoji integratingincompatibleassaydatasetswithdeeppreferencelearning |