Cargando…

Integrating Incompatible Assay Data Sets with Deep Preference Learning

[Image: see text] A large amount of bioactivity assay data is already accumulated in public databases, but the integration of these data sets for quantitative structure–activity relationship (QSAR) studies is not straightforward due to differences in experimental methods and settings. We present an...

Descripción completa

Detalles Bibliográficos
Autores principales: Sun, Xiaolin, Tamura, Ryo, Sumita, Masato, Mori, Kenichi, Terayama, Kei, Tsuda, Koji
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2021
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8762726/
https://www.ncbi.nlm.nih.gov/pubmed/35047110
http://dx.doi.org/10.1021/acsmedchemlett.1c00439
_version_ 1784633824429211648
author Sun, Xiaolin
Tamura, Ryo
Sumita, Masato
Mori, Kenichi
Terayama, Kei
Tsuda, Koji
author_facet Sun, Xiaolin
Tamura, Ryo
Sumita, Masato
Mori, Kenichi
Terayama, Kei
Tsuda, Koji
author_sort Sun, Xiaolin
collection PubMed
description [Image: see text] A large amount of bioactivity assay data is already accumulated in public databases, but the integration of these data sets for quantitative structure–activity relationship (QSAR) studies is not straightforward due to differences in experimental methods and settings. We present an efficient deep-learning-based approach called Deep Preference Data Integration (DPDI). For integrating outcome variables of different assay types, a surrogate variable is introduced, and a neural network is trained such that the total order induced by the surrogate variable is maximally consistent with given data sets. In a task of predicting efficacy of factor Xa inhibitors, DPDI successfully integrated 2959 molecules distributed in 129 assay data sets. In most of our experiments, data integration improved prediction accuracy strongly in interpolation and extrapolation tasks, indicating that DPDI is an effective tool for QSAR studies.
format Online
Article
Text
id pubmed-8762726
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-87627262022-01-18 Integrating Incompatible Assay Data Sets with Deep Preference Learning Sun, Xiaolin Tamura, Ryo Sumita, Masato Mori, Kenichi Terayama, Kei Tsuda, Koji ACS Med Chem Lett [Image: see text] A large amount of bioactivity assay data is already accumulated in public databases, but the integration of these data sets for quantitative structure–activity relationship (QSAR) studies is not straightforward due to differences in experimental methods and settings. We present an efficient deep-learning-based approach called Deep Preference Data Integration (DPDI). For integrating outcome variables of different assay types, a surrogate variable is introduced, and a neural network is trained such that the total order induced by the surrogate variable is maximally consistent with given data sets. In a task of predicting efficacy of factor Xa inhibitors, DPDI successfully integrated 2959 molecules distributed in 129 assay data sets. In most of our experiments, data integration improved prediction accuracy strongly in interpolation and extrapolation tasks, indicating that DPDI is an effective tool for QSAR studies. American Chemical Society 2021-12-29 /pmc/articles/PMC8762726/ /pubmed/35047110 http://dx.doi.org/10.1021/acsmedchemlett.1c00439 Text en © 2021 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by-nc-nd/4.0/Permits non-commercial access and re-use, provided that author attribution and integrity are maintained; but does not permit creation of adaptations or other derivative works (https://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Sun, Xiaolin
Tamura, Ryo
Sumita, Masato
Mori, Kenichi
Terayama, Kei
Tsuda, Koji
Integrating Incompatible Assay Data Sets with Deep Preference Learning
title Integrating Incompatible Assay Data Sets with Deep Preference Learning
title_full Integrating Incompatible Assay Data Sets with Deep Preference Learning
title_fullStr Integrating Incompatible Assay Data Sets with Deep Preference Learning
title_full_unstemmed Integrating Incompatible Assay Data Sets with Deep Preference Learning
title_short Integrating Incompatible Assay Data Sets with Deep Preference Learning
title_sort integrating incompatible assay data sets with deep preference learning
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8762726/
https://www.ncbi.nlm.nih.gov/pubmed/35047110
http://dx.doi.org/10.1021/acsmedchemlett.1c00439
work_keys_str_mv AT sunxiaolin integratingincompatibleassaydatasetswithdeeppreferencelearning
AT tamuraryo integratingincompatibleassaydatasetswithdeeppreferencelearning
AT sumitamasato integratingincompatibleassaydatasetswithdeeppreferencelearning
AT morikenichi integratingincompatibleassaydatasetswithdeeppreferencelearning
AT terayamakei integratingincompatibleassaydatasetswithdeeppreferencelearning
AT tsudakoji integratingincompatibleassaydatasetswithdeeppreferencelearning