Cargando…
Molecular property prediction by contrastive learning with attention-guided positive sample selection
MOTIVATION: Predicting molecular properties is one of the fundamental problems in drug design and discovery. In recent years, self-supervised learning (SSL) has shown its promising performance in image recognition, natural language processing, and single-cell data analysis. Contrastive learning (CL)...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10188298/ https://www.ncbi.nlm.nih.gov/pubmed/37079731 http://dx.doi.org/10.1093/bioinformatics/btad258 |
_version_ | 1785042882943516672 |
---|---|
author | Wang, Jinxian Guan, Jihong Zhou, Shuigeng |
author_facet | Wang, Jinxian Guan, Jihong Zhou, Shuigeng |
author_sort | Wang, Jinxian |
collection | PubMed |
description | MOTIVATION: Predicting molecular properties is one of the fundamental problems in drug design and discovery. In recent years, self-supervised learning (SSL) has shown its promising performance in image recognition, natural language processing, and single-cell data analysis. Contrastive learning (CL) is a typical SSL method used to learn the features of data so that the trained model can more effectively distinguish the data. One important issue of CL is how to select positive samples for each training example, which will significantly impact the performance of CL. RESULTS: In this article, we propose a new method for molecular property prediction (MPP) by Contrastive Learning with Attention-guided Positive-sample Selection (CLAPS). First, we generate positive samples for each training example based on an attention-guided selection scheme. Second, we employ a Transformer encoder to extract latent feature vectors and compute the contrastive loss aiming to distinguish positive and negative sample pairs. Finally, we use the trained encoder for predicting molecular properties. Experiments on various benchmark datasets show that our approach outperforms the state-of-the-art (SOTA) methods in most cases. AVAILABILITY AND IMPLEMENTATION: The code is publicly available at https://github.com/wangjx22/CLAPS. |
format | Online Article Text |
id | pubmed-10188298 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-101882982023-05-18 Molecular property prediction by contrastive learning with attention-guided positive sample selection Wang, Jinxian Guan, Jihong Zhou, Shuigeng Bioinformatics Original Paper MOTIVATION: Predicting molecular properties is one of the fundamental problems in drug design and discovery. In recent years, self-supervised learning (SSL) has shown its promising performance in image recognition, natural language processing, and single-cell data analysis. Contrastive learning (CL) is a typical SSL method used to learn the features of data so that the trained model can more effectively distinguish the data. One important issue of CL is how to select positive samples for each training example, which will significantly impact the performance of CL. RESULTS: In this article, we propose a new method for molecular property prediction (MPP) by Contrastive Learning with Attention-guided Positive-sample Selection (CLAPS). First, we generate positive samples for each training example based on an attention-guided selection scheme. Second, we employ a Transformer encoder to extract latent feature vectors and compute the contrastive loss aiming to distinguish positive and negative sample pairs. Finally, we use the trained encoder for predicting molecular properties. Experiments on various benchmark datasets show that our approach outperforms the state-of-the-art (SOTA) methods in most cases. AVAILABILITY AND IMPLEMENTATION: The code is publicly available at https://github.com/wangjx22/CLAPS. Oxford University Press 2023-04-20 /pmc/articles/PMC10188298/ /pubmed/37079731 http://dx.doi.org/10.1093/bioinformatics/btad258 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Paper Wang, Jinxian Guan, Jihong Zhou, Shuigeng Molecular property prediction by contrastive learning with attention-guided positive sample selection |
title | Molecular property prediction by contrastive learning with attention-guided positive sample selection |
title_full | Molecular property prediction by contrastive learning with attention-guided positive sample selection |
title_fullStr | Molecular property prediction by contrastive learning with attention-guided positive sample selection |
title_full_unstemmed | Molecular property prediction by contrastive learning with attention-guided positive sample selection |
title_short | Molecular property prediction by contrastive learning with attention-guided positive sample selection |
title_sort | molecular property prediction by contrastive learning with attention-guided positive sample selection |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10188298/ https://www.ncbi.nlm.nih.gov/pubmed/37079731 http://dx.doi.org/10.1093/bioinformatics/btad258 |
work_keys_str_mv | AT wangjinxian molecularpropertypredictionbycontrastivelearningwithattentionguidedpositivesampleselection AT guanjihong molecularpropertypredictionbycontrastivelearningwithattentionguidedpositivesampleselection AT zhoushuigeng molecularpropertypredictionbycontrastivelearningwithattentionguidedpositivesampleselection |