Cargando…

AptaTrans: a deep neural network for predicting aptamer-protein interaction using pretrained encoders

BACKGROUND: Aptamers, which are biomaterials comprised of single-stranded DNA/RNA that form tertiary structures, have significant potential as next-generation materials, particularly for drug discovery. The systematic evolution of ligands by exponential enrichment (SELEX) method is a critical in vit...

Descripción completa

Detalles Bibliográficos
Autores principales: Shin, Incheol, Kang, Keumseok, Kim, Juseong, Sel, Sanghun, Choi, Jeonghoon, Lee, Jae-Wook, Kang, Ho Young, Song, Giltae
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10680337/
https://www.ncbi.nlm.nih.gov/pubmed/38012571
http://dx.doi.org/10.1186/s12859-023-05577-6
_version_ 1785150705685757952
author Shin, Incheol
Kang, Keumseok
Kim, Juseong
Sel, Sanghun
Choi, Jeonghoon
Lee, Jae-Wook
Kang, Ho Young
Song, Giltae
author_facet Shin, Incheol
Kang, Keumseok
Kim, Juseong
Sel, Sanghun
Choi, Jeonghoon
Lee, Jae-Wook
Kang, Ho Young
Song, Giltae
author_sort Shin, Incheol
collection PubMed
description BACKGROUND: Aptamers, which are biomaterials comprised of single-stranded DNA/RNA that form tertiary structures, have significant potential as next-generation materials, particularly for drug discovery. The systematic evolution of ligands by exponential enrichment (SELEX) method is a critical in vitro technique employed to identify aptamers that bind specifically to target proteins. While advanced SELEX-based methods such as Cell- and HT-SELEX are available, they often encounter issues such as extended time consumption and suboptimal accuracy. Several In silico aptamer discovery methods have been proposed to address these challenges. These methods are specifically designed to predict aptamer-protein interaction (API) using benchmark datasets. However, these methods often fail to consider the physicochemical interactions between aptamers and proteins within tertiary structures. RESULTS: In this study, we propose AptaTrans, a pipeline for predicting API using deep learning techniques. AptaTrans uses transformer-based encoders to handle aptamer and protein sequences at the monomer level. Furthermore, pretrained encoders are utilized for the structural representation. After validation with a benchmark dataset, AptaTrans has been integrated into a comprehensive toolset. This pipeline synergistically combines with Apta-MCTS, a generative algorithm for recommending aptamer candidates. CONCLUSION: The results show that AptaTrans outperforms existing models for predicting API, and the efficacy of the AptaTrans pipeline has been confirmed through various experimental tools. We expect AptaTrans will enhance the cost-effectiveness and efficiency of SELEX in drug discovery. The source code and benchmark dataset for AptaTrans are available at https://github.com/pnumlb/AptaTrans.
format Online
Article
Text
id pubmed-10680337
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-106803372023-11-27 AptaTrans: a deep neural network for predicting aptamer-protein interaction using pretrained encoders Shin, Incheol Kang, Keumseok Kim, Juseong Sel, Sanghun Choi, Jeonghoon Lee, Jae-Wook Kang, Ho Young Song, Giltae BMC Bioinformatics Research BACKGROUND: Aptamers, which are biomaterials comprised of single-stranded DNA/RNA that form tertiary structures, have significant potential as next-generation materials, particularly for drug discovery. The systematic evolution of ligands by exponential enrichment (SELEX) method is a critical in vitro technique employed to identify aptamers that bind specifically to target proteins. While advanced SELEX-based methods such as Cell- and HT-SELEX are available, they often encounter issues such as extended time consumption and suboptimal accuracy. Several In silico aptamer discovery methods have been proposed to address these challenges. These methods are specifically designed to predict aptamer-protein interaction (API) using benchmark datasets. However, these methods often fail to consider the physicochemical interactions between aptamers and proteins within tertiary structures. RESULTS: In this study, we propose AptaTrans, a pipeline for predicting API using deep learning techniques. AptaTrans uses transformer-based encoders to handle aptamer and protein sequences at the monomer level. Furthermore, pretrained encoders are utilized for the structural representation. After validation with a benchmark dataset, AptaTrans has been integrated into a comprehensive toolset. This pipeline synergistically combines with Apta-MCTS, a generative algorithm for recommending aptamer candidates. CONCLUSION: The results show that AptaTrans outperforms existing models for predicting API, and the efficacy of the AptaTrans pipeline has been confirmed through various experimental tools. We expect AptaTrans will enhance the cost-effectiveness and efficiency of SELEX in drug discovery. The source code and benchmark dataset for AptaTrans are available at https://github.com/pnumlb/AptaTrans. BioMed Central 2023-11-27 /pmc/articles/PMC10680337/ /pubmed/38012571 http://dx.doi.org/10.1186/s12859-023-05577-6 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Shin, Incheol
Kang, Keumseok
Kim, Juseong
Sel, Sanghun
Choi, Jeonghoon
Lee, Jae-Wook
Kang, Ho Young
Song, Giltae
AptaTrans: a deep neural network for predicting aptamer-protein interaction using pretrained encoders
title AptaTrans: a deep neural network for predicting aptamer-protein interaction using pretrained encoders
title_full AptaTrans: a deep neural network for predicting aptamer-protein interaction using pretrained encoders
title_fullStr AptaTrans: a deep neural network for predicting aptamer-protein interaction using pretrained encoders
title_full_unstemmed AptaTrans: a deep neural network for predicting aptamer-protein interaction using pretrained encoders
title_short AptaTrans: a deep neural network for predicting aptamer-protein interaction using pretrained encoders
title_sort aptatrans: a deep neural network for predicting aptamer-protein interaction using pretrained encoders
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10680337/
https://www.ncbi.nlm.nih.gov/pubmed/38012571
http://dx.doi.org/10.1186/s12859-023-05577-6
work_keys_str_mv AT shinincheol aptatransadeepneuralnetworkforpredictingaptamerproteininteractionusingpretrainedencoders
AT kangkeumseok aptatransadeepneuralnetworkforpredictingaptamerproteininteractionusingpretrainedencoders
AT kimjuseong aptatransadeepneuralnetworkforpredictingaptamerproteininteractionusingpretrainedencoders
AT selsanghun aptatransadeepneuralnetworkforpredictingaptamerproteininteractionusingpretrainedencoders
AT choijeonghoon aptatransadeepneuralnetworkforpredictingaptamerproteininteractionusingpretrainedencoders
AT leejaewook aptatransadeepneuralnetworkforpredictingaptamerproteininteractionusingpretrainedencoders
AT kanghoyoung aptatransadeepneuralnetworkforpredictingaptamerproteininteractionusingpretrainedencoders
AT songgiltae aptatransadeepneuralnetworkforpredictingaptamerproteininteractionusingpretrainedencoders