Cargando…
AptaTrans: a deep neural network for predicting aptamer-protein interaction using pretrained encoders
BACKGROUND: Aptamers, which are biomaterials comprised of single-stranded DNA/RNA that form tertiary structures, have significant potential as next-generation materials, particularly for drug discovery. The systematic evolution of ligands by exponential enrichment (SELEX) method is a critical in vit...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10680337/ https://www.ncbi.nlm.nih.gov/pubmed/38012571 http://dx.doi.org/10.1186/s12859-023-05577-6 |
_version_ | 1785150705685757952 |
---|---|
author | Shin, Incheol Kang, Keumseok Kim, Juseong Sel, Sanghun Choi, Jeonghoon Lee, Jae-Wook Kang, Ho Young Song, Giltae |
author_facet | Shin, Incheol Kang, Keumseok Kim, Juseong Sel, Sanghun Choi, Jeonghoon Lee, Jae-Wook Kang, Ho Young Song, Giltae |
author_sort | Shin, Incheol |
collection | PubMed |
description | BACKGROUND: Aptamers, which are biomaterials comprised of single-stranded DNA/RNA that form tertiary structures, have significant potential as next-generation materials, particularly for drug discovery. The systematic evolution of ligands by exponential enrichment (SELEX) method is a critical in vitro technique employed to identify aptamers that bind specifically to target proteins. While advanced SELEX-based methods such as Cell- and HT-SELEX are available, they often encounter issues such as extended time consumption and suboptimal accuracy. Several In silico aptamer discovery methods have been proposed to address these challenges. These methods are specifically designed to predict aptamer-protein interaction (API) using benchmark datasets. However, these methods often fail to consider the physicochemical interactions between aptamers and proteins within tertiary structures. RESULTS: In this study, we propose AptaTrans, a pipeline for predicting API using deep learning techniques. AptaTrans uses transformer-based encoders to handle aptamer and protein sequences at the monomer level. Furthermore, pretrained encoders are utilized for the structural representation. After validation with a benchmark dataset, AptaTrans has been integrated into a comprehensive toolset. This pipeline synergistically combines with Apta-MCTS, a generative algorithm for recommending aptamer candidates. CONCLUSION: The results show that AptaTrans outperforms existing models for predicting API, and the efficacy of the AptaTrans pipeline has been confirmed through various experimental tools. We expect AptaTrans will enhance the cost-effectiveness and efficiency of SELEX in drug discovery. The source code and benchmark dataset for AptaTrans are available at https://github.com/pnumlb/AptaTrans. |
format | Online Article Text |
id | pubmed-10680337 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-106803372023-11-27 AptaTrans: a deep neural network for predicting aptamer-protein interaction using pretrained encoders Shin, Incheol Kang, Keumseok Kim, Juseong Sel, Sanghun Choi, Jeonghoon Lee, Jae-Wook Kang, Ho Young Song, Giltae BMC Bioinformatics Research BACKGROUND: Aptamers, which are biomaterials comprised of single-stranded DNA/RNA that form tertiary structures, have significant potential as next-generation materials, particularly for drug discovery. The systematic evolution of ligands by exponential enrichment (SELEX) method is a critical in vitro technique employed to identify aptamers that bind specifically to target proteins. While advanced SELEX-based methods such as Cell- and HT-SELEX are available, they often encounter issues such as extended time consumption and suboptimal accuracy. Several In silico aptamer discovery methods have been proposed to address these challenges. These methods are specifically designed to predict aptamer-protein interaction (API) using benchmark datasets. However, these methods often fail to consider the physicochemical interactions between aptamers and proteins within tertiary structures. RESULTS: In this study, we propose AptaTrans, a pipeline for predicting API using deep learning techniques. AptaTrans uses transformer-based encoders to handle aptamer and protein sequences at the monomer level. Furthermore, pretrained encoders are utilized for the structural representation. After validation with a benchmark dataset, AptaTrans has been integrated into a comprehensive toolset. This pipeline synergistically combines with Apta-MCTS, a generative algorithm for recommending aptamer candidates. CONCLUSION: The results show that AptaTrans outperforms existing models for predicting API, and the efficacy of the AptaTrans pipeline has been confirmed through various experimental tools. We expect AptaTrans will enhance the cost-effectiveness and efficiency of SELEX in drug discovery. The source code and benchmark dataset for AptaTrans are available at https://github.com/pnumlb/AptaTrans. BioMed Central 2023-11-27 /pmc/articles/PMC10680337/ /pubmed/38012571 http://dx.doi.org/10.1186/s12859-023-05577-6 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Shin, Incheol Kang, Keumseok Kim, Juseong Sel, Sanghun Choi, Jeonghoon Lee, Jae-Wook Kang, Ho Young Song, Giltae AptaTrans: a deep neural network for predicting aptamer-protein interaction using pretrained encoders |
title | AptaTrans: a deep neural network for predicting aptamer-protein interaction using pretrained encoders |
title_full | AptaTrans: a deep neural network for predicting aptamer-protein interaction using pretrained encoders |
title_fullStr | AptaTrans: a deep neural network for predicting aptamer-protein interaction using pretrained encoders |
title_full_unstemmed | AptaTrans: a deep neural network for predicting aptamer-protein interaction using pretrained encoders |
title_short | AptaTrans: a deep neural network for predicting aptamer-protein interaction using pretrained encoders |
title_sort | aptatrans: a deep neural network for predicting aptamer-protein interaction using pretrained encoders |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10680337/ https://www.ncbi.nlm.nih.gov/pubmed/38012571 http://dx.doi.org/10.1186/s12859-023-05577-6 |
work_keys_str_mv | AT shinincheol aptatransadeepneuralnetworkforpredictingaptamerproteininteractionusingpretrainedencoders AT kangkeumseok aptatransadeepneuralnetworkforpredictingaptamerproteininteractionusingpretrainedencoders AT kimjuseong aptatransadeepneuralnetworkforpredictingaptamerproteininteractionusingpretrainedencoders AT selsanghun aptatransadeepneuralnetworkforpredictingaptamerproteininteractionusingpretrainedencoders AT choijeonghoon aptatransadeepneuralnetworkforpredictingaptamerproteininteractionusingpretrainedencoders AT leejaewook aptatransadeepneuralnetworkforpredictingaptamerproteininteractionusingpretrainedencoders AT kanghoyoung aptatransadeepneuralnetworkforpredictingaptamerproteininteractionusingpretrainedencoders AT songgiltae aptatransadeepneuralnetworkforpredictingaptamerproteininteractionusingpretrainedencoders |