Cargando…

Predicting aptamer sequences that interact with target proteins using an aptamer-protein interaction classifier and a Monte Carlo tree search approach

Oligonucleotide-based aptamers, which have a three-dimensional structure with a single-stranded fragment, feature various characteristics with respect to size, toxicity, and permeability. Accordingly, aptamers are advantageous in terms of diagnosis and treatment and are materials that can be produce...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Gwangho, Jang, Gun Hyuk, Kang, Ho Young, Song, Giltae
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8232527/
https://www.ncbi.nlm.nih.gov/pubmed/34170922
http://dx.doi.org/10.1371/journal.pone.0253760
_version_ 1783713656304828416
author Lee, Gwangho
Jang, Gun Hyuk
Kang, Ho Young
Song, Giltae
author_facet Lee, Gwangho
Jang, Gun Hyuk
Kang, Ho Young
Song, Giltae
author_sort Lee, Gwangho
collection PubMed
description Oligonucleotide-based aptamers, which have a three-dimensional structure with a single-stranded fragment, feature various characteristics with respect to size, toxicity, and permeability. Accordingly, aptamers are advantageous in terms of diagnosis and treatment and are materials that can be produced through relatively simple experiments. Systematic evolution of ligands by exponential enrichment (SELEX) is one of the most widely used experimental methods for generating aptamers; however, it is highly expensive and time-consuming. To reduce the related costs, recent studies have used in silico approaches, such as aptamer-protein interaction (API) classifiers that use sequence patterns to determine the binding affinity between RNA aptamers and proteins. Some of these methods generate candidate RNA aptamer sequences that bind to a target protein, but they are limited to producing candidates of a specific size. In this study, we present a machine learning approach for selecting candidate sequences of various sizes that have a high binding affinity for a specific sequence of a target protein. We applied the Monte Carlo tree search (MCTS) algorithm for generating the candidate sequences using a score function based on an API classifier. The tree structure that we designed with MCTS enables nucleotide sequence sampling, and the obtained sequences are potential aptamer candidates. We performed a quality assessment using the scores of docking simulations. Our validation datasets revealed that our model showed similar or better docking scores in ZDOCK docking simulations than the known aptamers. We expect that our method, which is size-independent and easy to use, can provide insights into searching for an appropriate aptamer sequence for a target protein during the simulation step of SELEX.
format Online
Article
Text
id pubmed-8232527
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-82325272021-07-07 Predicting aptamer sequences that interact with target proteins using an aptamer-protein interaction classifier and a Monte Carlo tree search approach Lee, Gwangho Jang, Gun Hyuk Kang, Ho Young Song, Giltae PLoS One Research Article Oligonucleotide-based aptamers, which have a three-dimensional structure with a single-stranded fragment, feature various characteristics with respect to size, toxicity, and permeability. Accordingly, aptamers are advantageous in terms of diagnosis and treatment and are materials that can be produced through relatively simple experiments. Systematic evolution of ligands by exponential enrichment (SELEX) is one of the most widely used experimental methods for generating aptamers; however, it is highly expensive and time-consuming. To reduce the related costs, recent studies have used in silico approaches, such as aptamer-protein interaction (API) classifiers that use sequence patterns to determine the binding affinity between RNA aptamers and proteins. Some of these methods generate candidate RNA aptamer sequences that bind to a target protein, but they are limited to producing candidates of a specific size. In this study, we present a machine learning approach for selecting candidate sequences of various sizes that have a high binding affinity for a specific sequence of a target protein. We applied the Monte Carlo tree search (MCTS) algorithm for generating the candidate sequences using a score function based on an API classifier. The tree structure that we designed with MCTS enables nucleotide sequence sampling, and the obtained sequences are potential aptamer candidates. We performed a quality assessment using the scores of docking simulations. Our validation datasets revealed that our model showed similar or better docking scores in ZDOCK docking simulations than the known aptamers. We expect that our method, which is size-independent and easy to use, can provide insights into searching for an appropriate aptamer sequence for a target protein during the simulation step of SELEX. Public Library of Science 2021-06-25 /pmc/articles/PMC8232527/ /pubmed/34170922 http://dx.doi.org/10.1371/journal.pone.0253760 Text en © 2021 Lee et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Lee, Gwangho
Jang, Gun Hyuk
Kang, Ho Young
Song, Giltae
Predicting aptamer sequences that interact with target proteins using an aptamer-protein interaction classifier and a Monte Carlo tree search approach
title Predicting aptamer sequences that interact with target proteins using an aptamer-protein interaction classifier and a Monte Carlo tree search approach
title_full Predicting aptamer sequences that interact with target proteins using an aptamer-protein interaction classifier and a Monte Carlo tree search approach
title_fullStr Predicting aptamer sequences that interact with target proteins using an aptamer-protein interaction classifier and a Monte Carlo tree search approach
title_full_unstemmed Predicting aptamer sequences that interact with target proteins using an aptamer-protein interaction classifier and a Monte Carlo tree search approach
title_short Predicting aptamer sequences that interact with target proteins using an aptamer-protein interaction classifier and a Monte Carlo tree search approach
title_sort predicting aptamer sequences that interact with target proteins using an aptamer-protein interaction classifier and a monte carlo tree search approach
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8232527/
https://www.ncbi.nlm.nih.gov/pubmed/34170922
http://dx.doi.org/10.1371/journal.pone.0253760
work_keys_str_mv AT leegwangho predictingaptamersequencesthatinteractwithtargetproteinsusinganaptamerproteininteractionclassifierandamontecarlotreesearchapproach
AT janggunhyuk predictingaptamersequencesthatinteractwithtargetproteinsusinganaptamerproteininteractionclassifierandamontecarlotreesearchapproach
AT kanghoyoung predictingaptamersequencesthatinteractwithtargetproteinsusinganaptamerproteininteractionclassifierandamontecarlotreesearchapproach
AT songgiltae predictingaptamersequencesthatinteractwithtargetproteinsusinganaptamerproteininteractionclassifierandamontecarlotreesearchapproach