Cargando…

Prospection of Peptide Inhibitors of Thrombin from Diverse Origins Using a Machine Learning Pipeline

Thrombin is a key enzyme involved in the development and progression of many cardiovascular diseases. Direct thrombin inhibitors (DTIs), with their minimum off-target effects and immediacy of action, have greatly improved the treatment of these diseases. However, the risk of bleeding, pharmacokineti...

Descripción completa

Detalles Bibliográficos
Autores principales: Balakrishnan, Nivedha, Katkar, Rahul, Pham, Peter V., Downey, Taylor, Kashyap, Prarthna, Anastasiu, David C., Ramasubramanian, Anand K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10669389/
https://www.ncbi.nlm.nih.gov/pubmed/38002424
http://dx.doi.org/10.3390/bioengineering10111300
_version_ 1785139685170872320
author Balakrishnan, Nivedha
Katkar, Rahul
Pham, Peter V.
Downey, Taylor
Kashyap, Prarthna
Anastasiu, David C.
Ramasubramanian, Anand K.
author_facet Balakrishnan, Nivedha
Katkar, Rahul
Pham, Peter V.
Downey, Taylor
Kashyap, Prarthna
Anastasiu, David C.
Ramasubramanian, Anand K.
author_sort Balakrishnan, Nivedha
collection PubMed
description Thrombin is a key enzyme involved in the development and progression of many cardiovascular diseases. Direct thrombin inhibitors (DTIs), with their minimum off-target effects and immediacy of action, have greatly improved the treatment of these diseases. However, the risk of bleeding, pharmacokinetic issues, and thrombotic complications remain major concerns. In an effort to increase the effectiveness of the DTI discovery pipeline, we developed a two-stage machine learning pipeline to identify and rank peptide sequences based on their effective thrombin inhibitory potential. The positive dataset for our model consisted of thrombin inhibitor peptides and their binding affinities (K(I)) curated from published literature, and the negative dataset consisted of peptides with no known thrombin inhibitory or related activity. The first stage of the model identified thrombin inhibitory sequences with Matthew’s Correlation Coefficient (MCC) of 83.6%. The second stage of the model, which covers an eight-order of magnitude range in K(I) values, predicted the binding affinity of new sequences with a log room mean square error (RMSE) of 1.114. These models also revealed physicochemical and structural characteristics that are hidden but unique to thrombin inhibitor peptides. Using the model, we classified more than 10 million peptides from diverse sources and identified unique short peptide sequences (<15 aa) of interest, based on their predicted K(I). Based on the binding energies of the interaction of the peptide with thrombin, we identified a promising set of putative DTI candidates. The prediction pipeline is available on a web server.
format Online
Article
Text
id pubmed-10669389
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-106693892023-11-09 Prospection of Peptide Inhibitors of Thrombin from Diverse Origins Using a Machine Learning Pipeline Balakrishnan, Nivedha Katkar, Rahul Pham, Peter V. Downey, Taylor Kashyap, Prarthna Anastasiu, David C. Ramasubramanian, Anand K. Bioengineering (Basel) Article Thrombin is a key enzyme involved in the development and progression of many cardiovascular diseases. Direct thrombin inhibitors (DTIs), with their minimum off-target effects and immediacy of action, have greatly improved the treatment of these diseases. However, the risk of bleeding, pharmacokinetic issues, and thrombotic complications remain major concerns. In an effort to increase the effectiveness of the DTI discovery pipeline, we developed a two-stage machine learning pipeline to identify and rank peptide sequences based on their effective thrombin inhibitory potential. The positive dataset for our model consisted of thrombin inhibitor peptides and their binding affinities (K(I)) curated from published literature, and the negative dataset consisted of peptides with no known thrombin inhibitory or related activity. The first stage of the model identified thrombin inhibitory sequences with Matthew’s Correlation Coefficient (MCC) of 83.6%. The second stage of the model, which covers an eight-order of magnitude range in K(I) values, predicted the binding affinity of new sequences with a log room mean square error (RMSE) of 1.114. These models also revealed physicochemical and structural characteristics that are hidden but unique to thrombin inhibitor peptides. Using the model, we classified more than 10 million peptides from diverse sources and identified unique short peptide sequences (<15 aa) of interest, based on their predicted K(I). Based on the binding energies of the interaction of the peptide with thrombin, we identified a promising set of putative DTI candidates. The prediction pipeline is available on a web server. MDPI 2023-11-09 /pmc/articles/PMC10669389/ /pubmed/38002424 http://dx.doi.org/10.3390/bioengineering10111300 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Balakrishnan, Nivedha
Katkar, Rahul
Pham, Peter V.
Downey, Taylor
Kashyap, Prarthna
Anastasiu, David C.
Ramasubramanian, Anand K.
Prospection of Peptide Inhibitors of Thrombin from Diverse Origins Using a Machine Learning Pipeline
title Prospection of Peptide Inhibitors of Thrombin from Diverse Origins Using a Machine Learning Pipeline
title_full Prospection of Peptide Inhibitors of Thrombin from Diverse Origins Using a Machine Learning Pipeline
title_fullStr Prospection of Peptide Inhibitors of Thrombin from Diverse Origins Using a Machine Learning Pipeline
title_full_unstemmed Prospection of Peptide Inhibitors of Thrombin from Diverse Origins Using a Machine Learning Pipeline
title_short Prospection of Peptide Inhibitors of Thrombin from Diverse Origins Using a Machine Learning Pipeline
title_sort prospection of peptide inhibitors of thrombin from diverse origins using a machine learning pipeline
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10669389/
https://www.ncbi.nlm.nih.gov/pubmed/38002424
http://dx.doi.org/10.3390/bioengineering10111300
work_keys_str_mv AT balakrishnannivedha prospectionofpeptideinhibitorsofthrombinfromdiverseoriginsusingamachinelearningpipeline
AT katkarrahul prospectionofpeptideinhibitorsofthrombinfromdiverseoriginsusingamachinelearningpipeline
AT phampeterv prospectionofpeptideinhibitorsofthrombinfromdiverseoriginsusingamachinelearningpipeline
AT downeytaylor prospectionofpeptideinhibitorsofthrombinfromdiverseoriginsusingamachinelearningpipeline
AT kashyapprarthna prospectionofpeptideinhibitorsofthrombinfromdiverseoriginsusingamachinelearningpipeline
AT anastasiudavidc prospectionofpeptideinhibitorsofthrombinfromdiverseoriginsusingamachinelearningpipeline
AT ramasubramaniananandk prospectionofpeptideinhibitorsofthrombinfromdiverseoriginsusingamachinelearningpipeline