Cargando…

Computational prediction of plasma protein binding of cyclic peptides from small molecule experimental data using sparse modeling techniques

BACKGROUND: Cyclic peptide-based drug discovery is attracting increasing interest owing to its potential to avoid target protein depletion. In drug discovery, it is important to maintain the biostability of a drug within the proper range. Plasma protein binding (PPB) is the most important index of b...

Descripción completa

Detalles Bibliográficos
Autores principales: Tajimi, Takashi, Wakui, Naoki, Yanagisawa, Keisuke, Yoshikawa, Yasushi, Ohue, Masahito, Akiyama, Yutaka
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6311893/
https://www.ncbi.nlm.nih.gov/pubmed/30598072
http://dx.doi.org/10.1186/s12859-018-2529-z
_version_ 1783383694490206208
author Tajimi, Takashi
Wakui, Naoki
Yanagisawa, Keisuke
Yoshikawa, Yasushi
Ohue, Masahito
Akiyama, Yutaka
author_facet Tajimi, Takashi
Wakui, Naoki
Yanagisawa, Keisuke
Yoshikawa, Yasushi
Ohue, Masahito
Akiyama, Yutaka
author_sort Tajimi, Takashi
collection PubMed
description BACKGROUND: Cyclic peptide-based drug discovery is attracting increasing interest owing to its potential to avoid target protein depletion. In drug discovery, it is important to maintain the biostability of a drug within the proper range. Plasma protein binding (PPB) is the most important index of biostability, and developing a computational method to predict PPB of drug candidate compounds contributes to the acceleration of drug discovery research. PPB prediction of small molecule drug compounds using machine learning has been conducted thus far; however, no study has investigated cyclic peptides because experimental information of cyclic peptides is scarce. RESULTS: First, we adopted sparse modeling and small molecule information to construct a PPB prediction model for cyclic peptides. As cyclic peptide data are limited, applying multidimensional nonlinear models involves concerns regarding overfitting. However, models constructed by sparse modeling can avoid overfitting, offering high generalization performance and interpretability. More than 1000 PPB data of small molecules are available, and we used them to construct a prediction models with two enumeration methods: enumerating lasso solutions (ELS) and forward beam search (FBS). The accuracies of the prediction models constructed by ELS and FBS were equal to or better than those of conventional non-linear models (MAE = 0.167–0.174) on cross-validation of a small molecule compound dataset. Moreover, we showed that the prediction accuracies for cyclic peptides were close to those for small molecule compounds (MAE = 0.194–0.288). Such high accuracy could not be obtained by a simple method of learning from cyclic peptide data directly by lasso regression (MAE = 0.286–0.671) or ridge regression (MAE = 0.244–0.354). CONCLUSION: In this study, we proposed a machine learning techniques that uses low-dimensional sparse modeling to predict the PPB value of cyclic peptides computationally. The low-dimensional sparse model not only exhibits excellent generalization performance but also improves interpretation of the prediction model. This can provide common an noteworthy knowledge for future cyclic peptide drug discovery studies. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2529-z) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6311893
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-63118932019-01-07 Computational prediction of plasma protein binding of cyclic peptides from small molecule experimental data using sparse modeling techniques Tajimi, Takashi Wakui, Naoki Yanagisawa, Keisuke Yoshikawa, Yasushi Ohue, Masahito Akiyama, Yutaka BMC Bioinformatics Research BACKGROUND: Cyclic peptide-based drug discovery is attracting increasing interest owing to its potential to avoid target protein depletion. In drug discovery, it is important to maintain the biostability of a drug within the proper range. Plasma protein binding (PPB) is the most important index of biostability, and developing a computational method to predict PPB of drug candidate compounds contributes to the acceleration of drug discovery research. PPB prediction of small molecule drug compounds using machine learning has been conducted thus far; however, no study has investigated cyclic peptides because experimental information of cyclic peptides is scarce. RESULTS: First, we adopted sparse modeling and small molecule information to construct a PPB prediction model for cyclic peptides. As cyclic peptide data are limited, applying multidimensional nonlinear models involves concerns regarding overfitting. However, models constructed by sparse modeling can avoid overfitting, offering high generalization performance and interpretability. More than 1000 PPB data of small molecules are available, and we used them to construct a prediction models with two enumeration methods: enumerating lasso solutions (ELS) and forward beam search (FBS). The accuracies of the prediction models constructed by ELS and FBS were equal to or better than those of conventional non-linear models (MAE = 0.167–0.174) on cross-validation of a small molecule compound dataset. Moreover, we showed that the prediction accuracies for cyclic peptides were close to those for small molecule compounds (MAE = 0.194–0.288). Such high accuracy could not be obtained by a simple method of learning from cyclic peptide data directly by lasso regression (MAE = 0.286–0.671) or ridge regression (MAE = 0.244–0.354). CONCLUSION: In this study, we proposed a machine learning techniques that uses low-dimensional sparse modeling to predict the PPB value of cyclic peptides computationally. The low-dimensional sparse model not only exhibits excellent generalization performance but also improves interpretation of the prediction model. This can provide common an noteworthy knowledge for future cyclic peptide drug discovery studies. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2529-z) contains supplementary material, which is available to authorized users. BioMed Central 2018-12-31 /pmc/articles/PMC6311893/ /pubmed/30598072 http://dx.doi.org/10.1186/s12859-018-2529-z Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Tajimi, Takashi
Wakui, Naoki
Yanagisawa, Keisuke
Yoshikawa, Yasushi
Ohue, Masahito
Akiyama, Yutaka
Computational prediction of plasma protein binding of cyclic peptides from small molecule experimental data using sparse modeling techniques
title Computational prediction of plasma protein binding of cyclic peptides from small molecule experimental data using sparse modeling techniques
title_full Computational prediction of plasma protein binding of cyclic peptides from small molecule experimental data using sparse modeling techniques
title_fullStr Computational prediction of plasma protein binding of cyclic peptides from small molecule experimental data using sparse modeling techniques
title_full_unstemmed Computational prediction of plasma protein binding of cyclic peptides from small molecule experimental data using sparse modeling techniques
title_short Computational prediction of plasma protein binding of cyclic peptides from small molecule experimental data using sparse modeling techniques
title_sort computational prediction of plasma protein binding of cyclic peptides from small molecule experimental data using sparse modeling techniques
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6311893/
https://www.ncbi.nlm.nih.gov/pubmed/30598072
http://dx.doi.org/10.1186/s12859-018-2529-z
work_keys_str_mv AT tajimitakashi computationalpredictionofplasmaproteinbindingofcyclicpeptidesfromsmallmoleculeexperimentaldatausingsparsemodelingtechniques
AT wakuinaoki computationalpredictionofplasmaproteinbindingofcyclicpeptidesfromsmallmoleculeexperimentaldatausingsparsemodelingtechniques
AT yanagisawakeisuke computationalpredictionofplasmaproteinbindingofcyclicpeptidesfromsmallmoleculeexperimentaldatausingsparsemodelingtechniques
AT yoshikawayasushi computationalpredictionofplasmaproteinbindingofcyclicpeptidesfromsmallmoleculeexperimentaldatausingsparsemodelingtechniques
AT ohuemasahito computationalpredictionofplasmaproteinbindingofcyclicpeptidesfromsmallmoleculeexperimentaldatausingsparsemodelingtechniques
AT akiyamayutaka computationalpredictionofplasmaproteinbindingofcyclicpeptidesfromsmallmoleculeexperimentaldatausingsparsemodelingtechniques