The sequence of amino acids as the basis for the model of biological activity of peptides
An algorithm is proposed for building a model of the biological activity of peptides as a mathematical function of their amino acid sequence. The general scheme is as follows: the total set of available data is distributed into an active training set, a passive training set, a calibration set,...
Main authors: | Toropova, Alla P.; Raškova, Maria; Raška Jr., Ivan; Toropov, Andrey A. |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Springer Berlin Heidelberg 2021 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7820519/ https://www.ncbi.nlm.nih.gov/pubmed/33500680 http://dx.doi.org/10.1007/s00214-020-02707-8 |
_version_ | 1783639233043365888 |
---|---|
author | Toropova, Alla P. Raškova, Maria Raška Jr., Ivan Toropov, Andrey A. |
author_facet | Toropova, Alla P. Raškova, Maria Raška Jr., Ivan Toropov, Andrey A. |
author_sort | Toropova, Alla P. |
collection | PubMed |
description | An algorithm is proposed for building a model of the biological activity of peptides as a mathematical function of their amino acid sequence. The general scheme is as follows: the total set of available data is distributed into an active training set, a passive training set, a calibration set, and a validation set. The training sets (both active and passive) and the calibration set together form the system that generates the model of biological activity, in which each amino acid is assigned its own correlation weight. The numerical values of the correlation weights are calculated by the Monte Carlo method using the CORAL software (http://www.insilico.eu/coral). The target function is designed to give the best result for the calibration set (not for the training set). The final check of the model is carried out on the validation set (peptides that are not visible while the model is being built). The computational experiments described confirm that the approach can serve as a tool for designing predictive models of the biological activity of peptides (expressed as pIC50). |
format | Online Article Text |
id | pubmed-7820519 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Springer Berlin Heidelberg |
record_format | MEDLINE/PubMed |
spelling | pubmed-7820519 2021-01-22 Theor Chem Acc Regular Article Springer Berlin Heidelberg 2021-01-22 2021 /pmc/articles/PMC7820519/ /pubmed/33500680 http://dx.doi.org/10.1007/s00214-020-02707-8 Text en © The Author(s), under exclusive licence to Springer-Verlag GmbH, DE part of Springer Nature 2021. This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
title | The sequence of amino acids as the basis for the model of biological activity of peptides |
topic | Regular Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7820519/ https://www.ncbi.nlm.nih.gov/pubmed/33500680 http://dx.doi.org/10.1007/s00214-020-02707-8 |
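
The scheme summarised in the description above can be illustrated with a minimal Python sketch. It is not the CORAL implementation: the descriptor (a plain sum of per-residue correlation weights), the linear model, the acceptance rule (keep a random weight change only if the calibration-set RMSE drops), the merging of the active and passive training sets for the fit, and every function name are assumptions made for illustration. The peptides and pIC50 values are synthetic toy data, not data from the article.

```python
import random

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

def dcw(seq, cw):
    """Descriptor (assumed form): sum of the correlation weights of the residues."""
    return sum(cw[aa] for aa in seq)

def fit_model(train, cw):
    """Least-squares fit pIC50 = c0 + c1 * DCW on the training data."""
    xs = [dcw(s, cw) for s, _ in train]
    ys = [y for _, y in train]
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    if sxx == 0:
        return my, 0.0
    c1 = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sxx
    return my - c1 * mx, c1

def rmse(data, cw, c0, c1):
    """Root-mean-square error of the linear model on a data set."""
    return (sum((c0 + c1 * dcw(s, cw) - y) ** 2 for s, y in data) / len(data)) ** 0.5

def make_toy_data(n, seed=0):
    """Synthetic peptides with pIC50 generated from hidden weights plus noise."""
    rng = random.Random(seed)
    true_cw = {aa: rng.uniform(-0.5, 0.5) for aa in AMINO_ACIDS}
    data = []
    for _ in range(n):
        seq = "".join(rng.choice(AMINO_ACIDS) for _ in range(rng.randint(4, 10)))
        data.append((seq, 5.0 + dcw(seq, true_cw) + rng.gauss(0.0, 0.1)))
    return data

def split(data, rng):
    """Shuffle and split into active training, passive training, calibration, validation."""
    data = data[:]
    rng.shuffle(data)
    q = len(data) // 4
    return data[:q], data[q:2 * q], data[2 * q:3 * q], data[3 * q:]

def optimise(train, calib, steps=2000, seed=0):
    """Monte Carlo search: perturb one weight at a time, judge by the calibration set."""
    rng = random.Random(seed)
    cw = {aa: rng.uniform(-1.0, 1.0) for aa in AMINO_ACIDS}
    c0, c1 = fit_model(train, cw)
    best = rmse(calib, cw, c0, c1)
    for _ in range(steps):
        aa = rng.choice(AMINO_ACIDS)
        old = cw[aa]
        cw[aa] = old + rng.gauss(0.0, 0.3)
        t0, t1 = fit_model(train, cw)
        score = rmse(calib, cw, t0, t1)
        if score < best:
            best, c0, c1 = score, t0, t1  # keep the improved weight
        else:
            cw[aa] = old                  # revert the change
    return cw, c0, c1, best

if __name__ == "__main__":
    data = make_toy_data(80, seed=1)
    active, passive, calib, valid = split(data, random.Random(2))
    cw, c0, c1, best = optimise(active + passive, calib, steps=2000, seed=3)
    print("calibration RMSE:", round(best, 3))
    print("validation RMSE:", round(rmse(valid, cw, c0, c1), 3))
```

The validation set is touched only once, after the optimisation has finished, which mirrors the role the description gives it as the final check on peptides the model has not seen.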