Cargando…

Ensemble-AHTPpred: A Robust Ensemble Machine Learning Model Integrated With a New Composite Feature for Identifying Antihypertensive Peptides

Hypertension or elevated blood pressure is a serious medical condition that significantly increases the risks of cardiovascular disease, heart disease, diabetes, stroke, kidney disease, and other health problems, that affect people worldwide. Thus, hypertension is one of the major global causes of p...

Descripción completa

Detalles Bibliográficos
Autores principales: Lertampaiporn, Supatcha, Hongsthong, Apiradee, Wattanapornprom, Warin, Thammarongtham, Chinae
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9096110/
https://www.ncbi.nlm.nih.gov/pubmed/35571042
http://dx.doi.org/10.3389/fgene.2022.883766
_version_ 1784705902970929152
author Lertampaiporn, Supatcha
Hongsthong, Apiradee
Wattanapornprom, Warin
Thammarongtham, Chinae
author_facet Lertampaiporn, Supatcha
Hongsthong, Apiradee
Wattanapornprom, Warin
Thammarongtham, Chinae
author_sort Lertampaiporn, Supatcha
collection PubMed
description Hypertension or elevated blood pressure is a serious medical condition that significantly increases the risks of cardiovascular disease, heart disease, diabetes, stroke, kidney disease, and other health problems, that affect people worldwide. Thus, hypertension is one of the major global causes of premature death. Regarding the prevention and treatment of hypertension with no or few side effects, antihypertensive peptides (AHTPs) obtained from natural sources might be useful as nutraceuticals. Therefore, the search for alternative/novel AHTPs in food or natural sources has received much attention, as AHTPs may be functional agents for human health. AHTPs have been observed in diverse organisms, although many of them remain underinvestigated. The identification of peptides with antihypertensive activity in the laboratory is time- and resource-consuming. Alternatively, computational methods based on robust machine learning can identify or screen potential AHTP candidates prior to experimental verification. In this paper, we propose Ensemble-AHTPpred, an ensemble machine learning algorithm composed of a random forest (RF), a support vector machine (SVM), and extreme gradient boosting (XGB), with the aim of integrating diverse heterogeneous algorithms to enhance the robustness of the final predictive model. The selected feature set includes various computed features, such as various physicochemical properties, amino acid compositions (AACs), transitions, n-grams, and secondary structure-related information; these features are able to learn more information in terms of analyzing or explaining the characteristics of the predicted peptide. In addition, the tool is integrated with a newly proposed composite feature (generated based on a logistic regression function) that combines various feature aspects to enable improved AHTP characterization. Our tool, Ensemble-AHTPpred, achieved an overall accuracy above 90% on independent test data. Additionally, the approach was applied to novel experimentally validated AHTPs, obtained from recent studies, which did not overlap with the training and test datasets, and the tool could precisely predict these AHTPs.
format Online
Article
Text
id pubmed-9096110
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-90961102022-05-13 Ensemble-AHTPpred: A Robust Ensemble Machine Learning Model Integrated With a New Composite Feature for Identifying Antihypertensive Peptides Lertampaiporn, Supatcha Hongsthong, Apiradee Wattanapornprom, Warin Thammarongtham, Chinae Front Genet Genetics Hypertension or elevated blood pressure is a serious medical condition that significantly increases the risks of cardiovascular disease, heart disease, diabetes, stroke, kidney disease, and other health problems, that affect people worldwide. Thus, hypertension is one of the major global causes of premature death. Regarding the prevention and treatment of hypertension with no or few side effects, antihypertensive peptides (AHTPs) obtained from natural sources might be useful as nutraceuticals. Therefore, the search for alternative/novel AHTPs in food or natural sources has received much attention, as AHTPs may be functional agents for human health. AHTPs have been observed in diverse organisms, although many of them remain underinvestigated. The identification of peptides with antihypertensive activity in the laboratory is time- and resource-consuming. Alternatively, computational methods based on robust machine learning can identify or screen potential AHTP candidates prior to experimental verification. In this paper, we propose Ensemble-AHTPpred, an ensemble machine learning algorithm composed of a random forest (RF), a support vector machine (SVM), and extreme gradient boosting (XGB), with the aim of integrating diverse heterogeneous algorithms to enhance the robustness of the final predictive model. The selected feature set includes various computed features, such as various physicochemical properties, amino acid compositions (AACs), transitions, n-grams, and secondary structure-related information; these features are able to learn more information in terms of analyzing or explaining the characteristics of the predicted peptide. In addition, the tool is integrated with a newly proposed composite feature (generated based on a logistic regression function) that combines various feature aspects to enable improved AHTP characterization. Our tool, Ensemble-AHTPpred, achieved an overall accuracy above 90% on independent test data. Additionally, the approach was applied to novel experimentally validated AHTPs, obtained from recent studies, which did not overlap with the training and test datasets, and the tool could precisely predict these AHTPs. Frontiers Media S.A. 2022-04-28 /pmc/articles/PMC9096110/ /pubmed/35571042 http://dx.doi.org/10.3389/fgene.2022.883766 Text en Copyright © 2022 Lertampaiporn, Hongsthong, Wattanapornprom and Thammarongtham. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Lertampaiporn, Supatcha
Hongsthong, Apiradee
Wattanapornprom, Warin
Thammarongtham, Chinae
Ensemble-AHTPpred: A Robust Ensemble Machine Learning Model Integrated With a New Composite Feature for Identifying Antihypertensive Peptides
title Ensemble-AHTPpred: A Robust Ensemble Machine Learning Model Integrated With a New Composite Feature for Identifying Antihypertensive Peptides
title_full Ensemble-AHTPpred: A Robust Ensemble Machine Learning Model Integrated With a New Composite Feature for Identifying Antihypertensive Peptides
title_fullStr Ensemble-AHTPpred: A Robust Ensemble Machine Learning Model Integrated With a New Composite Feature for Identifying Antihypertensive Peptides
title_full_unstemmed Ensemble-AHTPpred: A Robust Ensemble Machine Learning Model Integrated With a New Composite Feature for Identifying Antihypertensive Peptides
title_short Ensemble-AHTPpred: A Robust Ensemble Machine Learning Model Integrated With a New Composite Feature for Identifying Antihypertensive Peptides
title_sort ensemble-ahtppred: a robust ensemble machine learning model integrated with a new composite feature for identifying antihypertensive peptides
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9096110/
https://www.ncbi.nlm.nih.gov/pubmed/35571042
http://dx.doi.org/10.3389/fgene.2022.883766
work_keys_str_mv AT lertampaipornsupatcha ensembleahtppredarobustensemblemachinelearningmodelintegratedwithanewcompositefeatureforidentifyingantihypertensivepeptides
AT hongsthongapiradee ensembleahtppredarobustensemblemachinelearningmodelintegratedwithanewcompositefeatureforidentifyingantihypertensivepeptides
AT wattanapornpromwarin ensembleahtppredarobustensemblemachinelearningmodelintegratedwithanewcompositefeatureforidentifyingantihypertensivepeptides
AT thammarongthamchinae ensembleahtppredarobustensemblemachinelearningmodelintegratedwithanewcompositefeatureforidentifyingantihypertensivepeptides