Cargando…

iAIPs: Identifying Anti-Inflammatory Peptides Using Random Forest

Recently, several anti-inflammatory peptides (AIPs) have been found in the process of the inflammatory response, and these peptides have been used to treat some inflammatory and autoimmune diseases. Therefore, identifying AIPs accurately from a given amino acid sequences is critical for the discover...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhao, Dongxu, Teng, Zhixia, Li, Yanjuan, Chen, Dong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8669811/
https://www.ncbi.nlm.nih.gov/pubmed/34917130
http://dx.doi.org/10.3389/fgene.2021.773202
_version_ 1784614853899452416
author Zhao, Dongxu
Teng, Zhixia
Li, Yanjuan
Chen, Dong
author_facet Zhao, Dongxu
Teng, Zhixia
Li, Yanjuan
Chen, Dong
author_sort Zhao, Dongxu
collection PubMed
description Recently, several anti-inflammatory peptides (AIPs) have been found in the process of the inflammatory response, and these peptides have been used to treat some inflammatory and autoimmune diseases. Therefore, identifying AIPs accurately from a given amino acid sequences is critical for the discovery of novel and efficient anti-inflammatory peptide-based therapeutics and the acceleration of their application in therapy. In this paper, a random forest-based model called iAIPs for identifying AIPs is proposed. First, the original samples were encoded with three feature extraction methods, including g-gap dipeptide composition (GDC), dipeptide deviation from the expected mean (DDE), and amino acid composition (AAC). Second, the optimal feature subset is generated by a two-step feature selection method, in which the feature is ranked by the analysis of variance (ANOVA) method, and the optimal feature subset is generated by the incremental feature selection strategy. Finally, the optimal feature subset is inputted into the random forest classifier, and the identification model is constructed. Experiment results showed that iAIPs achieved an AUC value of 0.822 on an independent test dataset, which indicated that our proposed model has better performance than the existing methods. Furthermore, the extraction of features for peptide sequences provides the basis for evolutionary analysis. The study of peptide identification is helpful to understand the diversity of species and analyze the evolutionary history of species.
format Online
Article
Text
id pubmed-8669811
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-86698112021-12-15 iAIPs: Identifying Anti-Inflammatory Peptides Using Random Forest Zhao, Dongxu Teng, Zhixia Li, Yanjuan Chen, Dong Front Genet Genetics Recently, several anti-inflammatory peptides (AIPs) have been found in the process of the inflammatory response, and these peptides have been used to treat some inflammatory and autoimmune diseases. Therefore, identifying AIPs accurately from a given amino acid sequences is critical for the discovery of novel and efficient anti-inflammatory peptide-based therapeutics and the acceleration of their application in therapy. In this paper, a random forest-based model called iAIPs for identifying AIPs is proposed. First, the original samples were encoded with three feature extraction methods, including g-gap dipeptide composition (GDC), dipeptide deviation from the expected mean (DDE), and amino acid composition (AAC). Second, the optimal feature subset is generated by a two-step feature selection method, in which the feature is ranked by the analysis of variance (ANOVA) method, and the optimal feature subset is generated by the incremental feature selection strategy. Finally, the optimal feature subset is inputted into the random forest classifier, and the identification model is constructed. Experiment results showed that iAIPs achieved an AUC value of 0.822 on an independent test dataset, which indicated that our proposed model has better performance than the existing methods. Furthermore, the extraction of features for peptide sequences provides the basis for evolutionary analysis. The study of peptide identification is helpful to understand the diversity of species and analyze the evolutionary history of species. Frontiers Media S.A. 2021-11-30 /pmc/articles/PMC8669811/ /pubmed/34917130 http://dx.doi.org/10.3389/fgene.2021.773202 Text en Copyright © 2021 Zhao, Teng, Li and Chen. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Zhao, Dongxu
Teng, Zhixia
Li, Yanjuan
Chen, Dong
iAIPs: Identifying Anti-Inflammatory Peptides Using Random Forest
title iAIPs: Identifying Anti-Inflammatory Peptides Using Random Forest
title_full iAIPs: Identifying Anti-Inflammatory Peptides Using Random Forest
title_fullStr iAIPs: Identifying Anti-Inflammatory Peptides Using Random Forest
title_full_unstemmed iAIPs: Identifying Anti-Inflammatory Peptides Using Random Forest
title_short iAIPs: Identifying Anti-Inflammatory Peptides Using Random Forest
title_sort iaips: identifying anti-inflammatory peptides using random forest
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8669811/
https://www.ncbi.nlm.nih.gov/pubmed/34917130
http://dx.doi.org/10.3389/fgene.2021.773202
work_keys_str_mv AT zhaodongxu iaipsidentifyingantiinflammatorypeptidesusingrandomforest
AT tengzhixia iaipsidentifyingantiinflammatorypeptidesusingrandomforest
AT liyanjuan iaipsidentifyingantiinflammatorypeptidesusingrandomforest
AT chendong iaipsidentifyingantiinflammatorypeptidesusingrandomforest