Cargando…

Identifying Antioxidant Proteins by Using Amino Acid Composition and Protein-Protein Interactions

Excessive oxidative stress responses can threaten our health, and thus it is essential to produce antioxidant proteins to regulate the body’s oxidative responses. The low number of antioxidant proteins makes it difficult to extract their representative features. Our experimental method did not use s...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhai, Yixiao, Chen, Yu, Teng, Zhixia, Zhao, Yuming
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7658297/
https://www.ncbi.nlm.nih.gov/pubmed/33195258
http://dx.doi.org/10.3389/fcell.2020.591487
_version_ 1783608638675353600
author Zhai, Yixiao
Chen, Yu
Teng, Zhixia
Zhao, Yuming
author_facet Zhai, Yixiao
Chen, Yu
Teng, Zhixia
Zhao, Yuming
author_sort Zhai, Yixiao
collection PubMed
description Excessive oxidative stress responses can threaten our health, and thus it is essential to produce antioxidant proteins to regulate the body’s oxidative responses. The low number of antioxidant proteins makes it difficult to extract their representative features. Our experimental method did not use structural information but instead studied antioxidant proteins from a sequenced perspective while focusing on the impact of data imbalance on sensitivity, thus greatly improving the model’s sensitivity for antioxidant protein recognition. We developed a method based on the Composition of k-spaced Amino Acid Pairs (CKSAAP) and the Conjoint Triad (CT) features derived from the amino acid composition and protein-protein interactions. SMOTE and the Max-Relevance-Max-Distance algorithm (MRMD) were utilized to unbalance the training data and select the optimal feature subset, respectively. The test set used 10-fold crossing validation and a random forest algorithm for classification according to the selected feature subset. The sensitivity was 0.792, the specificity was 0.808, and the average accuracy was 0.8.
format Online
Article
Text
id pubmed-7658297
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-76582972020-11-13 Identifying Antioxidant Proteins by Using Amino Acid Composition and Protein-Protein Interactions Zhai, Yixiao Chen, Yu Teng, Zhixia Zhao, Yuming Front Cell Dev Biol Cell and Developmental Biology Excessive oxidative stress responses can threaten our health, and thus it is essential to produce antioxidant proteins to regulate the body’s oxidative responses. The low number of antioxidant proteins makes it difficult to extract their representative features. Our experimental method did not use structural information but instead studied antioxidant proteins from a sequenced perspective while focusing on the impact of data imbalance on sensitivity, thus greatly improving the model’s sensitivity for antioxidant protein recognition. We developed a method based on the Composition of k-spaced Amino Acid Pairs (CKSAAP) and the Conjoint Triad (CT) features derived from the amino acid composition and protein-protein interactions. SMOTE and the Max-Relevance-Max-Distance algorithm (MRMD) were utilized to unbalance the training data and select the optimal feature subset, respectively. The test set used 10-fold crossing validation and a random forest algorithm for classification according to the selected feature subset. The sensitivity was 0.792, the specificity was 0.808, and the average accuracy was 0.8. Frontiers Media S.A. 2020-10-29 /pmc/articles/PMC7658297/ /pubmed/33195258 http://dx.doi.org/10.3389/fcell.2020.591487 Text en Copyright © 2020 Zhai, Chen, Teng and Zhao. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Cell and Developmental Biology
Zhai, Yixiao
Chen, Yu
Teng, Zhixia
Zhao, Yuming
Identifying Antioxidant Proteins by Using Amino Acid Composition and Protein-Protein Interactions
title Identifying Antioxidant Proteins by Using Amino Acid Composition and Protein-Protein Interactions
title_full Identifying Antioxidant Proteins by Using Amino Acid Composition and Protein-Protein Interactions
title_fullStr Identifying Antioxidant Proteins by Using Amino Acid Composition and Protein-Protein Interactions
title_full_unstemmed Identifying Antioxidant Proteins by Using Amino Acid Composition and Protein-Protein Interactions
title_short Identifying Antioxidant Proteins by Using Amino Acid Composition and Protein-Protein Interactions
title_sort identifying antioxidant proteins by using amino acid composition and protein-protein interactions
topic Cell and Developmental Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7658297/
https://www.ncbi.nlm.nih.gov/pubmed/33195258
http://dx.doi.org/10.3389/fcell.2020.591487
work_keys_str_mv AT zhaiyixiao identifyingantioxidantproteinsbyusingaminoacidcompositionandproteinproteininteractions
AT chenyu identifyingantioxidantproteinsbyusingaminoacidcompositionandproteinproteininteractions
AT tengzhixia identifyingantioxidantproteinsbyusingaminoacidcompositionandproteinproteininteractions
AT zhaoyuming identifyingantioxidantproteinsbyusingaminoacidcompositionandproteinproteininteractions