Cargando…
Identifying Antioxidant Proteins by Using Amino Acid Composition and Protein-Protein Interactions
Excessive oxidative stress responses can threaten our health, and thus it is essential to produce antioxidant proteins to regulate the body’s oxidative responses. The low number of antioxidant proteins makes it difficult to extract their representative features. Our experimental method did not use s...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7658297/ https://www.ncbi.nlm.nih.gov/pubmed/33195258 http://dx.doi.org/10.3389/fcell.2020.591487 |
_version_ | 1783608638675353600 |
---|---|
author | Zhai, Yixiao Chen, Yu Teng, Zhixia Zhao, Yuming |
author_facet | Zhai, Yixiao Chen, Yu Teng, Zhixia Zhao, Yuming |
author_sort | Zhai, Yixiao |
collection | PubMed |
description | Excessive oxidative stress responses can threaten our health, and thus it is essential to produce antioxidant proteins to regulate the body’s oxidative responses. The low number of antioxidant proteins makes it difficult to extract their representative features. Our experimental method did not use structural information but instead studied antioxidant proteins from a sequenced perspective while focusing on the impact of data imbalance on sensitivity, thus greatly improving the model’s sensitivity for antioxidant protein recognition. We developed a method based on the Composition of k-spaced Amino Acid Pairs (CKSAAP) and the Conjoint Triad (CT) features derived from the amino acid composition and protein-protein interactions. SMOTE and the Max-Relevance-Max-Distance algorithm (MRMD) were utilized to unbalance the training data and select the optimal feature subset, respectively. The test set used 10-fold crossing validation and a random forest algorithm for classification according to the selected feature subset. The sensitivity was 0.792, the specificity was 0.808, and the average accuracy was 0.8. |
format | Online Article Text |
id | pubmed-7658297 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-76582972020-11-13 Identifying Antioxidant Proteins by Using Amino Acid Composition and Protein-Protein Interactions Zhai, Yixiao Chen, Yu Teng, Zhixia Zhao, Yuming Front Cell Dev Biol Cell and Developmental Biology Excessive oxidative stress responses can threaten our health, and thus it is essential to produce antioxidant proteins to regulate the body’s oxidative responses. The low number of antioxidant proteins makes it difficult to extract their representative features. Our experimental method did not use structural information but instead studied antioxidant proteins from a sequenced perspective while focusing on the impact of data imbalance on sensitivity, thus greatly improving the model’s sensitivity for antioxidant protein recognition. We developed a method based on the Composition of k-spaced Amino Acid Pairs (CKSAAP) and the Conjoint Triad (CT) features derived from the amino acid composition and protein-protein interactions. SMOTE and the Max-Relevance-Max-Distance algorithm (MRMD) were utilized to unbalance the training data and select the optimal feature subset, respectively. The test set used 10-fold crossing validation and a random forest algorithm for classification according to the selected feature subset. The sensitivity was 0.792, the specificity was 0.808, and the average accuracy was 0.8. Frontiers Media S.A. 2020-10-29 /pmc/articles/PMC7658297/ /pubmed/33195258 http://dx.doi.org/10.3389/fcell.2020.591487 Text en Copyright © 2020 Zhai, Chen, Teng and Zhao. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Cell and Developmental Biology Zhai, Yixiao Chen, Yu Teng, Zhixia Zhao, Yuming Identifying Antioxidant Proteins by Using Amino Acid Composition and Protein-Protein Interactions |
title | Identifying Antioxidant Proteins by Using Amino Acid Composition and Protein-Protein Interactions |
title_full | Identifying Antioxidant Proteins by Using Amino Acid Composition and Protein-Protein Interactions |
title_fullStr | Identifying Antioxidant Proteins by Using Amino Acid Composition and Protein-Protein Interactions |
title_full_unstemmed | Identifying Antioxidant Proteins by Using Amino Acid Composition and Protein-Protein Interactions |
title_short | Identifying Antioxidant Proteins by Using Amino Acid Composition and Protein-Protein Interactions |
title_sort | identifying antioxidant proteins by using amino acid composition and protein-protein interactions |
topic | Cell and Developmental Biology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7658297/ https://www.ncbi.nlm.nih.gov/pubmed/33195258 http://dx.doi.org/10.3389/fcell.2020.591487 |
work_keys_str_mv | AT zhaiyixiao identifyingantioxidantproteinsbyusingaminoacidcompositionandproteinproteininteractions AT chenyu identifyingantioxidantproteinsbyusingaminoacidcompositionandproteinproteininteractions AT tengzhixia identifyingantioxidantproteinsbyusingaminoacidcompositionandproteinproteininteractions AT zhaoyuming identifyingantioxidantproteinsbyusingaminoacidcompositionandproteinproteininteractions |