Cargando…
Research progress of reduced amino acid alphabets in protein analysis and prediction
Proteins are the executors of cellular physiological activities, and accurate structural and function elucidation are crucial for the refined mapping of proteins. As a feature engineering method, the reduction of amino acid composition is not only an important method for protein structure and functi...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Research Network of Computational and Structural Biotechnology
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9284397/ https://www.ncbi.nlm.nih.gov/pubmed/35860409 http://dx.doi.org/10.1016/j.csbj.2022.07.001 |
_version_ | 1784747554089467904 |
---|---|
author | Liang, Yuchao Yang, Siqi Zheng, Lei Wang, Hao Zhou, Jian Huang, Shenghui Yang, Lei Zuo, Yongchun |
author_facet | Liang, Yuchao Yang, Siqi Zheng, Lei Wang, Hao Zhou, Jian Huang, Shenghui Yang, Lei Zuo, Yongchun |
author_sort | Liang, Yuchao |
collection | PubMed |
description | Proteins are the executors of cellular physiological activities, and accurate structural and function elucidation are crucial for the refined mapping of proteins. As a feature engineering method, the reduction of amino acid composition is not only an important method for protein structure and function analysis, but also opens a broad horizon for the complex field of machine learning. Representing sequences with fewer amino acid types greatly reduces the complexity and noise of traditional feature engineering in dimension, and provides more interpretable predictive models for machine learning to capture key features. In this paper, we systematically reviewed the strategy and method studies of the reduced amino acid (RAA) alphabets, and summarized its main research in protein sequence alignment, functional classification, and prediction of structural properties, respectively. In the end, we gave a comprehensive analysis of 672 RAA alphabets from 74 reduction methods. |
format | Online Article Text |
id | pubmed-9284397 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Research Network of Computational and Structural Biotechnology |
record_format | MEDLINE/PubMed |
spelling | pubmed-92843972022-07-19 Research progress of reduced amino acid alphabets in protein analysis and prediction Liang, Yuchao Yang, Siqi Zheng, Lei Wang, Hao Zhou, Jian Huang, Shenghui Yang, Lei Zuo, Yongchun Comput Struct Biotechnol J Mini Review Proteins are the executors of cellular physiological activities, and accurate structural and function elucidation are crucial for the refined mapping of proteins. As a feature engineering method, the reduction of amino acid composition is not only an important method for protein structure and function analysis, but also opens a broad horizon for the complex field of machine learning. Representing sequences with fewer amino acid types greatly reduces the complexity and noise of traditional feature engineering in dimension, and provides more interpretable predictive models for machine learning to capture key features. In this paper, we systematically reviewed the strategy and method studies of the reduced amino acid (RAA) alphabets, and summarized its main research in protein sequence alignment, functional classification, and prediction of structural properties, respectively. In the end, we gave a comprehensive analysis of 672 RAA alphabets from 74 reduction methods. Research Network of Computational and Structural Biotechnology 2022-07-04 /pmc/articles/PMC9284397/ /pubmed/35860409 http://dx.doi.org/10.1016/j.csbj.2022.07.001 Text en © 2022 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Mini Review Liang, Yuchao Yang, Siqi Zheng, Lei Wang, Hao Zhou, Jian Huang, Shenghui Yang, Lei Zuo, Yongchun Research progress of reduced amino acid alphabets in protein analysis and prediction |
title | Research progress of reduced amino acid alphabets in protein analysis and prediction |
title_full | Research progress of reduced amino acid alphabets in protein analysis and prediction |
title_fullStr | Research progress of reduced amino acid alphabets in protein analysis and prediction |
title_full_unstemmed | Research progress of reduced amino acid alphabets in protein analysis and prediction |
title_short | Research progress of reduced amino acid alphabets in protein analysis and prediction |
title_sort | research progress of reduced amino acid alphabets in protein analysis and prediction |
topic | Mini Review |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9284397/ https://www.ncbi.nlm.nih.gov/pubmed/35860409 http://dx.doi.org/10.1016/j.csbj.2022.07.001 |
work_keys_str_mv | AT liangyuchao researchprogressofreducedaminoacidalphabetsinproteinanalysisandprediction AT yangsiqi researchprogressofreducedaminoacidalphabetsinproteinanalysisandprediction AT zhenglei researchprogressofreducedaminoacidalphabetsinproteinanalysisandprediction AT wanghao researchprogressofreducedaminoacidalphabetsinproteinanalysisandprediction AT zhoujian researchprogressofreducedaminoacidalphabetsinproteinanalysisandprediction AT huangshenghui researchprogressofreducedaminoacidalphabetsinproteinanalysisandprediction AT yanglei researchprogressofreducedaminoacidalphabetsinproteinanalysisandprediction AT zuoyongchun researchprogressofreducedaminoacidalphabetsinproteinanalysisandprediction |