Cargando…

Research progress of reduced amino acid alphabets in protein analysis and prediction

Proteins are the executors of cellular physiological activities, and accurate structural and function elucidation are crucial for the refined mapping of proteins. As a feature engineering method, the reduction of amino acid composition is not only an important method for protein structure and functi...

Descripción completa

Detalles Bibliográficos
Autores principales: Liang, Yuchao, Yang, Siqi, Zheng, Lei, Wang, Hao, Zhou, Jian, Huang, Shenghui, Yang, Lei, Zuo, Yongchun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Research Network of Computational and Structural Biotechnology 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9284397/
https://www.ncbi.nlm.nih.gov/pubmed/35860409
http://dx.doi.org/10.1016/j.csbj.2022.07.001
_version_ 1784747554089467904
author Liang, Yuchao
Yang, Siqi
Zheng, Lei
Wang, Hao
Zhou, Jian
Huang, Shenghui
Yang, Lei
Zuo, Yongchun
author_facet Liang, Yuchao
Yang, Siqi
Zheng, Lei
Wang, Hao
Zhou, Jian
Huang, Shenghui
Yang, Lei
Zuo, Yongchun
author_sort Liang, Yuchao
collection PubMed
description Proteins are the executors of cellular physiological activities, and accurate structural and function elucidation are crucial for the refined mapping of proteins. As a feature engineering method, the reduction of amino acid composition is not only an important method for protein structure and function analysis, but also opens a broad horizon for the complex field of machine learning. Representing sequences with fewer amino acid types greatly reduces the complexity and noise of traditional feature engineering in dimension, and provides more interpretable predictive models for machine learning to capture key features. In this paper, we systematically reviewed the strategy and method studies of the reduced amino acid (RAA) alphabets, and summarized its main research in protein sequence alignment, functional classification, and prediction of structural properties, respectively. In the end, we gave a comprehensive analysis of 672 RAA alphabets from 74 reduction methods.
format Online
Article
Text
id pubmed-9284397
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Research Network of Computational and Structural Biotechnology
record_format MEDLINE/PubMed
spelling pubmed-92843972022-07-19 Research progress of reduced amino acid alphabets in protein analysis and prediction Liang, Yuchao Yang, Siqi Zheng, Lei Wang, Hao Zhou, Jian Huang, Shenghui Yang, Lei Zuo, Yongchun Comput Struct Biotechnol J Mini Review Proteins are the executors of cellular physiological activities, and accurate structural and function elucidation are crucial for the refined mapping of proteins. As a feature engineering method, the reduction of amino acid composition is not only an important method for protein structure and function analysis, but also opens a broad horizon for the complex field of machine learning. Representing sequences with fewer amino acid types greatly reduces the complexity and noise of traditional feature engineering in dimension, and provides more interpretable predictive models for machine learning to capture key features. In this paper, we systematically reviewed the strategy and method studies of the reduced amino acid (RAA) alphabets, and summarized its main research in protein sequence alignment, functional classification, and prediction of structural properties, respectively. In the end, we gave a comprehensive analysis of 672 RAA alphabets from 74 reduction methods. Research Network of Computational and Structural Biotechnology 2022-07-04 /pmc/articles/PMC9284397/ /pubmed/35860409 http://dx.doi.org/10.1016/j.csbj.2022.07.001 Text en © 2022 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Mini Review
Liang, Yuchao
Yang, Siqi
Zheng, Lei
Wang, Hao
Zhou, Jian
Huang, Shenghui
Yang, Lei
Zuo, Yongchun
Research progress of reduced amino acid alphabets in protein analysis and prediction
title Research progress of reduced amino acid alphabets in protein analysis and prediction
title_full Research progress of reduced amino acid alphabets in protein analysis and prediction
title_fullStr Research progress of reduced amino acid alphabets in protein analysis and prediction
title_full_unstemmed Research progress of reduced amino acid alphabets in protein analysis and prediction
title_short Research progress of reduced amino acid alphabets in protein analysis and prediction
title_sort research progress of reduced amino acid alphabets in protein analysis and prediction
topic Mini Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9284397/
https://www.ncbi.nlm.nih.gov/pubmed/35860409
http://dx.doi.org/10.1016/j.csbj.2022.07.001
work_keys_str_mv AT liangyuchao researchprogressofreducedaminoacidalphabetsinproteinanalysisandprediction
AT yangsiqi researchprogressofreducedaminoacidalphabetsinproteinanalysisandprediction
AT zhenglei researchprogressofreducedaminoacidalphabetsinproteinanalysisandprediction
AT wanghao researchprogressofreducedaminoacidalphabetsinproteinanalysisandprediction
AT zhoujian researchprogressofreducedaminoacidalphabetsinproteinanalysisandprediction
AT huangshenghui researchprogressofreducedaminoacidalphabetsinproteinanalysisandprediction
AT yanglei researchprogressofreducedaminoacidalphabetsinproteinanalysisandprediction
AT zuoyongchun researchprogressofreducedaminoacidalphabetsinproteinanalysisandprediction