Cargando…

Prediction and analysis of antifreeze proteins

Antifreeze proteins (AFPs) are proteins that protect cellular fluids and body fluids from freezing by inhibiting the nucleation and growth of ice crystals and preventing ice recrystallization, thereby contributing to the maintenance of life in living organisms. They exist in fish, insects, microorga...

Descripción completa

Detalles Bibliográficos
Autores principales: Miyata, Ryosuke, Moriwaki, Yoshitaka, Terada, Tohru, Shimizu, Kentaro
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8473546/
https://www.ncbi.nlm.nih.gov/pubmed/34604556
http://dx.doi.org/10.1016/j.heliyon.2021.e07953
_version_ 1784575012202610688
author Miyata, Ryosuke
Moriwaki, Yoshitaka
Terada, Tohru
Shimizu, Kentaro
author_facet Miyata, Ryosuke
Moriwaki, Yoshitaka
Terada, Tohru
Shimizu, Kentaro
author_sort Miyata, Ryosuke
collection PubMed
description Antifreeze proteins (AFPs) are proteins that protect cellular fluids and body fluids from freezing by inhibiting the nucleation and growth of ice crystals and preventing ice recrystallization, thereby contributing to the maintenance of life in living organisms. They exist in fish, insects, microorganisms, and fungi. However, the number of known AFPs is currently limited, and it is essential to construct a reliable dataset of AFPs and develop a bioinformatics tool to predict AFPs. In this work, we first collected AFPs sequences from UniProtKB considering the reliability of annotations and, based on these datasets, developed a prediction system using random forest. We achieved accuracies of 0.961 and 0.947 for non-redundant sequences with less than 90% and 30% identities and achieved the accuracy of 0.953 for representative sequences for each species. Using the ability of random forest, we identified the sequence features that contributed to the prediction. Some sequence features were common to AFPs from different species. These features include the Cys content, Ala-Ala content, Trp-Gly content, and the amino acids’ distribution related to the disorder propensity. The computer program and the dataset developed in this work are available from the GitHub site: https://github.com/ryomiya/Prediction-and-analysis-of-antifreeze-proteins.
format Online
Article
Text
id pubmed-8473546
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-84735462021-10-01 Prediction and analysis of antifreeze proteins Miyata, Ryosuke Moriwaki, Yoshitaka Terada, Tohru Shimizu, Kentaro Heliyon Research Article Antifreeze proteins (AFPs) are proteins that protect cellular fluids and body fluids from freezing by inhibiting the nucleation and growth of ice crystals and preventing ice recrystallization, thereby contributing to the maintenance of life in living organisms. They exist in fish, insects, microorganisms, and fungi. However, the number of known AFPs is currently limited, and it is essential to construct a reliable dataset of AFPs and develop a bioinformatics tool to predict AFPs. In this work, we first collected AFPs sequences from UniProtKB considering the reliability of annotations and, based on these datasets, developed a prediction system using random forest. We achieved accuracies of 0.961 and 0.947 for non-redundant sequences with less than 90% and 30% identities and achieved the accuracy of 0.953 for representative sequences for each species. Using the ability of random forest, we identified the sequence features that contributed to the prediction. Some sequence features were common to AFPs from different species. These features include the Cys content, Ala-Ala content, Trp-Gly content, and the amino acids’ distribution related to the disorder propensity. The computer program and the dataset developed in this work are available from the GitHub site: https://github.com/ryomiya/Prediction-and-analysis-of-antifreeze-proteins. Elsevier 2021-09-08 /pmc/articles/PMC8473546/ /pubmed/34604556 http://dx.doi.org/10.1016/j.heliyon.2021.e07953 Text en © 2021 The Authors. Published by Elsevier Ltd. https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Research Article
Miyata, Ryosuke
Moriwaki, Yoshitaka
Terada, Tohru
Shimizu, Kentaro
Prediction and analysis of antifreeze proteins
title Prediction and analysis of antifreeze proteins
title_full Prediction and analysis of antifreeze proteins
title_fullStr Prediction and analysis of antifreeze proteins
title_full_unstemmed Prediction and analysis of antifreeze proteins
title_short Prediction and analysis of antifreeze proteins
title_sort prediction and analysis of antifreeze proteins
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8473546/
https://www.ncbi.nlm.nih.gov/pubmed/34604556
http://dx.doi.org/10.1016/j.heliyon.2021.e07953
work_keys_str_mv AT miyataryosuke predictionandanalysisofantifreezeproteins
AT moriwakiyoshitaka predictionandanalysisofantifreezeproteins
AT teradatohru predictionandanalysisofantifreezeproteins
AT shimizukentaro predictionandanalysisofantifreezeproteins