Cargando…

Identifying Antifreeze Proteins Based on Key Evolutionary Information

Antifreeze proteins are important antifreeze materials that have been widely used in industry, including in cryopreservation, de-icing, and food storage applications. However, the quantity of some commercially produced antifreeze proteins is insufficient for large-scale industrial applications. Furt...

Descripción completa

Detalles Bibliográficos
Autores principales: Sun, Shanwen, Ding, Hui, Wang, Donghua, Han, Shuguang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7113384/
https://www.ncbi.nlm.nih.gov/pubmed/32274383
http://dx.doi.org/10.3389/fbioe.2020.00244
_version_ 1783513657340067840
author Sun, Shanwen
Ding, Hui
Wang, Donghua
Han, Shuguang
author_facet Sun, Shanwen
Ding, Hui
Wang, Donghua
Han, Shuguang
author_sort Sun, Shanwen
collection PubMed
description Antifreeze proteins are important antifreeze materials that have been widely used in industry, including in cryopreservation, de-icing, and food storage applications. However, the quantity of some commercially produced antifreeze proteins is insufficient for large-scale industrial applications. Further, many antifreeze proteins have properties such as cytotoxicity, severely hindering their applications. Understanding the mechanisms underlying the protein–ice interactions and identifying novel antifreeze proteins are, therefore, urgently needed. In this study, to uncover the mechanisms underlying protein–ice interactions and provide an efficient and accurate tool for identifying antifreeze proteins, we assessed various evolutionary features based on position-specific scoring matrices (PSSMs) and evaluated their importance for discriminating of antifreeze and non-antifreeze proteins. We then parsimoniously selected seven key features with the highest importance. We found that the selected features showed opposite tendencies (regarding the conservation of certain amino acids) between antifreeze and non-antifreeze proteins. Five out of the seven features had relatively high contributions to the discrimination of antifreeze and non-antifreeze proteins, as revealed by a principal component analysis, i.e., the conservation of the replacement of Cys, Trp, and Gly in antifreeze proteins by Ala, Met, and Ala, respectively, in the related proteins, and the conservation of the replacement of Arg in non-antifreeze proteins by Ser and Arg in the related proteins. Based on the seven parsimoniously selected key features, we established a classifier using support vector machine, which outperformed the state-of-the-art tools. These results suggest that understanding evolutionary information is crucial to designing accurate automated methods for discriminating antifreeze and non-antifreeze proteins. Our classifier, therefore, is an efficient tool for annotating new proteins with antifreeze functions based on sequence information and can facilitate their application in industry.
format Online
Article
Text
id pubmed-7113384
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-71133842020-04-09 Identifying Antifreeze Proteins Based on Key Evolutionary Information Sun, Shanwen Ding, Hui Wang, Donghua Han, Shuguang Front Bioeng Biotechnol Bioengineering and Biotechnology Antifreeze proteins are important antifreeze materials that have been widely used in industry, including in cryopreservation, de-icing, and food storage applications. However, the quantity of some commercially produced antifreeze proteins is insufficient for large-scale industrial applications. Further, many antifreeze proteins have properties such as cytotoxicity, severely hindering their applications. Understanding the mechanisms underlying the protein–ice interactions and identifying novel antifreeze proteins are, therefore, urgently needed. In this study, to uncover the mechanisms underlying protein–ice interactions and provide an efficient and accurate tool for identifying antifreeze proteins, we assessed various evolutionary features based on position-specific scoring matrices (PSSMs) and evaluated their importance for discriminating of antifreeze and non-antifreeze proteins. We then parsimoniously selected seven key features with the highest importance. We found that the selected features showed opposite tendencies (regarding the conservation of certain amino acids) between antifreeze and non-antifreeze proteins. Five out of the seven features had relatively high contributions to the discrimination of antifreeze and non-antifreeze proteins, as revealed by a principal component analysis, i.e., the conservation of the replacement of Cys, Trp, and Gly in antifreeze proteins by Ala, Met, and Ala, respectively, in the related proteins, and the conservation of the replacement of Arg in non-antifreeze proteins by Ser and Arg in the related proteins. Based on the seven parsimoniously selected key features, we established a classifier using support vector machine, which outperformed the state-of-the-art tools. These results suggest that understanding evolutionary information is crucial to designing accurate automated methods for discriminating antifreeze and non-antifreeze proteins. Our classifier, therefore, is an efficient tool for annotating new proteins with antifreeze functions based on sequence information and can facilitate their application in industry. Frontiers Media S.A. 2020-03-26 /pmc/articles/PMC7113384/ /pubmed/32274383 http://dx.doi.org/10.3389/fbioe.2020.00244 Text en Copyright © 2020 Sun, Ding, Wang and Han. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Bioengineering and Biotechnology
Sun, Shanwen
Ding, Hui
Wang, Donghua
Han, Shuguang
Identifying Antifreeze Proteins Based on Key Evolutionary Information
title Identifying Antifreeze Proteins Based on Key Evolutionary Information
title_full Identifying Antifreeze Proteins Based on Key Evolutionary Information
title_fullStr Identifying Antifreeze Proteins Based on Key Evolutionary Information
title_full_unstemmed Identifying Antifreeze Proteins Based on Key Evolutionary Information
title_short Identifying Antifreeze Proteins Based on Key Evolutionary Information
title_sort identifying antifreeze proteins based on key evolutionary information
topic Bioengineering and Biotechnology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7113384/
https://www.ncbi.nlm.nih.gov/pubmed/32274383
http://dx.doi.org/10.3389/fbioe.2020.00244
work_keys_str_mv AT sunshanwen identifyingantifreezeproteinsbasedonkeyevolutionaryinformation
AT dinghui identifyingantifreezeproteinsbasedonkeyevolutionaryinformation
AT wangdonghua identifyingantifreezeproteinsbasedonkeyevolutionaryinformation
AT hanshuguang identifyingantifreezeproteinsbasedonkeyevolutionaryinformation