Cargando…
Identifying Antifreeze Proteins Based on Key Evolutionary Information
Antifreeze proteins are important antifreeze materials that have been widely used in industry, including in cryopreservation, de-icing, and food storage applications. However, the quantity of some commercially produced antifreeze proteins is insufficient for large-scale industrial applications. Furt...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7113384/ https://www.ncbi.nlm.nih.gov/pubmed/32274383 http://dx.doi.org/10.3389/fbioe.2020.00244 |
_version_ | 1783513657340067840 |
---|---|
author | Sun, Shanwen Ding, Hui Wang, Donghua Han, Shuguang |
author_facet | Sun, Shanwen Ding, Hui Wang, Donghua Han, Shuguang |
author_sort | Sun, Shanwen |
collection | PubMed |
description | Antifreeze proteins are important antifreeze materials that have been widely used in industry, including in cryopreservation, de-icing, and food storage applications. However, the quantity of some commercially produced antifreeze proteins is insufficient for large-scale industrial applications. Further, many antifreeze proteins have properties such as cytotoxicity, severely hindering their applications. Understanding the mechanisms underlying the protein–ice interactions and identifying novel antifreeze proteins are, therefore, urgently needed. In this study, to uncover the mechanisms underlying protein–ice interactions and provide an efficient and accurate tool for identifying antifreeze proteins, we assessed various evolutionary features based on position-specific scoring matrices (PSSMs) and evaluated their importance for discriminating of antifreeze and non-antifreeze proteins. We then parsimoniously selected seven key features with the highest importance. We found that the selected features showed opposite tendencies (regarding the conservation of certain amino acids) between antifreeze and non-antifreeze proteins. Five out of the seven features had relatively high contributions to the discrimination of antifreeze and non-antifreeze proteins, as revealed by a principal component analysis, i.e., the conservation of the replacement of Cys, Trp, and Gly in antifreeze proteins by Ala, Met, and Ala, respectively, in the related proteins, and the conservation of the replacement of Arg in non-antifreeze proteins by Ser and Arg in the related proteins. Based on the seven parsimoniously selected key features, we established a classifier using support vector machine, which outperformed the state-of-the-art tools. These results suggest that understanding evolutionary information is crucial to designing accurate automated methods for discriminating antifreeze and non-antifreeze proteins. Our classifier, therefore, is an efficient tool for annotating new proteins with antifreeze functions based on sequence information and can facilitate their application in industry. |
format | Online Article Text |
id | pubmed-7113384 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-71133842020-04-09 Identifying Antifreeze Proteins Based on Key Evolutionary Information Sun, Shanwen Ding, Hui Wang, Donghua Han, Shuguang Front Bioeng Biotechnol Bioengineering and Biotechnology Antifreeze proteins are important antifreeze materials that have been widely used in industry, including in cryopreservation, de-icing, and food storage applications. However, the quantity of some commercially produced antifreeze proteins is insufficient for large-scale industrial applications. Further, many antifreeze proteins have properties such as cytotoxicity, severely hindering their applications. Understanding the mechanisms underlying the protein–ice interactions and identifying novel antifreeze proteins are, therefore, urgently needed. In this study, to uncover the mechanisms underlying protein–ice interactions and provide an efficient and accurate tool for identifying antifreeze proteins, we assessed various evolutionary features based on position-specific scoring matrices (PSSMs) and evaluated their importance for discriminating of antifreeze and non-antifreeze proteins. We then parsimoniously selected seven key features with the highest importance. We found that the selected features showed opposite tendencies (regarding the conservation of certain amino acids) between antifreeze and non-antifreeze proteins. Five out of the seven features had relatively high contributions to the discrimination of antifreeze and non-antifreeze proteins, as revealed by a principal component analysis, i.e., the conservation of the replacement of Cys, Trp, and Gly in antifreeze proteins by Ala, Met, and Ala, respectively, in the related proteins, and the conservation of the replacement of Arg in non-antifreeze proteins by Ser and Arg in the related proteins. Based on the seven parsimoniously selected key features, we established a classifier using support vector machine, which outperformed the state-of-the-art tools. These results suggest that understanding evolutionary information is crucial to designing accurate automated methods for discriminating antifreeze and non-antifreeze proteins. Our classifier, therefore, is an efficient tool for annotating new proteins with antifreeze functions based on sequence information and can facilitate their application in industry. Frontiers Media S.A. 2020-03-26 /pmc/articles/PMC7113384/ /pubmed/32274383 http://dx.doi.org/10.3389/fbioe.2020.00244 Text en Copyright © 2020 Sun, Ding, Wang and Han. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Bioengineering and Biotechnology Sun, Shanwen Ding, Hui Wang, Donghua Han, Shuguang Identifying Antifreeze Proteins Based on Key Evolutionary Information |
title | Identifying Antifreeze Proteins Based on Key Evolutionary Information |
title_full | Identifying Antifreeze Proteins Based on Key Evolutionary Information |
title_fullStr | Identifying Antifreeze Proteins Based on Key Evolutionary Information |
title_full_unstemmed | Identifying Antifreeze Proteins Based on Key Evolutionary Information |
title_short | Identifying Antifreeze Proteins Based on Key Evolutionary Information |
title_sort | identifying antifreeze proteins based on key evolutionary information |
topic | Bioengineering and Biotechnology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7113384/ https://www.ncbi.nlm.nih.gov/pubmed/32274383 http://dx.doi.org/10.3389/fbioe.2020.00244 |
work_keys_str_mv | AT sunshanwen identifyingantifreezeproteinsbasedonkeyevolutionaryinformation AT dinghui identifyingantifreezeproteinsbasedonkeyevolutionaryinformation AT wangdonghua identifyingantifreezeproteinsbasedonkeyevolutionaryinformation AT hanshuguang identifyingantifreezeproteinsbasedonkeyevolutionaryinformation |