Cargando…
Identification of plant vacuole proteins by exploiting deep representation learning features
Plant vacuoles are the most important organelles for plant growth, development, and defense, and they play an important role in many types of stress responses. An important function of vacuole proteins is the transport of various classes of amino acids, ions, sugars, and other molecules. Accurate id...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Research Network of Computational and Structural Biotechnology
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9207291/ https://www.ncbi.nlm.nih.gov/pubmed/35765653 http://dx.doi.org/10.1016/j.csbj.2022.06.002 |
_version_ | 1784729492909981696 |
---|---|
author | Jiao, Shihu Zou, Quan |
author_facet | Jiao, Shihu Zou, Quan |
author_sort | Jiao, Shihu |
collection | PubMed |
description | Plant vacuoles are the most important organelles for plant growth, development, and defense, and they play an important role in many types of stress responses. An important function of vacuole proteins is the transport of various classes of amino acids, ions, sugars, and other molecules. Accurate identification of vacuole proteins is crucial for revealing their biological functions. Several automatic and rapid computational tools have been proposed for the subcellular localization of proteins. Regrettably, they are not specific for the identification of plant vacuole proteins. To the best of our knowledge, there is only one computational software specifically trained for plant vacuolar proteins. Although its accuracy is acceptable, the prediction performance and stability of this method in practical applications can still be improved. Hence, in this study, a new predictor named iPVP-DRLF was developed to identify plant vacuole proteins specifically and effectively. This prediction software is designed using the light gradient boosting machine (LGBM) algorithm and hybrid features composed of classic sequence features and deep representation learning features. iPVP-DRLF achieved fivefold cross-validation and independent test accuracy values of 88.25 % and 87.16 %, respectively, both outperforming previous state-of-the-art predictors. Moreover, the blind dataset test results also showed that the performance of iPVP-DRLF was significantly better than the existing tools. The results of comparative experiments confirmed that deep representation learning features have an advantage over other classic sequence features in the identification of plant vacuole proteins. We believe that iPVP-DRLF would serve as an effective computational technique for plant vacuole protein prediction and facilitate related future research. The online server is freely accessible at https://lab.malab.cn/~acy/iPVP-DRLF. In addition, the source code and datasets are also accessible at https://github.com/jiaoshihu/iPVP-DRLF. |
format | Online Article Text |
id | pubmed-9207291 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Research Network of Computational and Structural Biotechnology |
record_format | MEDLINE/PubMed |
spelling | pubmed-92072912022-06-27 Identification of plant vacuole proteins by exploiting deep representation learning features Jiao, Shihu Zou, Quan Comput Struct Biotechnol J Research Article Plant vacuoles are the most important organelles for plant growth, development, and defense, and they play an important role in many types of stress responses. An important function of vacuole proteins is the transport of various classes of amino acids, ions, sugars, and other molecules. Accurate identification of vacuole proteins is crucial for revealing their biological functions. Several automatic and rapid computational tools have been proposed for the subcellular localization of proteins. Regrettably, they are not specific for the identification of plant vacuole proteins. To the best of our knowledge, there is only one computational software specifically trained for plant vacuolar proteins. Although its accuracy is acceptable, the prediction performance and stability of this method in practical applications can still be improved. Hence, in this study, a new predictor named iPVP-DRLF was developed to identify plant vacuole proteins specifically and effectively. This prediction software is designed using the light gradient boosting machine (LGBM) algorithm and hybrid features composed of classic sequence features and deep representation learning features. iPVP-DRLF achieved fivefold cross-validation and independent test accuracy values of 88.25 % and 87.16 %, respectively, both outperforming previous state-of-the-art predictors. Moreover, the blind dataset test results also showed that the performance of iPVP-DRLF was significantly better than the existing tools. The results of comparative experiments confirmed that deep representation learning features have an advantage over other classic sequence features in the identification of plant vacuole proteins. We believe that iPVP-DRLF would serve as an effective computational technique for plant vacuole protein prediction and facilitate related future research. The online server is freely accessible at https://lab.malab.cn/~acy/iPVP-DRLF. In addition, the source code and datasets are also accessible at https://github.com/jiaoshihu/iPVP-DRLF. Research Network of Computational and Structural Biotechnology 2022-06-08 /pmc/articles/PMC9207291/ /pubmed/35765653 http://dx.doi.org/10.1016/j.csbj.2022.06.002 Text en © 2022 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Research Article Jiao, Shihu Zou, Quan Identification of plant vacuole proteins by exploiting deep representation learning features |
title | Identification of plant vacuole proteins by exploiting deep representation learning features |
title_full | Identification of plant vacuole proteins by exploiting deep representation learning features |
title_fullStr | Identification of plant vacuole proteins by exploiting deep representation learning features |
title_full_unstemmed | Identification of plant vacuole proteins by exploiting deep representation learning features |
title_short | Identification of plant vacuole proteins by exploiting deep representation learning features |
title_sort | identification of plant vacuole proteins by exploiting deep representation learning features |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9207291/ https://www.ncbi.nlm.nih.gov/pubmed/35765653 http://dx.doi.org/10.1016/j.csbj.2022.06.002 |
work_keys_str_mv | AT jiaoshihu identificationofplantvacuoleproteinsbyexploitingdeeprepresentationlearningfeatures AT zouquan identificationofplantvacuoleproteinsbyexploitingdeeprepresentationlearningfeatures |