Cargando…

Identification of plant vacuole proteins by exploiting deep representation learning features

Plant vacuoles are the most important organelles for plant growth, development, and defense, and they play an important role in many types of stress responses. An important function of vacuole proteins is the transport of various classes of amino acids, ions, sugars, and other molecules. Accurate id...

Descripción completa

Detalles Bibliográficos
Autores principales: Jiao, Shihu, Zou, Quan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Research Network of Computational and Structural Biotechnology 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9207291/
https://www.ncbi.nlm.nih.gov/pubmed/35765653
http://dx.doi.org/10.1016/j.csbj.2022.06.002
_version_ 1784729492909981696
author Jiao, Shihu
Zou, Quan
author_facet Jiao, Shihu
Zou, Quan
author_sort Jiao, Shihu
collection PubMed
description Plant vacuoles are the most important organelles for plant growth, development, and defense, and they play an important role in many types of stress responses. An important function of vacuole proteins is the transport of various classes of amino acids, ions, sugars, and other molecules. Accurate identification of vacuole proteins is crucial for revealing their biological functions. Several automatic and rapid computational tools have been proposed for the subcellular localization of proteins. Regrettably, they are not specific for the identification of plant vacuole proteins. To the best of our knowledge, there is only one computational software specifically trained for plant vacuolar proteins. Although its accuracy is acceptable, the prediction performance and stability of this method in practical applications can still be improved. Hence, in this study, a new predictor named iPVP-DRLF was developed to identify plant vacuole proteins specifically and effectively. This prediction software is designed using the light gradient boosting machine (LGBM) algorithm and hybrid features composed of classic sequence features and deep representation learning features. iPVP-DRLF achieved fivefold cross-validation and independent test accuracy values of 88.25 % and 87.16 %, respectively, both outperforming previous state-of-the-art predictors. Moreover, the blind dataset test results also showed that the performance of iPVP-DRLF was significantly better than the existing tools. The results of comparative experiments confirmed that deep representation learning features have an advantage over other classic sequence features in the identification of plant vacuole proteins. We believe that iPVP-DRLF would serve as an effective computational technique for plant vacuole protein prediction and facilitate related future research. The online server is freely accessible at https://lab.malab.cn/~acy/iPVP-DRLF. In addition, the source code and datasets are also accessible at https://github.com/jiaoshihu/iPVP-DRLF.
format Online
Article
Text
id pubmed-9207291
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Research Network of Computational and Structural Biotechnology
record_format MEDLINE/PubMed
spelling pubmed-92072912022-06-27 Identification of plant vacuole proteins by exploiting deep representation learning features Jiao, Shihu Zou, Quan Comput Struct Biotechnol J Research Article Plant vacuoles are the most important organelles for plant growth, development, and defense, and they play an important role in many types of stress responses. An important function of vacuole proteins is the transport of various classes of amino acids, ions, sugars, and other molecules. Accurate identification of vacuole proteins is crucial for revealing their biological functions. Several automatic and rapid computational tools have been proposed for the subcellular localization of proteins. Regrettably, they are not specific for the identification of plant vacuole proteins. To the best of our knowledge, there is only one computational software specifically trained for plant vacuolar proteins. Although its accuracy is acceptable, the prediction performance and stability of this method in practical applications can still be improved. Hence, in this study, a new predictor named iPVP-DRLF was developed to identify plant vacuole proteins specifically and effectively. This prediction software is designed using the light gradient boosting machine (LGBM) algorithm and hybrid features composed of classic sequence features and deep representation learning features. iPVP-DRLF achieved fivefold cross-validation and independent test accuracy values of 88.25 % and 87.16 %, respectively, both outperforming previous state-of-the-art predictors. Moreover, the blind dataset test results also showed that the performance of iPVP-DRLF was significantly better than the existing tools. The results of comparative experiments confirmed that deep representation learning features have an advantage over other classic sequence features in the identification of plant vacuole proteins. We believe that iPVP-DRLF would serve as an effective computational technique for plant vacuole protein prediction and facilitate related future research. The online server is freely accessible at https://lab.malab.cn/~acy/iPVP-DRLF. In addition, the source code and datasets are also accessible at https://github.com/jiaoshihu/iPVP-DRLF. Research Network of Computational and Structural Biotechnology 2022-06-08 /pmc/articles/PMC9207291/ /pubmed/35765653 http://dx.doi.org/10.1016/j.csbj.2022.06.002 Text en © 2022 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Research Article
Jiao, Shihu
Zou, Quan
Identification of plant vacuole proteins by exploiting deep representation learning features
title Identification of plant vacuole proteins by exploiting deep representation learning features
title_full Identification of plant vacuole proteins by exploiting deep representation learning features
title_fullStr Identification of plant vacuole proteins by exploiting deep representation learning features
title_full_unstemmed Identification of plant vacuole proteins by exploiting deep representation learning features
title_short Identification of plant vacuole proteins by exploiting deep representation learning features
title_sort identification of plant vacuole proteins by exploiting deep representation learning features
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9207291/
https://www.ncbi.nlm.nih.gov/pubmed/35765653
http://dx.doi.org/10.1016/j.csbj.2022.06.002
work_keys_str_mv AT jiaoshihu identificationofplantvacuoleproteinsbyexploitingdeeprepresentationlearningfeatures
AT zouquan identificationofplantvacuoleproteinsbyexploitingdeeprepresentationlearningfeatures