Cargando…

Subcellular Localization Prediction of Human Proteins Using Multifeature Selection Methods

Subcellular localization attempts to assign proteins to one of the cell compartments that performs specific biological functions. Finding the link between proteins, biological functions, and subcellular localization is an effective way to investigate the general organization of living cells in a sys...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Yu-Hang, Ding, ShiJian, Chen, Lei, Huang, Tao, Cai, Yu-Dong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9484878/
https://www.ncbi.nlm.nih.gov/pubmed/36132086
http://dx.doi.org/10.1155/2022/3288527
_version_ 1784791968877903872
author Zhang, Yu-Hang
Ding, ShiJian
Chen, Lei
Huang, Tao
Cai, Yu-Dong
author_facet Zhang, Yu-Hang
Ding, ShiJian
Chen, Lei
Huang, Tao
Cai, Yu-Dong
author_sort Zhang, Yu-Hang
collection PubMed
description Subcellular localization attempts to assign proteins to one of the cell compartments that performs specific biological functions. Finding the link between proteins, biological functions, and subcellular localization is an effective way to investigate the general organization of living cells in a systematic manner. However, determining the subcellular localization of proteins by traditional experimental approaches is difficult. Here, protein–protein interaction networks, functional enrichment on gene ontology and pathway, and a set of proteins having confirmed subcellular localization were applied to build prediction models for human protein subcellular localizations. To build an effective predictive model, we employed a variety of robust machine learning algorithms, including Boruta feature selection, minimum redundancy maximum relevance, Monte Carlo feature selection, and LightGBM. Then, the incremental feature selection method with random forest and support vector machine was used to discover the essential features. Furthermore, 38 key features were determined by integrating results of different feature selection methods, which may provide critical insights into the subcellular location of proteins. Their biological functions of subcellular localizations were discussed according to recent publications. In summary, our computational framework can help advance the understanding of subcellular localization prediction techniques and provide a new perspective to investigate the patterns of protein subcellular localization and their biological importance.
format Online
Article
Text
id pubmed-9484878
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-94848782022-09-20 Subcellular Localization Prediction of Human Proteins Using Multifeature Selection Methods Zhang, Yu-Hang Ding, ShiJian Chen, Lei Huang, Tao Cai, Yu-Dong Biomed Res Int Research Article Subcellular localization attempts to assign proteins to one of the cell compartments that performs specific biological functions. Finding the link between proteins, biological functions, and subcellular localization is an effective way to investigate the general organization of living cells in a systematic manner. However, determining the subcellular localization of proteins by traditional experimental approaches is difficult. Here, protein–protein interaction networks, functional enrichment on gene ontology and pathway, and a set of proteins having confirmed subcellular localization were applied to build prediction models for human protein subcellular localizations. To build an effective predictive model, we employed a variety of robust machine learning algorithms, including Boruta feature selection, minimum redundancy maximum relevance, Monte Carlo feature selection, and LightGBM. Then, the incremental feature selection method with random forest and support vector machine was used to discover the essential features. Furthermore, 38 key features were determined by integrating results of different feature selection methods, which may provide critical insights into the subcellular location of proteins. Their biological functions of subcellular localizations were discussed according to recent publications. In summary, our computational framework can help advance the understanding of subcellular localization prediction techniques and provide a new perspective to investigate the patterns of protein subcellular localization and their biological importance. Hindawi 2022-09-12 /pmc/articles/PMC9484878/ /pubmed/36132086 http://dx.doi.org/10.1155/2022/3288527 Text en Copyright © 2022 Yu-Hang Zhang et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Zhang, Yu-Hang
Ding, ShiJian
Chen, Lei
Huang, Tao
Cai, Yu-Dong
Subcellular Localization Prediction of Human Proteins Using Multifeature Selection Methods
title Subcellular Localization Prediction of Human Proteins Using Multifeature Selection Methods
title_full Subcellular Localization Prediction of Human Proteins Using Multifeature Selection Methods
title_fullStr Subcellular Localization Prediction of Human Proteins Using Multifeature Selection Methods
title_full_unstemmed Subcellular Localization Prediction of Human Proteins Using Multifeature Selection Methods
title_short Subcellular Localization Prediction of Human Proteins Using Multifeature Selection Methods
title_sort subcellular localization prediction of human proteins using multifeature selection methods
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9484878/
https://www.ncbi.nlm.nih.gov/pubmed/36132086
http://dx.doi.org/10.1155/2022/3288527
work_keys_str_mv AT zhangyuhang subcellularlocalizationpredictionofhumanproteinsusingmultifeatureselectionmethods
AT dingshijian subcellularlocalizationpredictionofhumanproteinsusingmultifeatureselectionmethods
AT chenlei subcellularlocalizationpredictionofhumanproteinsusingmultifeatureselectionmethods
AT huangtao subcellularlocalizationpredictionofhumanproteinsusingmultifeatureselectionmethods
AT caiyudong subcellularlocalizationpredictionofhumanproteinsusingmultifeatureselectionmethods