Cargando…

Dual-Signal Feature Spaces Map Protein Subcellular Locations Based on Immunohistochemistry Image and Protein Sequence

Protein is one of the primary biochemical macromolecular regulators in the compartmental cellular structure, and the subcellular locations of proteins can therefore provide information on the function of subcellular structures and physiological environments. Recently, data-driven systems have been d...

Descripción completa

Detalles Bibliográficos
Autores principales: Zou, Kai, Wang, Simeng, Wang, Ziqian, Zou, Hongliang, Yang, Fan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10675401/
https://www.ncbi.nlm.nih.gov/pubmed/38005402
http://dx.doi.org/10.3390/s23229014
_version_ 1785141056226983936
author Zou, Kai
Wang, Simeng
Wang, Ziqian
Zou, Hongliang
Yang, Fan
author_facet Zou, Kai
Wang, Simeng
Wang, Ziqian
Zou, Hongliang
Yang, Fan
author_sort Zou, Kai
collection PubMed
description Protein is one of the primary biochemical macromolecular regulators in the compartmental cellular structure, and the subcellular locations of proteins can therefore provide information on the function of subcellular structures and physiological environments. Recently, data-driven systems have been developed to predict the subcellular location of proteins based on protein sequence, immunohistochemistry (IHC) images, or immunofluorescence (IF) images. However, the research on the fusion of multiple protein signals has received little attention. In this study, we developed a dual-signal computational protocol by incorporating IHC images into protein sequences to learn protein subcellular localization. Three major steps can be summarized as follows in this protocol: first, a benchmark database that includes 281 proteins sorted out from 4722 proteins of the Human Protein Atlas (HPA) and Swiss-Prot database, which is involved in the endoplasmic reticulum (ER), Golgi apparatus, cytosol, and nucleoplasm; second, discriminative feature operators were first employed to quantitate protein image-sequence samples that include IHC images and protein sequence; finally, the feature subspace of different protein signals is absorbed to construct multiple sub-classifiers via dimensionality reduction and binary relevance (BR), and multiple confidence derived from multiple sub-classifiers is adopted to decide subcellular location by the centralized voting mechanism at the decision layer. The experimental results indicated that the dual-signal model embedded IHC images and protein sequences outperformed the single-signal models with accuracy, precision, and recall of 75.41%, 80.38%, and 74.38%, respectively. It is enlightening for further research on protein subcellular location prediction under multi-signal fusion of protein.
format Online
Article
Text
id pubmed-10675401
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-106754012023-11-07 Dual-Signal Feature Spaces Map Protein Subcellular Locations Based on Immunohistochemistry Image and Protein Sequence Zou, Kai Wang, Simeng Wang, Ziqian Zou, Hongliang Yang, Fan Sensors (Basel) Article Protein is one of the primary biochemical macromolecular regulators in the compartmental cellular structure, and the subcellular locations of proteins can therefore provide information on the function of subcellular structures and physiological environments. Recently, data-driven systems have been developed to predict the subcellular location of proteins based on protein sequence, immunohistochemistry (IHC) images, or immunofluorescence (IF) images. However, the research on the fusion of multiple protein signals has received little attention. In this study, we developed a dual-signal computational protocol by incorporating IHC images into protein sequences to learn protein subcellular localization. Three major steps can be summarized as follows in this protocol: first, a benchmark database that includes 281 proteins sorted out from 4722 proteins of the Human Protein Atlas (HPA) and Swiss-Prot database, which is involved in the endoplasmic reticulum (ER), Golgi apparatus, cytosol, and nucleoplasm; second, discriminative feature operators were first employed to quantitate protein image-sequence samples that include IHC images and protein sequence; finally, the feature subspace of different protein signals is absorbed to construct multiple sub-classifiers via dimensionality reduction and binary relevance (BR), and multiple confidence derived from multiple sub-classifiers is adopted to decide subcellular location by the centralized voting mechanism at the decision layer. The experimental results indicated that the dual-signal model embedded IHC images and protein sequences outperformed the single-signal models with accuracy, precision, and recall of 75.41%, 80.38%, and 74.38%, respectively. It is enlightening for further research on protein subcellular location prediction under multi-signal fusion of protein. MDPI 2023-11-07 /pmc/articles/PMC10675401/ /pubmed/38005402 http://dx.doi.org/10.3390/s23229014 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Zou, Kai
Wang, Simeng
Wang, Ziqian
Zou, Hongliang
Yang, Fan
Dual-Signal Feature Spaces Map Protein Subcellular Locations Based on Immunohistochemistry Image and Protein Sequence
title Dual-Signal Feature Spaces Map Protein Subcellular Locations Based on Immunohistochemistry Image and Protein Sequence
title_full Dual-Signal Feature Spaces Map Protein Subcellular Locations Based on Immunohistochemistry Image and Protein Sequence
title_fullStr Dual-Signal Feature Spaces Map Protein Subcellular Locations Based on Immunohistochemistry Image and Protein Sequence
title_full_unstemmed Dual-Signal Feature Spaces Map Protein Subcellular Locations Based on Immunohistochemistry Image and Protein Sequence
title_short Dual-Signal Feature Spaces Map Protein Subcellular Locations Based on Immunohistochemistry Image and Protein Sequence
title_sort dual-signal feature spaces map protein subcellular locations based on immunohistochemistry image and protein sequence
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10675401/
https://www.ncbi.nlm.nih.gov/pubmed/38005402
http://dx.doi.org/10.3390/s23229014
work_keys_str_mv AT zoukai dualsignalfeaturespacesmapproteinsubcellularlocationsbasedonimmunohistochemistryimageandproteinsequence
AT wangsimeng dualsignalfeaturespacesmapproteinsubcellularlocationsbasedonimmunohistochemistryimageandproteinsequence
AT wangziqian dualsignalfeaturespacesmapproteinsubcellularlocationsbasedonimmunohistochemistryimageandproteinsequence
AT zouhongliang dualsignalfeaturespacesmapproteinsubcellularlocationsbasedonimmunohistochemistryimageandproteinsequence
AT yangfan dualsignalfeaturespacesmapproteinsubcellularlocationsbasedonimmunohistochemistryimageandproteinsequence