Cargando…

MIC_Locator: a novel image-based protein subcellular location multi-label prediction model based on multi-scale monogenic signal representation and intensity encoding strategy

BACKGROUND: Protein subcellular localization plays a crucial role in understanding cell function. Proteins need to be in the right place at the right time, and combine with the corresponding molecules to fulfill their functions. Furthermore, prediction of protein subcellular location not only should...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Fan, Liu, Yang, Wang, Yanbin, Yin, Zhijian, Yang, Zhen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6815465/
https://www.ncbi.nlm.nih.gov/pubmed/31655541
http://dx.doi.org/10.1186/s12859-019-3136-3
_version_ 1783463187276890112
author Yang, Fan
Liu, Yang
Wang, Yanbin
Yin, Zhijian
Yang, Zhen
author_facet Yang, Fan
Liu, Yang
Wang, Yanbin
Yin, Zhijian
Yang, Zhen
author_sort Yang, Fan
collection PubMed
description BACKGROUND: Protein subcellular localization plays a crucial role in understanding cell function. Proteins need to be in the right place at the right time, and combine with the corresponding molecules to fulfill their functions. Furthermore, prediction of protein subcellular location not only should be a guiding role in drug design and development due to potential molecular targets but also be an essential role in genome annotation. Taking the current status of image-based protein subcellular localization as an example, there are three common drawbacks, i.e., obsolete datasets without updating label information, stereotypical feature descriptor on spatial domain or grey level, and single-function prediction algorithm’s limited capacity of handling single-label database. RESULTS: In this paper, a novel human protein subcellular localization prediction model MIC_Locator is proposed. Firstly, the latest datasets are collected and collated as our benchmark dataset instead of obsolete data while training prediction model. Secondly, Fourier transformation, Riesz transformation, Log-Gabor filter and intensity coding strategy are employed to obtain frequency feature based on three components of monogenic signal with different frequency scales. Thirdly, a chained prediction model is proposed to handle multi-label instead of single-label datasets. The experiment results showed that the MIC_Locator can achieve 60.56% subset accuracy and outperform the existing majority of prediction models, and the frequency feature and intensity coding strategy can be conducive to improving the classification accuracy. CONCLUSIONS: Our results demonstrate that the frequency feature is more beneficial for improving the performance of model compared to features extracted from spatial domain, and the MIC_Locator proposed in this paper can speed up validation of protein annotation, knowledge of protein function and proteomics research.
format Online
Article
Text
id pubmed-6815465
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-68154652019-10-31 MIC_Locator: a novel image-based protein subcellular location multi-label prediction model based on multi-scale monogenic signal representation and intensity encoding strategy Yang, Fan Liu, Yang Wang, Yanbin Yin, Zhijian Yang, Zhen BMC Bioinformatics Research Article BACKGROUND: Protein subcellular localization plays a crucial role in understanding cell function. Proteins need to be in the right place at the right time, and combine with the corresponding molecules to fulfill their functions. Furthermore, prediction of protein subcellular location not only should be a guiding role in drug design and development due to potential molecular targets but also be an essential role in genome annotation. Taking the current status of image-based protein subcellular localization as an example, there are three common drawbacks, i.e., obsolete datasets without updating label information, stereotypical feature descriptor on spatial domain or grey level, and single-function prediction algorithm’s limited capacity of handling single-label database. RESULTS: In this paper, a novel human protein subcellular localization prediction model MIC_Locator is proposed. Firstly, the latest datasets are collected and collated as our benchmark dataset instead of obsolete data while training prediction model. Secondly, Fourier transformation, Riesz transformation, Log-Gabor filter and intensity coding strategy are employed to obtain frequency feature based on three components of monogenic signal with different frequency scales. Thirdly, a chained prediction model is proposed to handle multi-label instead of single-label datasets. The experiment results showed that the MIC_Locator can achieve 60.56% subset accuracy and outperform the existing majority of prediction models, and the frequency feature and intensity coding strategy can be conducive to improving the classification accuracy. CONCLUSIONS: Our results demonstrate that the frequency feature is more beneficial for improving the performance of model compared to features extracted from spatial domain, and the MIC_Locator proposed in this paper can speed up validation of protein annotation, knowledge of protein function and proteomics research. BioMed Central 2019-10-26 /pmc/articles/PMC6815465/ /pubmed/31655541 http://dx.doi.org/10.1186/s12859-019-3136-3 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Yang, Fan
Liu, Yang
Wang, Yanbin
Yin, Zhijian
Yang, Zhen
MIC_Locator: a novel image-based protein subcellular location multi-label prediction model based on multi-scale monogenic signal representation and intensity encoding strategy
title MIC_Locator: a novel image-based protein subcellular location multi-label prediction model based on multi-scale monogenic signal representation and intensity encoding strategy
title_full MIC_Locator: a novel image-based protein subcellular location multi-label prediction model based on multi-scale monogenic signal representation and intensity encoding strategy
title_fullStr MIC_Locator: a novel image-based protein subcellular location multi-label prediction model based on multi-scale monogenic signal representation and intensity encoding strategy
title_full_unstemmed MIC_Locator: a novel image-based protein subcellular location multi-label prediction model based on multi-scale monogenic signal representation and intensity encoding strategy
title_short MIC_Locator: a novel image-based protein subcellular location multi-label prediction model based on multi-scale monogenic signal representation and intensity encoding strategy
title_sort mic_locator: a novel image-based protein subcellular location multi-label prediction model based on multi-scale monogenic signal representation and intensity encoding strategy
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6815465/
https://www.ncbi.nlm.nih.gov/pubmed/31655541
http://dx.doi.org/10.1186/s12859-019-3136-3
work_keys_str_mv AT yangfan miclocatoranovelimagebasedproteinsubcellularlocationmultilabelpredictionmodelbasedonmultiscalemonogenicsignalrepresentationandintensityencodingstrategy
AT liuyang miclocatoranovelimagebasedproteinsubcellularlocationmultilabelpredictionmodelbasedonmultiscalemonogenicsignalrepresentationandintensityencodingstrategy
AT wangyanbin miclocatoranovelimagebasedproteinsubcellularlocationmultilabelpredictionmodelbasedonmultiscalemonogenicsignalrepresentationandintensityencodingstrategy
AT yinzhijian miclocatoranovelimagebasedproteinsubcellularlocationmultilabelpredictionmodelbasedonmultiscalemonogenicsignalrepresentationandintensityencodingstrategy
AT yangzhen miclocatoranovelimagebasedproteinsubcellularlocationmultilabelpredictionmodelbasedonmultiscalemonogenicsignalrepresentationandintensityencodingstrategy