Cargando…
Predicting subcellular location of protein with evolution information and sequence-based deep learning
BACKGROUND: Protein subcellular localization prediction plays an important role in biology research. Since traditional methods are laborious and time-consuming, many machine learning-based prediction methods have been proposed. However, most of the proposed methods ignore the evolution information o...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8539821/ https://www.ncbi.nlm.nih.gov/pubmed/34686152 http://dx.doi.org/10.1186/s12859-021-04404-0 |
_version_ | 1784588839476527104 |
---|---|
author | Liao, Zhijun Pan, Gaofeng Sun, Chao Tang, Jijun |
author_facet | Liao, Zhijun Pan, Gaofeng Sun, Chao Tang, Jijun |
author_sort | Liao, Zhijun |
collection | PubMed |
description | BACKGROUND: Protein subcellular localization prediction plays an important role in biology research. Since traditional methods are laborious and time-consuming, many machine learning-based prediction methods have been proposed. However, most of the proposed methods ignore the evolution information of proteins. In order to improve the prediction accuracy, we present a deep learning-based method to predict protein subcellular locations. RESULTS: Our method utilizes not only amino acid compositions sequence but also evolution matrices of proteins. Our method uses a bidirectional long short-term memory network that processes the entire protein sequence and a convolutional neural network that extracts features from protein sequences. The position specific scoring matrix is used as a supplement to protein sequences. Our method was trained and tested on two benchmark datasets. The experiment results show that our method yields accurate results on the two datasets with an average precision of 0.7901, ranking loss of 0.0758 and coverage of 1.2848. CONCLUSION: The experiment results show that our method outperforms five methods currently available. According to those experiments, we can see that our method is an acceptable alternative to predict protein subcellular location. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04404-0. |
format | Online Article Text |
id | pubmed-8539821 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-85398212021-10-25 Predicting subcellular location of protein with evolution information and sequence-based deep learning Liao, Zhijun Pan, Gaofeng Sun, Chao Tang, Jijun BMC Bioinformatics Research BACKGROUND: Protein subcellular localization prediction plays an important role in biology research. Since traditional methods are laborious and time-consuming, many machine learning-based prediction methods have been proposed. However, most of the proposed methods ignore the evolution information of proteins. In order to improve the prediction accuracy, we present a deep learning-based method to predict protein subcellular locations. RESULTS: Our method utilizes not only amino acid compositions sequence but also evolution matrices of proteins. Our method uses a bidirectional long short-term memory network that processes the entire protein sequence and a convolutional neural network that extracts features from protein sequences. The position specific scoring matrix is used as a supplement to protein sequences. Our method was trained and tested on two benchmark datasets. The experiment results show that our method yields accurate results on the two datasets with an average precision of 0.7901, ranking loss of 0.0758 and coverage of 1.2848. CONCLUSION: The experiment results show that our method outperforms five methods currently available. According to those experiments, we can see that our method is an acceptable alternative to predict protein subcellular location. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04404-0. BioMed Central 2021-10-22 /pmc/articles/PMC8539821/ /pubmed/34686152 http://dx.doi.org/10.1186/s12859-021-04404-0 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Liao, Zhijun Pan, Gaofeng Sun, Chao Tang, Jijun Predicting subcellular location of protein with evolution information and sequence-based deep learning |
title | Predicting subcellular location of protein with evolution information and sequence-based deep learning |
title_full | Predicting subcellular location of protein with evolution information and sequence-based deep learning |
title_fullStr | Predicting subcellular location of protein with evolution information and sequence-based deep learning |
title_full_unstemmed | Predicting subcellular location of protein with evolution information and sequence-based deep learning |
title_short | Predicting subcellular location of protein with evolution information and sequence-based deep learning |
title_sort | predicting subcellular location of protein with evolution information and sequence-based deep learning |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8539821/ https://www.ncbi.nlm.nih.gov/pubmed/34686152 http://dx.doi.org/10.1186/s12859-021-04404-0 |
work_keys_str_mv | AT liaozhijun predictingsubcellularlocationofproteinwithevolutioninformationandsequencebaseddeeplearning AT pangaofeng predictingsubcellularlocationofproteinwithevolutioninformationandsequencebaseddeeplearning AT sunchao predictingsubcellularlocationofproteinwithevolutioninformationandsequencebaseddeeplearning AT tangjijun predictingsubcellularlocationofproteinwithevolutioninformationandsequencebaseddeeplearning |