Cargando…
Improving Protein Subcellular Location Classification by Incorporating Three-Dimensional Structure Information
The subcellular locations of proteins are closely related to their functions. In the past few decades, the application of machine learning algorithms to predict protein subcellular locations has been an important topic in proteomics. However, most studies in this field used only amino acid sequences...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8615982/ https://www.ncbi.nlm.nih.gov/pubmed/34827605 http://dx.doi.org/10.3390/biom11111607 |
_version_ | 1784604237695549440 |
---|---|
author | Wang, Ge Zhai, Yu-Jia Xue, Zhen-Zhen Xu, Ying-Ying |
author_facet | Wang, Ge Zhai, Yu-Jia Xue, Zhen-Zhen Xu, Ying-Ying |
author_sort | Wang, Ge |
collection | PubMed |
description | The subcellular locations of proteins are closely related to their functions. In the past few decades, the application of machine learning algorithms to predict protein subcellular locations has been an important topic in proteomics. However, most studies in this field used only amino acid sequences as the data source. Only a few works focused on other protein data types. For example, three-dimensional structures, which contain far more functional protein information than sequences, remain to be explored. In this work, we extracted various handcrafted features to describe the protein structures from physical, chemical, and topological aspects, as well as the learned features obtained by deep neural networks. We then used these features to classify the protein subcellular locations. Our experimental results demonstrated that some of these structural features have a certain effect on the protein location classification, and can help improve the performance of sequence-based location predictors. Our method provides a new view for the analysis of protein spatial distribution, and is anticipated to be used in revealing the relationships between protein structures and functions. |
format | Online Article Text |
id | pubmed-8615982 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-86159822021-11-26 Improving Protein Subcellular Location Classification by Incorporating Three-Dimensional Structure Information Wang, Ge Zhai, Yu-Jia Xue, Zhen-Zhen Xu, Ying-Ying Biomolecules Article The subcellular locations of proteins are closely related to their functions. In the past few decades, the application of machine learning algorithms to predict protein subcellular locations has been an important topic in proteomics. However, most studies in this field used only amino acid sequences as the data source. Only a few works focused on other protein data types. For example, three-dimensional structures, which contain far more functional protein information than sequences, remain to be explored. In this work, we extracted various handcrafted features to describe the protein structures from physical, chemical, and topological aspects, as well as the learned features obtained by deep neural networks. We then used these features to classify the protein subcellular locations. Our experimental results demonstrated that some of these structural features have a certain effect on the protein location classification, and can help improve the performance of sequence-based location predictors. Our method provides a new view for the analysis of protein spatial distribution, and is anticipated to be used in revealing the relationships between protein structures and functions. MDPI 2021-10-29 /pmc/articles/PMC8615982/ /pubmed/34827605 http://dx.doi.org/10.3390/biom11111607 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Wang, Ge Zhai, Yu-Jia Xue, Zhen-Zhen Xu, Ying-Ying Improving Protein Subcellular Location Classification by Incorporating Three-Dimensional Structure Information |
title | Improving Protein Subcellular Location Classification by Incorporating Three-Dimensional Structure Information |
title_full | Improving Protein Subcellular Location Classification by Incorporating Three-Dimensional Structure Information |
title_fullStr | Improving Protein Subcellular Location Classification by Incorporating Three-Dimensional Structure Information |
title_full_unstemmed | Improving Protein Subcellular Location Classification by Incorporating Three-Dimensional Structure Information |
title_short | Improving Protein Subcellular Location Classification by Incorporating Three-Dimensional Structure Information |
title_sort | improving protein subcellular location classification by incorporating three-dimensional structure information |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8615982/ https://www.ncbi.nlm.nih.gov/pubmed/34827605 http://dx.doi.org/10.3390/biom11111607 |
work_keys_str_mv | AT wangge improvingproteinsubcellularlocationclassificationbyincorporatingthreedimensionalstructureinformation AT zhaiyujia improvingproteinsubcellularlocationclassificationbyincorporatingthreedimensionalstructureinformation AT xuezhenzhen improvingproteinsubcellularlocationclassificationbyincorporatingthreedimensionalstructureinformation AT xuyingying improvingproteinsubcellularlocationclassificationbyincorporatingthreedimensionalstructureinformation |