Cargando…

Improving Protein Subcellular Location Classification by Incorporating Three-Dimensional Structure Information

The subcellular locations of proteins are closely related to their functions. In the past few decades, the application of machine learning algorithms to predict protein subcellular locations has been an important topic in proteomics. However, most studies in this field used only amino acid sequences...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Ge, Zhai, Yu-Jia, Xue, Zhen-Zhen, Xu, Ying-Ying
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8615982/
https://www.ncbi.nlm.nih.gov/pubmed/34827605
http://dx.doi.org/10.3390/biom11111607
_version_ 1784604237695549440
author Wang, Ge
Zhai, Yu-Jia
Xue, Zhen-Zhen
Xu, Ying-Ying
author_facet Wang, Ge
Zhai, Yu-Jia
Xue, Zhen-Zhen
Xu, Ying-Ying
author_sort Wang, Ge
collection PubMed
description The subcellular locations of proteins are closely related to their functions. In the past few decades, the application of machine learning algorithms to predict protein subcellular locations has been an important topic in proteomics. However, most studies in this field used only amino acid sequences as the data source. Only a few works focused on other protein data types. For example, three-dimensional structures, which contain far more functional protein information than sequences, remain to be explored. In this work, we extracted various handcrafted features to describe the protein structures from physical, chemical, and topological aspects, as well as the learned features obtained by deep neural networks. We then used these features to classify the protein subcellular locations. Our experimental results demonstrated that some of these structural features have a certain effect on the protein location classification, and can help improve the performance of sequence-based location predictors. Our method provides a new view for the analysis of protein spatial distribution, and is anticipated to be used in revealing the relationships between protein structures and functions.
format Online
Article
Text
id pubmed-8615982
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-86159822021-11-26 Improving Protein Subcellular Location Classification by Incorporating Three-Dimensional Structure Information Wang, Ge Zhai, Yu-Jia Xue, Zhen-Zhen Xu, Ying-Ying Biomolecules Article The subcellular locations of proteins are closely related to their functions. In the past few decades, the application of machine learning algorithms to predict protein subcellular locations has been an important topic in proteomics. However, most studies in this field used only amino acid sequences as the data source. Only a few works focused on other protein data types. For example, three-dimensional structures, which contain far more functional protein information than sequences, remain to be explored. In this work, we extracted various handcrafted features to describe the protein structures from physical, chemical, and topological aspects, as well as the learned features obtained by deep neural networks. We then used these features to classify the protein subcellular locations. Our experimental results demonstrated that some of these structural features have a certain effect on the protein location classification, and can help improve the performance of sequence-based location predictors. Our method provides a new view for the analysis of protein spatial distribution, and is anticipated to be used in revealing the relationships between protein structures and functions. MDPI 2021-10-29 /pmc/articles/PMC8615982/ /pubmed/34827605 http://dx.doi.org/10.3390/biom11111607 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Wang, Ge
Zhai, Yu-Jia
Xue, Zhen-Zhen
Xu, Ying-Ying
Improving Protein Subcellular Location Classification by Incorporating Three-Dimensional Structure Information
title Improving Protein Subcellular Location Classification by Incorporating Three-Dimensional Structure Information
title_full Improving Protein Subcellular Location Classification by Incorporating Three-Dimensional Structure Information
title_fullStr Improving Protein Subcellular Location Classification by Incorporating Three-Dimensional Structure Information
title_full_unstemmed Improving Protein Subcellular Location Classification by Incorporating Three-Dimensional Structure Information
title_short Improving Protein Subcellular Location Classification by Incorporating Three-Dimensional Structure Information
title_sort improving protein subcellular location classification by incorporating three-dimensional structure information
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8615982/
https://www.ncbi.nlm.nih.gov/pubmed/34827605
http://dx.doi.org/10.3390/biom11111607
work_keys_str_mv AT wangge improvingproteinsubcellularlocationclassificationbyincorporatingthreedimensionalstructureinformation
AT zhaiyujia improvingproteinsubcellularlocationclassificationbyincorporatingthreedimensionalstructureinformation
AT xuezhenzhen improvingproteinsubcellularlocationclassificationbyincorporatingthreedimensionalstructureinformation
AT xuyingying improvingproteinsubcellularlocationclassificationbyincorporatingthreedimensionalstructureinformation