
A study on separation of the protein structural types in amino acid sequence feature spaces


Bibliographic Details
Main authors: Wan, Xiaogeng; Tan, Xinying
Format: Online Article Text
Language: English
Published: Public Library of Science 2019
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6927603/
https://www.ncbi.nlm.nih.gov/pubmed/31869390
http://dx.doi.org/10.1371/journal.pone.0226768
_version_ 1783482327605706752
author Wan, Xiaogeng
Tan, Xinying
collection PubMed
description Proteins are diverse in their sequences, structures and functions, so it is important to study the relations among the three. In this paper, we survey the relations between protein sequences and their structures. We use the natural vector (NV) and the averaged property factor (APF) features to represent protein sequences as feature vectors, and use the multi-class MSE and convex hull methods to separate proteins of different structural classes into different regions. We found that proteins from different structural classes are separable by hyperplanes and convex hulls in the natural vector feature space, where the feature vectors of different structural classes fall into disjoint regions or convex hulls of the high-dimensional feature space. The natural vector outperforms the averaged property factor method in identifying the structures, and the convex hull method outperforms the multi-class MSE in separating the feature points. These outcomes support a strong connection between protein sequences and their structures, and may imply that the amino acid composition and sequence arrangement represented by the natural vectors have a greater influence on the structures than the averaged physical property factors of the amino acids.
format Online
Article
Text
id pubmed-6927603
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-6927603 2020-01-07 A study on separation of the protein structural types in amino acid sequence feature spaces Wan, Xiaogeng Tan, Xinying PLoS One Research Article
Public Library of Science 2019-12-23 /pmc/articles/PMC6927603/ /pubmed/31869390 http://dx.doi.org/10.1371/journal.pone.0226768 Text en © 2019 Wan, Tan http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
title A study on separation of the protein structural types in amino acid sequence feature spaces
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6927603/
https://www.ncbi.nlm.nih.gov/pubmed/31869390
http://dx.doi.org/10.1371/journal.pone.0226768