Cargando…

Effective Moment Feature Vectors for Protein Domain Structures

Imaging processing techniques have been shown to be useful in studying protein domain structures. The idea is to represent the pairwise distances of any two residues of the structure in a 2D distance matrix (DM). Features and/or submatrices are extracted from this DM to represent a domain. Existing...

Descripción completa

Detalles Bibliográficos
Autores principales: Shi, Jian-Yu, Yiu, Siu-Ming, Zhang, Yan-Ning, Chin, Francis Yuk-Lun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3877117/
https://www.ncbi.nlm.nih.gov/pubmed/24391828
http://dx.doi.org/10.1371/journal.pone.0083788
_version_ 1782297596047392768
author Shi, Jian-Yu
Yiu, Siu-Ming
Zhang, Yan-Ning
Chin, Francis Yuk-Lun
author_facet Shi, Jian-Yu
Yiu, Siu-Ming
Zhang, Yan-Ning
Chin, Francis Yuk-Lun
author_sort Shi, Jian-Yu
collection PubMed
description Imaging processing techniques have been shown to be useful in studying protein domain structures. The idea is to represent the pairwise distances of any two residues of the structure in a 2D distance matrix (DM). Features and/or submatrices are extracted from this DM to represent a domain. Existing approaches, however, may involve a large number of features (100–400) or complicated mathematical operations. Finding fewer but more effective features is always desirable. In this paper, based on some key observations on DMs, we are able to decompose a DM image into four basic binary images, each representing the structural characteristics of a fundamental secondary structure element (SSE) or a motif in the domain. Using the concept of moments in image processing, we further derive 45 structural features based on the four binary images. Together with 4 features extracted from the basic images, we represent the structure of a domain using 49 features. We show that our feature vectors can represent domain structures effectively in terms of the following. (1) We show a higher accuracy for domain classification. (2) We show a clear and consistent distribution of domains using our proposed structural vector space. (3) We are able to cluster the domains according to our moment features and demonstrate a relationship between structural variation and functional diversity.
format Online
Article
Text
id pubmed-3877117
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-38771172014-01-03 Effective Moment Feature Vectors for Protein Domain Structures Shi, Jian-Yu Yiu, Siu-Ming Zhang, Yan-Ning Chin, Francis Yuk-Lun PLoS One Research Article Imaging processing techniques have been shown to be useful in studying protein domain structures. The idea is to represent the pairwise distances of any two residues of the structure in a 2D distance matrix (DM). Features and/or submatrices are extracted from this DM to represent a domain. Existing approaches, however, may involve a large number of features (100–400) or complicated mathematical operations. Finding fewer but more effective features is always desirable. In this paper, based on some key observations on DMs, we are able to decompose a DM image into four basic binary images, each representing the structural characteristics of a fundamental secondary structure element (SSE) or a motif in the domain. Using the concept of moments in image processing, we further derive 45 structural features based on the four binary images. Together with 4 features extracted from the basic images, we represent the structure of a domain using 49 features. We show that our feature vectors can represent domain structures effectively in terms of the following. (1) We show a higher accuracy for domain classification. (2) We show a clear and consistent distribution of domains using our proposed structural vector space. (3) We are able to cluster the domains according to our moment features and demonstrate a relationship between structural variation and functional diversity. Public Library of Science 2013-12-31 /pmc/articles/PMC3877117/ /pubmed/24391828 http://dx.doi.org/10.1371/journal.pone.0083788 Text en © 2013 Shi et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Shi, Jian-Yu
Yiu, Siu-Ming
Zhang, Yan-Ning
Chin, Francis Yuk-Lun
Effective Moment Feature Vectors for Protein Domain Structures
title Effective Moment Feature Vectors for Protein Domain Structures
title_full Effective Moment Feature Vectors for Protein Domain Structures
title_fullStr Effective Moment Feature Vectors for Protein Domain Structures
title_full_unstemmed Effective Moment Feature Vectors for Protein Domain Structures
title_short Effective Moment Feature Vectors for Protein Domain Structures
title_sort effective moment feature vectors for protein domain structures
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3877117/
https://www.ncbi.nlm.nih.gov/pubmed/24391828
http://dx.doi.org/10.1371/journal.pone.0083788
work_keys_str_mv AT shijianyu effectivemomentfeaturevectorsforproteindomainstructures
AT yiusiuming effectivemomentfeaturevectorsforproteindomainstructures
AT zhangyanning effectivemomentfeaturevectorsforproteindomainstructures
AT chinfrancisyuklun effectivemomentfeaturevectorsforproteindomainstructures