Cargando…

Identification of Hot Spots in Protein Structures Using Gaussian Network Model and Gaussian Naive Bayes

Residue fluctuations in protein structures have been shown to be highly associated with various protein functions. Gaussian network model (GNM), a simple representative coarse-grained model, was widely adopted to reveal function-related protein dynamics. We directly utilized the high frequency modes...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Hua, Jiang, Tao, Shan, Guogen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5110947/
https://www.ncbi.nlm.nih.gov/pubmed/27882325
http://dx.doi.org/10.1155/2016/4354901
_version_ 1782467776176193536
author Zhang, Hua
Jiang, Tao
Shan, Guogen
author_facet Zhang, Hua
Jiang, Tao
Shan, Guogen
author_sort Zhang, Hua
collection PubMed
description Residue fluctuations in protein structures have been shown to be highly associated with various protein functions. Gaussian network model (GNM), a simple representative coarse-grained model, was widely adopted to reveal function-related protein dynamics. We directly utilized the high frequency modes generated by GNM and further performed Gaussian Naive Bayes (GNB) to identify hot spot residues. Two coding schemes about the feature vectors were implemented with varying distance cutoffs for GNM and sliding window sizes for GNB based on tenfold cross validations: one by using only a single high mode and the other by combining multiple modes with the highest frequency. Our proposed methods outperformed the previous work that did not directly utilize the high frequency modes generated by GNM, with regard to overall performance evaluated using F1 measure. Moreover, we found that inclusion of more high frequency modes for a GNB classifier can significantly improve the sensitivity. The present study provided additional valuable insights into the relation between the hot spots and the residue fluctuations.
format Online
Article
Text
id pubmed-5110947
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-51109472016-11-23 Identification of Hot Spots in Protein Structures Using Gaussian Network Model and Gaussian Naive Bayes Zhang, Hua Jiang, Tao Shan, Guogen Biomed Res Int Research Article Residue fluctuations in protein structures have been shown to be highly associated with various protein functions. Gaussian network model (GNM), a simple representative coarse-grained model, was widely adopted to reveal function-related protein dynamics. We directly utilized the high frequency modes generated by GNM and further performed Gaussian Naive Bayes (GNB) to identify hot spot residues. Two coding schemes about the feature vectors were implemented with varying distance cutoffs for GNM and sliding window sizes for GNB based on tenfold cross validations: one by using only a single high mode and the other by combining multiple modes with the highest frequency. Our proposed methods outperformed the previous work that did not directly utilize the high frequency modes generated by GNM, with regard to overall performance evaluated using F1 measure. Moreover, we found that inclusion of more high frequency modes for a GNB classifier can significantly improve the sensitivity. The present study provided additional valuable insights into the relation between the hot spots and the residue fluctuations. Hindawi Publishing Corporation 2016 2016-11-02 /pmc/articles/PMC5110947/ /pubmed/27882325 http://dx.doi.org/10.1155/2016/4354901 Text en Copyright © 2016 Hua Zhang et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Zhang, Hua
Jiang, Tao
Shan, Guogen
Identification of Hot Spots in Protein Structures Using Gaussian Network Model and Gaussian Naive Bayes
title Identification of Hot Spots in Protein Structures Using Gaussian Network Model and Gaussian Naive Bayes
title_full Identification of Hot Spots in Protein Structures Using Gaussian Network Model and Gaussian Naive Bayes
title_fullStr Identification of Hot Spots in Protein Structures Using Gaussian Network Model and Gaussian Naive Bayes
title_full_unstemmed Identification of Hot Spots in Protein Structures Using Gaussian Network Model and Gaussian Naive Bayes
title_short Identification of Hot Spots in Protein Structures Using Gaussian Network Model and Gaussian Naive Bayes
title_sort identification of hot spots in protein structures using gaussian network model and gaussian naive bayes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5110947/
https://www.ncbi.nlm.nih.gov/pubmed/27882325
http://dx.doi.org/10.1155/2016/4354901
work_keys_str_mv AT zhanghua identificationofhotspotsinproteinstructuresusinggaussiannetworkmodelandgaussiannaivebayes
AT jiangtao identificationofhotspotsinproteinstructuresusinggaussiannetworkmodelandgaussiannaivebayes
AT shanguogen identificationofhotspotsinproteinstructuresusinggaussiannetworkmodelandgaussiannaivebayes