Cargando…
Identification of Hot Spots in Protein Structures Using Gaussian Network Model and Gaussian Naive Bayes
Residue fluctuations in protein structures have been shown to be highly associated with various protein functions. Gaussian network model (GNM), a simple representative coarse-grained model, was widely adopted to reveal function-related protein dynamics. We directly utilized the high frequency modes...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi Publishing Corporation
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5110947/ https://www.ncbi.nlm.nih.gov/pubmed/27882325 http://dx.doi.org/10.1155/2016/4354901 |
_version_ | 1782467776176193536 |
---|---|
author | Zhang, Hua Jiang, Tao Shan, Guogen |
author_facet | Zhang, Hua Jiang, Tao Shan, Guogen |
author_sort | Zhang, Hua |
collection | PubMed |
description | Residue fluctuations in protein structures have been shown to be highly associated with various protein functions. Gaussian network model (GNM), a simple representative coarse-grained model, was widely adopted to reveal function-related protein dynamics. We directly utilized the high frequency modes generated by GNM and further performed Gaussian Naive Bayes (GNB) to identify hot spot residues. Two coding schemes about the feature vectors were implemented with varying distance cutoffs for GNM and sliding window sizes for GNB based on tenfold cross validations: one by using only a single high mode and the other by combining multiple modes with the highest frequency. Our proposed methods outperformed the previous work that did not directly utilize the high frequency modes generated by GNM, with regard to overall performance evaluated using F1 measure. Moreover, we found that inclusion of more high frequency modes for a GNB classifier can significantly improve the sensitivity. The present study provided additional valuable insights into the relation between the hot spots and the residue fluctuations. |
format | Online Article Text |
id | pubmed-5110947 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Hindawi Publishing Corporation |
record_format | MEDLINE/PubMed |
spelling | pubmed-51109472016-11-23 Identification of Hot Spots in Protein Structures Using Gaussian Network Model and Gaussian Naive Bayes Zhang, Hua Jiang, Tao Shan, Guogen Biomed Res Int Research Article Residue fluctuations in protein structures have been shown to be highly associated with various protein functions. Gaussian network model (GNM), a simple representative coarse-grained model, was widely adopted to reveal function-related protein dynamics. We directly utilized the high frequency modes generated by GNM and further performed Gaussian Naive Bayes (GNB) to identify hot spot residues. Two coding schemes about the feature vectors were implemented with varying distance cutoffs for GNM and sliding window sizes for GNB based on tenfold cross validations: one by using only a single high mode and the other by combining multiple modes with the highest frequency. Our proposed methods outperformed the previous work that did not directly utilize the high frequency modes generated by GNM, with regard to overall performance evaluated using F1 measure. Moreover, we found that inclusion of more high frequency modes for a GNB classifier can significantly improve the sensitivity. The present study provided additional valuable insights into the relation between the hot spots and the residue fluctuations. Hindawi Publishing Corporation 2016 2016-11-02 /pmc/articles/PMC5110947/ /pubmed/27882325 http://dx.doi.org/10.1155/2016/4354901 Text en Copyright © 2016 Hua Zhang et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Zhang, Hua Jiang, Tao Shan, Guogen Identification of Hot Spots in Protein Structures Using Gaussian Network Model and Gaussian Naive Bayes |
title | Identification of Hot Spots in Protein Structures Using Gaussian Network Model and Gaussian Naive Bayes |
title_full | Identification of Hot Spots in Protein Structures Using Gaussian Network Model and Gaussian Naive Bayes |
title_fullStr | Identification of Hot Spots in Protein Structures Using Gaussian Network Model and Gaussian Naive Bayes |
title_full_unstemmed | Identification of Hot Spots in Protein Structures Using Gaussian Network Model and Gaussian Naive Bayes |
title_short | Identification of Hot Spots in Protein Structures Using Gaussian Network Model and Gaussian Naive Bayes |
title_sort | identification of hot spots in protein structures using gaussian network model and gaussian naive bayes |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5110947/ https://www.ncbi.nlm.nih.gov/pubmed/27882325 http://dx.doi.org/10.1155/2016/4354901 |
work_keys_str_mv | AT zhanghua identificationofhotspotsinproteinstructuresusinggaussiannetworkmodelandgaussiannaivebayes AT jiangtao identificationofhotspotsinproteinstructuresusinggaussiannetworkmodelandgaussiannaivebayes AT shanguogen identificationofhotspotsinproteinstructuresusinggaussiannetworkmodelandgaussiannaivebayes |