Cargando…

Using feature optimization-based support vector machine method to recognize the β-hairpin motifs in enzymes

β-Hairpins in enzyme, a kind of special protein with catalytic functions, contain many binding sites which are essential for the functions of enzyme. With the increasing number of observed enzyme protein sequences, it is of especial importance to use bioinformatics techniques to quickly and accurate...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Dongmei, Hu, Xiuzhen, Liu, Xingxing, Feng, Zhenxing, Ding, Changjiang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5562482/
https://www.ncbi.nlm.nih.gov/pubmed/28855832
http://dx.doi.org/10.1016/j.sjbs.2016.11.014
_version_ 1783257975511580672
author Li, Dongmei
Hu, Xiuzhen
Liu, Xingxing
Feng, Zhenxing
Ding, Changjiang
author_facet Li, Dongmei
Hu, Xiuzhen
Liu, Xingxing
Feng, Zhenxing
Ding, Changjiang
author_sort Li, Dongmei
collection PubMed
description β-Hairpins in enzyme, a kind of special protein with catalytic functions, contain many binding sites which are essential for the functions of enzyme. With the increasing number of observed enzyme protein sequences, it is of especial importance to use bioinformatics techniques to quickly and accurately identify the β-hairpin in enzyme protein for further advanced annotation of structure and function of enzyme. In this work, the proposed method was trained and tested on a non-redundant enzyme β-hairpin database containing 2818 β-hairpins and 1098 non-β-hairpins. With 5-fold cross-validation on the training dataset, the overall accuracy of 90.08% and Matthew’s correlation coefficient (Mcc) of 0.74 were obtained, while on the independent test dataset, the overall accuracy of 88.93% and Mcc of 0.76 were achieved. Furthermore, the method was validated on 845 β-hairpins with ligand binding sites. With 5-fold cross-validation on the training dataset and independent test on the test dataset, the overall accuracies were 85.82% (Mcc of 0.71) and 84.78% (Mcc of 0.70), respectively. With an integration of mRMR feature selection and SVM algorithm, a reasonable high accuracy was achieved, indicating the method to be an effective tool for the further studies of β-hairpins in enzymes structure. Additionally, as a novelty for function prediction of enzymes, β-hairpins with ligand binding sites were predicted. Based on this work, a web server was constructed to predict β-hairpin motifs in enzymes (http://202.207.29.251:8080/).
format Online
Article
Text
id pubmed-5562482
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-55624822017-08-30 Using feature optimization-based support vector machine method to recognize the β-hairpin motifs in enzymes Li, Dongmei Hu, Xiuzhen Liu, Xingxing Feng, Zhenxing Ding, Changjiang Saudi J Biol Sci Original Article β-Hairpins in enzyme, a kind of special protein with catalytic functions, contain many binding sites which are essential for the functions of enzyme. With the increasing number of observed enzyme protein sequences, it is of especial importance to use bioinformatics techniques to quickly and accurately identify the β-hairpin in enzyme protein for further advanced annotation of structure and function of enzyme. In this work, the proposed method was trained and tested on a non-redundant enzyme β-hairpin database containing 2818 β-hairpins and 1098 non-β-hairpins. With 5-fold cross-validation on the training dataset, the overall accuracy of 90.08% and Matthew’s correlation coefficient (Mcc) of 0.74 were obtained, while on the independent test dataset, the overall accuracy of 88.93% and Mcc of 0.76 were achieved. Furthermore, the method was validated on 845 β-hairpins with ligand binding sites. With 5-fold cross-validation on the training dataset and independent test on the test dataset, the overall accuracies were 85.82% (Mcc of 0.71) and 84.78% (Mcc of 0.70), respectively. With an integration of mRMR feature selection and SVM algorithm, a reasonable high accuracy was achieved, indicating the method to be an effective tool for the further studies of β-hairpins in enzymes structure. Additionally, as a novelty for function prediction of enzymes, β-hairpins with ligand binding sites were predicted. Based on this work, a web server was constructed to predict β-hairpin motifs in enzymes (http://202.207.29.251:8080/). Elsevier 2017-09 2016-11-28 /pmc/articles/PMC5562482/ /pubmed/28855832 http://dx.doi.org/10.1016/j.sjbs.2016.11.014 Text en © 2016 The Authors http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Original Article
Li, Dongmei
Hu, Xiuzhen
Liu, Xingxing
Feng, Zhenxing
Ding, Changjiang
Using feature optimization-based support vector machine method to recognize the β-hairpin motifs in enzymes
title Using feature optimization-based support vector machine method to recognize the β-hairpin motifs in enzymes
title_full Using feature optimization-based support vector machine method to recognize the β-hairpin motifs in enzymes
title_fullStr Using feature optimization-based support vector machine method to recognize the β-hairpin motifs in enzymes
title_full_unstemmed Using feature optimization-based support vector machine method to recognize the β-hairpin motifs in enzymes
title_short Using feature optimization-based support vector machine method to recognize the β-hairpin motifs in enzymes
title_sort using feature optimization-based support vector machine method to recognize the β-hairpin motifs in enzymes
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5562482/
https://www.ncbi.nlm.nih.gov/pubmed/28855832
http://dx.doi.org/10.1016/j.sjbs.2016.11.014
work_keys_str_mv AT lidongmei usingfeatureoptimizationbasedsupportvectormachinemethodtorecognizethebhairpinmotifsinenzymes
AT huxiuzhen usingfeatureoptimizationbasedsupportvectormachinemethodtorecognizethebhairpinmotifsinenzymes
AT liuxingxing usingfeatureoptimizationbasedsupportvectormachinemethodtorecognizethebhairpinmotifsinenzymes
AT fengzhenxing usingfeatureoptimizationbasedsupportvectormachinemethodtorecognizethebhairpinmotifsinenzymes
AT dingchangjiang usingfeatureoptimizationbasedsupportvectormachinemethodtorecognizethebhairpinmotifsinenzymes