Cargando…

ProFold: Protein Fold Classification with Additional Structural Features and a Novel Ensemble Classifier

Protein fold classification plays an important role in both protein functional analysis and drug design. The number of proteins in PDB is very large, but only a very small part is categorized and stored in the SCOPe database. Therefore, it is necessary to develop an efficient method for protein fold...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Daozheng, Tian, Xiaoyu, Zhou, Bo, Gao, Jun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5021882/
https://www.ncbi.nlm.nih.gov/pubmed/27660761
http://dx.doi.org/10.1155/2016/6802832
_version_ 1782453413387173888
author Chen, Daozheng
Tian, Xiaoyu
Zhou, Bo
Gao, Jun
author_facet Chen, Daozheng
Tian, Xiaoyu
Zhou, Bo
Gao, Jun
author_sort Chen, Daozheng
collection PubMed
description Protein fold classification plays an important role in both protein functional analysis and drug design. The number of proteins in PDB is very large, but only a very small part is categorized and stored in the SCOPe database. Therefore, it is necessary to develop an efficient method for protein fold classification. In recent years, a variety of classification methods have been used in many protein fold classification studies. In this study, we propose a novel classification method called proFold. We import protein tertiary structure in the period of feature extraction and employ a novel ensemble strategy in the period of classifier training. Compared with existing similar ensemble classifiers using the same widely used dataset (DD-dataset), proFold achieves 76.2% overall accuracy. Another two commonly used datasets, EDD-dataset and TG-dataset, are also tested, of which the accuracies are 93.2% and 94.3%, higher than the existing methods. ProFold is available to the public as a web-server.
format Online
Article
Text
id pubmed-5021882
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-50218822016-09-22 ProFold: Protein Fold Classification with Additional Structural Features and a Novel Ensemble Classifier Chen, Daozheng Tian, Xiaoyu Zhou, Bo Gao, Jun Biomed Res Int Research Article Protein fold classification plays an important role in both protein functional analysis and drug design. The number of proteins in PDB is very large, but only a very small part is categorized and stored in the SCOPe database. Therefore, it is necessary to develop an efficient method for protein fold classification. In recent years, a variety of classification methods have been used in many protein fold classification studies. In this study, we propose a novel classification method called proFold. We import protein tertiary structure in the period of feature extraction and employ a novel ensemble strategy in the period of classifier training. Compared with existing similar ensemble classifiers using the same widely used dataset (DD-dataset), proFold achieves 76.2% overall accuracy. Another two commonly used datasets, EDD-dataset and TG-dataset, are also tested, of which the accuracies are 93.2% and 94.3%, higher than the existing methods. ProFold is available to the public as a web-server. Hindawi Publishing Corporation 2016 2016-08-28 /pmc/articles/PMC5021882/ /pubmed/27660761 http://dx.doi.org/10.1155/2016/6802832 Text en Copyright © 2016 Daozheng Chen et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Chen, Daozheng
Tian, Xiaoyu
Zhou, Bo
Gao, Jun
ProFold: Protein Fold Classification with Additional Structural Features and a Novel Ensemble Classifier
title ProFold: Protein Fold Classification with Additional Structural Features and a Novel Ensemble Classifier
title_full ProFold: Protein Fold Classification with Additional Structural Features and a Novel Ensemble Classifier
title_fullStr ProFold: Protein Fold Classification with Additional Structural Features and a Novel Ensemble Classifier
title_full_unstemmed ProFold: Protein Fold Classification with Additional Structural Features and a Novel Ensemble Classifier
title_short ProFold: Protein Fold Classification with Additional Structural Features and a Novel Ensemble Classifier
title_sort profold: protein fold classification with additional structural features and a novel ensemble classifier
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5021882/
https://www.ncbi.nlm.nih.gov/pubmed/27660761
http://dx.doi.org/10.1155/2016/6802832
work_keys_str_mv AT chendaozheng profoldproteinfoldclassificationwithadditionalstructuralfeaturesandanovelensembleclassifier
AT tianxiaoyu profoldproteinfoldclassificationwithadditionalstructuralfeaturesandanovelensembleclassifier
AT zhoubo profoldproteinfoldclassificationwithadditionalstructuralfeaturesandanovelensembleclassifier
AT gaojun profoldproteinfoldclassificationwithadditionalstructuralfeaturesandanovelensembleclassifier