Cargando…
ProFold: Protein Fold Classification with Additional Structural Features and a Novel Ensemble Classifier
Protein fold classification plays an important role in both protein functional analysis and drug design. The number of proteins in PDB is very large, but only a very small part is categorized and stored in the SCOPe database. Therefore, it is necessary to develop an efficient method for protein fold...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi Publishing Corporation
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5021882/ https://www.ncbi.nlm.nih.gov/pubmed/27660761 http://dx.doi.org/10.1155/2016/6802832 |
_version_ | 1782453413387173888 |
---|---|
author | Chen, Daozheng Tian, Xiaoyu Zhou, Bo Gao, Jun |
author_facet | Chen, Daozheng Tian, Xiaoyu Zhou, Bo Gao, Jun |
author_sort | Chen, Daozheng |
collection | PubMed |
description | Protein fold classification plays an important role in both protein functional analysis and drug design. The number of proteins in PDB is very large, but only a very small part is categorized and stored in the SCOPe database. Therefore, it is necessary to develop an efficient method for protein fold classification. In recent years, a variety of classification methods have been used in many protein fold classification studies. In this study, we propose a novel classification method called proFold. We import protein tertiary structure in the period of feature extraction and employ a novel ensemble strategy in the period of classifier training. Compared with existing similar ensemble classifiers using the same widely used dataset (DD-dataset), proFold achieves 76.2% overall accuracy. Another two commonly used datasets, EDD-dataset and TG-dataset, are also tested, of which the accuracies are 93.2% and 94.3%, higher than the existing methods. ProFold is available to the public as a web-server. |
format | Online Article Text |
id | pubmed-5021882 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Hindawi Publishing Corporation |
record_format | MEDLINE/PubMed |
spelling | pubmed-50218822016-09-22 ProFold: Protein Fold Classification with Additional Structural Features and a Novel Ensemble Classifier Chen, Daozheng Tian, Xiaoyu Zhou, Bo Gao, Jun Biomed Res Int Research Article Protein fold classification plays an important role in both protein functional analysis and drug design. The number of proteins in PDB is very large, but only a very small part is categorized and stored in the SCOPe database. Therefore, it is necessary to develop an efficient method for protein fold classification. In recent years, a variety of classification methods have been used in many protein fold classification studies. In this study, we propose a novel classification method called proFold. We import protein tertiary structure in the period of feature extraction and employ a novel ensemble strategy in the period of classifier training. Compared with existing similar ensemble classifiers using the same widely used dataset (DD-dataset), proFold achieves 76.2% overall accuracy. Another two commonly used datasets, EDD-dataset and TG-dataset, are also tested, of which the accuracies are 93.2% and 94.3%, higher than the existing methods. ProFold is available to the public as a web-server. Hindawi Publishing Corporation 2016 2016-08-28 /pmc/articles/PMC5021882/ /pubmed/27660761 http://dx.doi.org/10.1155/2016/6802832 Text en Copyright © 2016 Daozheng Chen et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Chen, Daozheng Tian, Xiaoyu Zhou, Bo Gao, Jun ProFold: Protein Fold Classification with Additional Structural Features and a Novel Ensemble Classifier |
title | ProFold: Protein Fold Classification with Additional Structural Features and a Novel Ensemble Classifier |
title_full | ProFold: Protein Fold Classification with Additional Structural Features and a Novel Ensemble Classifier |
title_fullStr | ProFold: Protein Fold Classification with Additional Structural Features and a Novel Ensemble Classifier |
title_full_unstemmed | ProFold: Protein Fold Classification with Additional Structural Features and a Novel Ensemble Classifier |
title_short | ProFold: Protein Fold Classification with Additional Structural Features and a Novel Ensemble Classifier |
title_sort | profold: protein fold classification with additional structural features and a novel ensemble classifier |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5021882/ https://www.ncbi.nlm.nih.gov/pubmed/27660761 http://dx.doi.org/10.1155/2016/6802832 |
work_keys_str_mv | AT chendaozheng profoldproteinfoldclassificationwithadditionalstructuralfeaturesandanovelensembleclassifier AT tianxiaoyu profoldproteinfoldclassificationwithadditionalstructuralfeaturesandanovelensembleclassifier AT zhoubo profoldproteinfoldclassificationwithadditionalstructuralfeaturesandanovelensembleclassifier AT gaojun profoldproteinfoldclassificationwithadditionalstructuralfeaturesandanovelensembleclassifier |