Cargando…
Classification of G-protein coupled receptors based on support vector machine with maximum relevance minimum redundancy and genetic algorithm
BACKGROUND: Because a priori knowledge about function of G protein-coupled receptors (GPCRs) can provide useful information to pharmaceutical research, the determination of their function is a quite meaningful topic in protein science. However, with the rapid increase of GPCRs sequences entering int...
Autores principales: | , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2905366/ https://www.ncbi.nlm.nih.gov/pubmed/20550715 http://dx.doi.org/10.1186/1471-2105-11-325 |
_version_ | 1782183955343081472 |
---|---|
author | Li, Zhanchao Zhou, Xuan Dai, Zong Zou, Xiaoyong |
author_facet | Li, Zhanchao Zhou, Xuan Dai, Zong Zou, Xiaoyong |
author_sort | Li, Zhanchao |
collection | PubMed |
description | BACKGROUND: Because a priori knowledge about function of G protein-coupled receptors (GPCRs) can provide useful information to pharmaceutical research, the determination of their function is a quite meaningful topic in protein science. However, with the rapid increase of GPCRs sequences entering into databanks, the gap between the number of known sequence and the number of known function is widening rapidly, and it is both time-consuming and expensive to determine their function based only on experimental techniques. Therefore, it is vitally significant to develop a computational method for quick and accurate classification of GPCRs. RESULTS: In this study, a novel three-layer predictor based on support vector machine (SVM) and feature selection is developed for predicting and classifying GPCRs directly from amino acid sequence data. The maximum relevance minimum redundancy (mRMR) is applied to pre-evaluate features with discriminative information while genetic algorithm (GA) is utilized to find the optimized feature subsets. SVM is used for the construction of classification models. The overall accuracy with three-layer predictor at levels of superfamily, family and subfamily are obtained by cross-validation test on two non-redundant dataset. The results are about 0.5% to 16% higher than those of GPCR-CA and GPCRPred. CONCLUSION: The results with high success rates indicate that the proposed predictor is a useful automated tool in predicting GPCRs. GPCR-SVMFS, a corresponding executable program for GPCRs prediction and classification, can be acquired freely on request from the authors. |
format | Text |
id | pubmed-2905366 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-29053662010-07-17 Classification of G-protein coupled receptors based on support vector machine with maximum relevance minimum redundancy and genetic algorithm Li, Zhanchao Zhou, Xuan Dai, Zong Zou, Xiaoyong BMC Bioinformatics Research Article BACKGROUND: Because a priori knowledge about function of G protein-coupled receptors (GPCRs) can provide useful information to pharmaceutical research, the determination of their function is a quite meaningful topic in protein science. However, with the rapid increase of GPCRs sequences entering into databanks, the gap between the number of known sequence and the number of known function is widening rapidly, and it is both time-consuming and expensive to determine their function based only on experimental techniques. Therefore, it is vitally significant to develop a computational method for quick and accurate classification of GPCRs. RESULTS: In this study, a novel three-layer predictor based on support vector machine (SVM) and feature selection is developed for predicting and classifying GPCRs directly from amino acid sequence data. The maximum relevance minimum redundancy (mRMR) is applied to pre-evaluate features with discriminative information while genetic algorithm (GA) is utilized to find the optimized feature subsets. SVM is used for the construction of classification models. The overall accuracy with three-layer predictor at levels of superfamily, family and subfamily are obtained by cross-validation test on two non-redundant dataset. The results are about 0.5% to 16% higher than those of GPCR-CA and GPCRPred. CONCLUSION: The results with high success rates indicate that the proposed predictor is a useful automated tool in predicting GPCRs. GPCR-SVMFS, a corresponding executable program for GPCRs prediction and classification, can be acquired freely on request from the authors. BioMed Central 2010-06-16 /pmc/articles/PMC2905366/ /pubmed/20550715 http://dx.doi.org/10.1186/1471-2105-11-325 Text en Copyright ©2010 Li et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Li, Zhanchao Zhou, Xuan Dai, Zong Zou, Xiaoyong Classification of G-protein coupled receptors based on support vector machine with maximum relevance minimum redundancy and genetic algorithm |
title | Classification of G-protein coupled receptors based on support vector machine with maximum relevance minimum redundancy and genetic algorithm |
title_full | Classification of G-protein coupled receptors based on support vector machine with maximum relevance minimum redundancy and genetic algorithm |
title_fullStr | Classification of G-protein coupled receptors based on support vector machine with maximum relevance minimum redundancy and genetic algorithm |
title_full_unstemmed | Classification of G-protein coupled receptors based on support vector machine with maximum relevance minimum redundancy and genetic algorithm |
title_short | Classification of G-protein coupled receptors based on support vector machine with maximum relevance minimum redundancy and genetic algorithm |
title_sort | classification of g-protein coupled receptors based on support vector machine with maximum relevance minimum redundancy and genetic algorithm |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2905366/ https://www.ncbi.nlm.nih.gov/pubmed/20550715 http://dx.doi.org/10.1186/1471-2105-11-325 |
work_keys_str_mv | AT lizhanchao classificationofgproteincoupledreceptorsbasedonsupportvectormachinewithmaximumrelevanceminimumredundancyandgeneticalgorithm AT zhouxuan classificationofgproteincoupledreceptorsbasedonsupportvectormachinewithmaximumrelevanceminimumredundancyandgeneticalgorithm AT daizong classificationofgproteincoupledreceptorsbasedonsupportvectormachinewithmaximumrelevanceminimumredundancyandgeneticalgorithm AT zouxiaoyong classificationofgproteincoupledreceptorsbasedonsupportvectormachinewithmaximumrelevanceminimumredundancyandgeneticalgorithm |