Cargando…

Classification of G-protein coupled receptors based on support vector machine with maximum relevance minimum redundancy and genetic algorithm

BACKGROUND: Because a priori knowledge about function of G protein-coupled receptors (GPCRs) can provide useful information to pharmaceutical research, the determination of their function is a quite meaningful topic in protein science. However, with the rapid increase of GPCRs sequences entering int...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Zhanchao, Zhou, Xuan, Dai, Zong, Zou, Xiaoyong
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2905366/
https://www.ncbi.nlm.nih.gov/pubmed/20550715
http://dx.doi.org/10.1186/1471-2105-11-325
_version_ 1782183955343081472
author Li, Zhanchao
Zhou, Xuan
Dai, Zong
Zou, Xiaoyong
author_facet Li, Zhanchao
Zhou, Xuan
Dai, Zong
Zou, Xiaoyong
author_sort Li, Zhanchao
collection PubMed
description BACKGROUND: Because a priori knowledge about function of G protein-coupled receptors (GPCRs) can provide useful information to pharmaceutical research, the determination of their function is a quite meaningful topic in protein science. However, with the rapid increase of GPCRs sequences entering into databanks, the gap between the number of known sequence and the number of known function is widening rapidly, and it is both time-consuming and expensive to determine their function based only on experimental techniques. Therefore, it is vitally significant to develop a computational method for quick and accurate classification of GPCRs. RESULTS: In this study, a novel three-layer predictor based on support vector machine (SVM) and feature selection is developed for predicting and classifying GPCRs directly from amino acid sequence data. The maximum relevance minimum redundancy (mRMR) is applied to pre-evaluate features with discriminative information while genetic algorithm (GA) is utilized to find the optimized feature subsets. SVM is used for the construction of classification models. The overall accuracy with three-layer predictor at levels of superfamily, family and subfamily are obtained by cross-validation test on two non-redundant dataset. The results are about 0.5% to 16% higher than those of GPCR-CA and GPCRPred. CONCLUSION: The results with high success rates indicate that the proposed predictor is a useful automated tool in predicting GPCRs. GPCR-SVMFS, a corresponding executable program for GPCRs prediction and classification, can be acquired freely on request from the authors.
format Text
id pubmed-2905366
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-29053662010-07-17 Classification of G-protein coupled receptors based on support vector machine with maximum relevance minimum redundancy and genetic algorithm Li, Zhanchao Zhou, Xuan Dai, Zong Zou, Xiaoyong BMC Bioinformatics Research Article BACKGROUND: Because a priori knowledge about function of G protein-coupled receptors (GPCRs) can provide useful information to pharmaceutical research, the determination of their function is a quite meaningful topic in protein science. However, with the rapid increase of GPCRs sequences entering into databanks, the gap between the number of known sequence and the number of known function is widening rapidly, and it is both time-consuming and expensive to determine their function based only on experimental techniques. Therefore, it is vitally significant to develop a computational method for quick and accurate classification of GPCRs. RESULTS: In this study, a novel three-layer predictor based on support vector machine (SVM) and feature selection is developed for predicting and classifying GPCRs directly from amino acid sequence data. The maximum relevance minimum redundancy (mRMR) is applied to pre-evaluate features with discriminative information while genetic algorithm (GA) is utilized to find the optimized feature subsets. SVM is used for the construction of classification models. The overall accuracy with three-layer predictor at levels of superfamily, family and subfamily are obtained by cross-validation test on two non-redundant dataset. The results are about 0.5% to 16% higher than those of GPCR-CA and GPCRPred. CONCLUSION: The results with high success rates indicate that the proposed predictor is a useful automated tool in predicting GPCRs. GPCR-SVMFS, a corresponding executable program for GPCRs prediction and classification, can be acquired freely on request from the authors. BioMed Central 2010-06-16 /pmc/articles/PMC2905366/ /pubmed/20550715 http://dx.doi.org/10.1186/1471-2105-11-325 Text en Copyright ©2010 Li et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Li, Zhanchao
Zhou, Xuan
Dai, Zong
Zou, Xiaoyong
Classification of G-protein coupled receptors based on support vector machine with maximum relevance minimum redundancy and genetic algorithm
title Classification of G-protein coupled receptors based on support vector machine with maximum relevance minimum redundancy and genetic algorithm
title_full Classification of G-protein coupled receptors based on support vector machine with maximum relevance minimum redundancy and genetic algorithm
title_fullStr Classification of G-protein coupled receptors based on support vector machine with maximum relevance minimum redundancy and genetic algorithm
title_full_unstemmed Classification of G-protein coupled receptors based on support vector machine with maximum relevance minimum redundancy and genetic algorithm
title_short Classification of G-protein coupled receptors based on support vector machine with maximum relevance minimum redundancy and genetic algorithm
title_sort classification of g-protein coupled receptors based on support vector machine with maximum relevance minimum redundancy and genetic algorithm
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2905366/
https://www.ncbi.nlm.nih.gov/pubmed/20550715
http://dx.doi.org/10.1186/1471-2105-11-325
work_keys_str_mv AT lizhanchao classificationofgproteincoupledreceptorsbasedonsupportvectormachinewithmaximumrelevanceminimumredundancyandgeneticalgorithm
AT zhouxuan classificationofgproteincoupledreceptorsbasedonsupportvectormachinewithmaximumrelevanceminimumredundancyandgeneticalgorithm
AT daizong classificationofgproteincoupledreceptorsbasedonsupportvectormachinewithmaximumrelevanceminimumredundancyandgeneticalgorithm
AT zouxiaoyong classificationofgproteincoupledreceptorsbasedonsupportvectormachinewithmaximumrelevanceminimumredundancyandgeneticalgorithm