Cargando…

Identification of Type 2 Diabetes-associated combination of SNPs using Support Vector Machine

BACKGROUND: Type 2 diabetes mellitus (T2D), a metabolic disorder characterized by insulin resistance and relative insulin deficiency, is a complex disease of major public health importance. Its incidence is rapidly increasing in the developed countries. Complex diseases are caused by interactions be...

Descripción completa

Detalles Bibliográficos
Autores principales: Ban, Hyo-Jeong, Heo, Jee Yeon, Oh, Kyung-Soo, Park, Keun-Joon
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2875201/
https://www.ncbi.nlm.nih.gov/pubmed/20416077
http://dx.doi.org/10.1186/1471-2156-11-26
_version_ 1782181549465141248
author Ban, Hyo-Jeong
Heo, Jee Yeon
Oh, Kyung-Soo
Park, Keun-Joon
author_facet Ban, Hyo-Jeong
Heo, Jee Yeon
Oh, Kyung-Soo
Park, Keun-Joon
author_sort Ban, Hyo-Jeong
collection PubMed
description BACKGROUND: Type 2 diabetes mellitus (T2D), a metabolic disorder characterized by insulin resistance and relative insulin deficiency, is a complex disease of major public health importance. Its incidence is rapidly increasing in the developed countries. Complex diseases are caused by interactions between multiple genes and environmental factors. Most association studies aim to identify individual susceptibility single markers using a simple disease model. Recent studies are trying to estimate the effects of multiple genes and multi-locus in genome-wide association. However, estimating the effects of association is very difficult. We aim to assess the rules for classifying diseased and normal subjects by evaluating potential gene-gene interactions in the same or distinct biological pathways. RESULTS: We analyzed the importance of gene-gene interactions in T2D susceptibility by investigating 408 single nucleotide polymorphisms (SNPs) in 87 genes involved in major T2D-related pathways in 462 T2D patients and 456 healthy controls from the Korean cohort studies. We evaluated the support vector machine (SVM) method to differentiate between cases and controls using SNP information in a 10-fold cross-validation test. We achieved a 65.3% prediction rate with a combination of 14 SNPs in 12 genes by using the radial basis function (RBF)-kernel SVM. Similarly, we investigated subpopulation data sets of men and women and identified different SNP combinations with the prediction rates of 70.9% and 70.6%, respectively. As the high-throughput technology for genome-wide SNPs improves, it is likely that a much higher prediction rate with biologically more interesting combination of SNPs can be acquired by using this method. CONCLUSIONS: Support Vector Machine based feature selection method in this research found novel association between combinations of SNPs and T2D in a Korean population.
format Text
id pubmed-2875201
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28752012010-05-25 Identification of Type 2 Diabetes-associated combination of SNPs using Support Vector Machine Ban, Hyo-Jeong Heo, Jee Yeon Oh, Kyung-Soo Park, Keun-Joon BMC Genet Research article BACKGROUND: Type 2 diabetes mellitus (T2D), a metabolic disorder characterized by insulin resistance and relative insulin deficiency, is a complex disease of major public health importance. Its incidence is rapidly increasing in the developed countries. Complex diseases are caused by interactions between multiple genes and environmental factors. Most association studies aim to identify individual susceptibility single markers using a simple disease model. Recent studies are trying to estimate the effects of multiple genes and multi-locus in genome-wide association. However, estimating the effects of association is very difficult. We aim to assess the rules for classifying diseased and normal subjects by evaluating potential gene-gene interactions in the same or distinct biological pathways. RESULTS: We analyzed the importance of gene-gene interactions in T2D susceptibility by investigating 408 single nucleotide polymorphisms (SNPs) in 87 genes involved in major T2D-related pathways in 462 T2D patients and 456 healthy controls from the Korean cohort studies. We evaluated the support vector machine (SVM) method to differentiate between cases and controls using SNP information in a 10-fold cross-validation test. We achieved a 65.3% prediction rate with a combination of 14 SNPs in 12 genes by using the radial basis function (RBF)-kernel SVM. Similarly, we investigated subpopulation data sets of men and women and identified different SNP combinations with the prediction rates of 70.9% and 70.6%, respectively. As the high-throughput technology for genome-wide SNPs improves, it is likely that a much higher prediction rate with biologically more interesting combination of SNPs can be acquired by using this method. CONCLUSIONS: Support Vector Machine based feature selection method in this research found novel association between combinations of SNPs and T2D in a Korean population. BioMed Central 2010-04-23 /pmc/articles/PMC2875201/ /pubmed/20416077 http://dx.doi.org/10.1186/1471-2156-11-26 Text en Copyright ©2010 Ban et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research article
Ban, Hyo-Jeong
Heo, Jee Yeon
Oh, Kyung-Soo
Park, Keun-Joon
Identification of Type 2 Diabetes-associated combination of SNPs using Support Vector Machine
title Identification of Type 2 Diabetes-associated combination of SNPs using Support Vector Machine
title_full Identification of Type 2 Diabetes-associated combination of SNPs using Support Vector Machine
title_fullStr Identification of Type 2 Diabetes-associated combination of SNPs using Support Vector Machine
title_full_unstemmed Identification of Type 2 Diabetes-associated combination of SNPs using Support Vector Machine
title_short Identification of Type 2 Diabetes-associated combination of SNPs using Support Vector Machine
title_sort identification of type 2 diabetes-associated combination of snps using support vector machine
topic Research article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2875201/
https://www.ncbi.nlm.nih.gov/pubmed/20416077
http://dx.doi.org/10.1186/1471-2156-11-26
work_keys_str_mv AT banhyojeong identificationoftype2diabetesassociatedcombinationofsnpsusingsupportvectormachine
AT heojeeyeon identificationoftype2diabetesassociatedcombinationofsnpsusingsupportvectormachine
AT ohkyungsoo identificationoftype2diabetesassociatedcombinationofsnpsusingsupportvectormachine
AT parkkeunjoon identificationoftype2diabetesassociatedcombinationofsnpsusingsupportvectormachine