Cargando…

Decision Forest Analysis of 61 Single Nucleotide Polymorphisms in a Case-Control Study of Esophageal Cancer; a novel method

BACKGROUND: Systematic evaluation and study of single nucleotide polymorphisms (SNPs) made possible by high throughput genotyping technologies and bioinformatics promises to provide breakthroughs in the understanding of complex diseases. Understanding how the millions of SNPs in the human genome are...

Descripción completa

Detalles Bibliográficos
Autores principales: Xie, Qian, Ratnasinghe, Luke D, Hong, Huixiao, Perkins, Roger, Tang, Ze-Zhong, Hu, Nan, Taylor, Philip R, Tong, Weida
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2005
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1637030/
https://www.ncbi.nlm.nih.gov/pubmed/16026601
http://dx.doi.org/10.1186/1471-2105-6-S2-S4
_version_ 1782130780485451776
author Xie, Qian
Ratnasinghe, Luke D
Hong, Huixiao
Perkins, Roger
Tang, Ze-Zhong
Hu, Nan
Taylor, Philip R
Tong, Weida
author_facet Xie, Qian
Ratnasinghe, Luke D
Hong, Huixiao
Perkins, Roger
Tang, Ze-Zhong
Hu, Nan
Taylor, Philip R
Tong, Weida
author_sort Xie, Qian
collection PubMed
description BACKGROUND: Systematic evaluation and study of single nucleotide polymorphisms (SNPs) made possible by high throughput genotyping technologies and bioinformatics promises to provide breakthroughs in the understanding of complex diseases. Understanding how the millions of SNPs in the human genome are involved in conferring susceptibility or resistance to disease, or in rendering a drug efficacious or toxic in the individual is a major goal of the relatively new fields of pharmacogenomics. Esophageal squamous cell carcinoma is a high-mortality cancer with complex etiology and progression involving both genetic and environmental factors. We examined the association between esophageal cancer risk and patterns of 61 SNPs in a case-control study for a population from Shanxi Province in North Central China that has among the highest rates of esophageal squamous cell carcinoma in the world. METHODS: High-throughput Masscode mass spectrometry genotyping was done on genomic DNA from 574 individuals (394 cases and 180 age-frequency matched controls). SNPs were chosen from among genes involving DNA repair enzymes, and Phase I and Phase II enzymes. We developed a novel adaptation of the Decision Forest pattern recognition method named Decision Forest for SNPs (DF-SNPs). The method was designated to analyze the SNP data. RESULTS: The classifier in separating the cases from the controls developed with DF-SNPs gave concordance, sensitivity and specificity, of 94.7%, 99.0% and 85.1%, respectively; suggesting its usefulness for hypothesizing what SNPs or combinations of SNPs could be involved in susceptibility to esophageal cancer. Importantly, the DF-SNPs algorithm incorporated a randomization test for assessing the relevance (or importance) of individual SNPs, SNP types (Homozygous common, heterozygous and homozygous variant) and patterns of SNP types (SNP patterns) that differentiate cases from controls. For example, we found that the different genotypes of SNP GADD45B E1122 are all associated with cancer risk. CONCLUSION: The DF-SNPs method can be used to differentiate esophageal squamous cell carcinoma cases from controls based on individual SNPs, SNP types and SNP patterns. The method could be useful to identify potential biomarkers from the SNP data and complement existing methods for genotype analyses.
format Text
id pubmed-1637030
institution National Center for Biotechnology Information
language English
publishDate 2005
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-16370302006-11-16 Decision Forest Analysis of 61 Single Nucleotide Polymorphisms in a Case-Control Study of Esophageal Cancer; a novel method Xie, Qian Ratnasinghe, Luke D Hong, Huixiao Perkins, Roger Tang, Ze-Zhong Hu, Nan Taylor, Philip R Tong, Weida BMC Bioinformatics Proceedings BACKGROUND: Systematic evaluation and study of single nucleotide polymorphisms (SNPs) made possible by high throughput genotyping technologies and bioinformatics promises to provide breakthroughs in the understanding of complex diseases. Understanding how the millions of SNPs in the human genome are involved in conferring susceptibility or resistance to disease, or in rendering a drug efficacious or toxic in the individual is a major goal of the relatively new fields of pharmacogenomics. Esophageal squamous cell carcinoma is a high-mortality cancer with complex etiology and progression involving both genetic and environmental factors. We examined the association between esophageal cancer risk and patterns of 61 SNPs in a case-control study for a population from Shanxi Province in North Central China that has among the highest rates of esophageal squamous cell carcinoma in the world. METHODS: High-throughput Masscode mass spectrometry genotyping was done on genomic DNA from 574 individuals (394 cases and 180 age-frequency matched controls). SNPs were chosen from among genes involving DNA repair enzymes, and Phase I and Phase II enzymes. We developed a novel adaptation of the Decision Forest pattern recognition method named Decision Forest for SNPs (DF-SNPs). The method was designated to analyze the SNP data. RESULTS: The classifier in separating the cases from the controls developed with DF-SNPs gave concordance, sensitivity and specificity, of 94.7%, 99.0% and 85.1%, respectively; suggesting its usefulness for hypothesizing what SNPs or combinations of SNPs could be involved in susceptibility to esophageal cancer. Importantly, the DF-SNPs algorithm incorporated a randomization test for assessing the relevance (or importance) of individual SNPs, SNP types (Homozygous common, heterozygous and homozygous variant) and patterns of SNP types (SNP patterns) that differentiate cases from controls. For example, we found that the different genotypes of SNP GADD45B E1122 are all associated with cancer risk. CONCLUSION: The DF-SNPs method can be used to differentiate esophageal squamous cell carcinoma cases from controls based on individual SNPs, SNP types and SNP patterns. The method could be useful to identify potential biomarkers from the SNP data and complement existing methods for genotype analyses. BioMed Central 2005-07-15 /pmc/articles/PMC1637030/ /pubmed/16026601 http://dx.doi.org/10.1186/1471-2105-6-S2-S4 Text en Copyright © 2006 Xie et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Xie, Qian
Ratnasinghe, Luke D
Hong, Huixiao
Perkins, Roger
Tang, Ze-Zhong
Hu, Nan
Taylor, Philip R
Tong, Weida
Decision Forest Analysis of 61 Single Nucleotide Polymorphisms in a Case-Control Study of Esophageal Cancer; a novel method
title Decision Forest Analysis of 61 Single Nucleotide Polymorphisms in a Case-Control Study of Esophageal Cancer; a novel method
title_full Decision Forest Analysis of 61 Single Nucleotide Polymorphisms in a Case-Control Study of Esophageal Cancer; a novel method
title_fullStr Decision Forest Analysis of 61 Single Nucleotide Polymorphisms in a Case-Control Study of Esophageal Cancer; a novel method
title_full_unstemmed Decision Forest Analysis of 61 Single Nucleotide Polymorphisms in a Case-Control Study of Esophageal Cancer; a novel method
title_short Decision Forest Analysis of 61 Single Nucleotide Polymorphisms in a Case-Control Study of Esophageal Cancer; a novel method
title_sort decision forest analysis of 61 single nucleotide polymorphisms in a case-control study of esophageal cancer; a novel method
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1637030/
https://www.ncbi.nlm.nih.gov/pubmed/16026601
http://dx.doi.org/10.1186/1471-2105-6-S2-S4
work_keys_str_mv AT xieqian decisionforestanalysisof61singlenucleotidepolymorphismsinacasecontrolstudyofesophagealcanceranovelmethod
AT ratnasingheluked decisionforestanalysisof61singlenucleotidepolymorphismsinacasecontrolstudyofesophagealcanceranovelmethod
AT honghuixiao decisionforestanalysisof61singlenucleotidepolymorphismsinacasecontrolstudyofesophagealcanceranovelmethod
AT perkinsroger decisionforestanalysisof61singlenucleotidepolymorphismsinacasecontrolstudyofesophagealcanceranovelmethod
AT tangzezhong decisionforestanalysisof61singlenucleotidepolymorphismsinacasecontrolstudyofesophagealcanceranovelmethod
AT hunan decisionforestanalysisof61singlenucleotidepolymorphismsinacasecontrolstudyofesophagealcanceranovelmethod
AT taylorphilipr decisionforestanalysisof61singlenucleotidepolymorphismsinacasecontrolstudyofesophagealcanceranovelmethod
AT tongweida decisionforestanalysisof61singlenucleotidepolymorphismsinacasecontrolstudyofesophagealcanceranovelmethod