Cargando…
Decision Forest Analysis of 61 Single Nucleotide Polymorphisms in a Case-Control Study of Esophageal Cancer; a novel method
BACKGROUND: Systematic evaluation and study of single nucleotide polymorphisms (SNPs) made possible by high throughput genotyping technologies and bioinformatics promises to provide breakthroughs in the understanding of complex diseases. Understanding how the millions of SNPs in the human genome are...
Autores principales: | , , , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2005
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1637030/ https://www.ncbi.nlm.nih.gov/pubmed/16026601 http://dx.doi.org/10.1186/1471-2105-6-S2-S4 |
_version_ | 1782130780485451776 |
---|---|
author | Xie, Qian Ratnasinghe, Luke D Hong, Huixiao Perkins, Roger Tang, Ze-Zhong Hu, Nan Taylor, Philip R Tong, Weida |
author_facet | Xie, Qian Ratnasinghe, Luke D Hong, Huixiao Perkins, Roger Tang, Ze-Zhong Hu, Nan Taylor, Philip R Tong, Weida |
author_sort | Xie, Qian |
collection | PubMed |
description | BACKGROUND: Systematic evaluation and study of single nucleotide polymorphisms (SNPs) made possible by high throughput genotyping technologies and bioinformatics promises to provide breakthroughs in the understanding of complex diseases. Understanding how the millions of SNPs in the human genome are involved in conferring susceptibility or resistance to disease, or in rendering a drug efficacious or toxic in the individual is a major goal of the relatively new fields of pharmacogenomics. Esophageal squamous cell carcinoma is a high-mortality cancer with complex etiology and progression involving both genetic and environmental factors. We examined the association between esophageal cancer risk and patterns of 61 SNPs in a case-control study for a population from Shanxi Province in North Central China that has among the highest rates of esophageal squamous cell carcinoma in the world. METHODS: High-throughput Masscode mass spectrometry genotyping was done on genomic DNA from 574 individuals (394 cases and 180 age-frequency matched controls). SNPs were chosen from among genes involving DNA repair enzymes, and Phase I and Phase II enzymes. We developed a novel adaptation of the Decision Forest pattern recognition method named Decision Forest for SNPs (DF-SNPs). The method was designated to analyze the SNP data. RESULTS: The classifier in separating the cases from the controls developed with DF-SNPs gave concordance, sensitivity and specificity, of 94.7%, 99.0% and 85.1%, respectively; suggesting its usefulness for hypothesizing what SNPs or combinations of SNPs could be involved in susceptibility to esophageal cancer. Importantly, the DF-SNPs algorithm incorporated a randomization test for assessing the relevance (or importance) of individual SNPs, SNP types (Homozygous common, heterozygous and homozygous variant) and patterns of SNP types (SNP patterns) that differentiate cases from controls. For example, we found that the different genotypes of SNP GADD45B E1122 are all associated with cancer risk. CONCLUSION: The DF-SNPs method can be used to differentiate esophageal squamous cell carcinoma cases from controls based on individual SNPs, SNP types and SNP patterns. The method could be useful to identify potential biomarkers from the SNP data and complement existing methods for genotype analyses. |
format | Text |
id | pubmed-1637030 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2005 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-16370302006-11-16 Decision Forest Analysis of 61 Single Nucleotide Polymorphisms in a Case-Control Study of Esophageal Cancer; a novel method Xie, Qian Ratnasinghe, Luke D Hong, Huixiao Perkins, Roger Tang, Ze-Zhong Hu, Nan Taylor, Philip R Tong, Weida BMC Bioinformatics Proceedings BACKGROUND: Systematic evaluation and study of single nucleotide polymorphisms (SNPs) made possible by high throughput genotyping technologies and bioinformatics promises to provide breakthroughs in the understanding of complex diseases. Understanding how the millions of SNPs in the human genome are involved in conferring susceptibility or resistance to disease, or in rendering a drug efficacious or toxic in the individual is a major goal of the relatively new fields of pharmacogenomics. Esophageal squamous cell carcinoma is a high-mortality cancer with complex etiology and progression involving both genetic and environmental factors. We examined the association between esophageal cancer risk and patterns of 61 SNPs in a case-control study for a population from Shanxi Province in North Central China that has among the highest rates of esophageal squamous cell carcinoma in the world. METHODS: High-throughput Masscode mass spectrometry genotyping was done on genomic DNA from 574 individuals (394 cases and 180 age-frequency matched controls). SNPs were chosen from among genes involving DNA repair enzymes, and Phase I and Phase II enzymes. We developed a novel adaptation of the Decision Forest pattern recognition method named Decision Forest for SNPs (DF-SNPs). The method was designated to analyze the SNP data. RESULTS: The classifier in separating the cases from the controls developed with DF-SNPs gave concordance, sensitivity and specificity, of 94.7%, 99.0% and 85.1%, respectively; suggesting its usefulness for hypothesizing what SNPs or combinations of SNPs could be involved in susceptibility to esophageal cancer. Importantly, the DF-SNPs algorithm incorporated a randomization test for assessing the relevance (or importance) of individual SNPs, SNP types (Homozygous common, heterozygous and homozygous variant) and patterns of SNP types (SNP patterns) that differentiate cases from controls. For example, we found that the different genotypes of SNP GADD45B E1122 are all associated with cancer risk. CONCLUSION: The DF-SNPs method can be used to differentiate esophageal squamous cell carcinoma cases from controls based on individual SNPs, SNP types and SNP patterns. The method could be useful to identify potential biomarkers from the SNP data and complement existing methods for genotype analyses. BioMed Central 2005-07-15 /pmc/articles/PMC1637030/ /pubmed/16026601 http://dx.doi.org/10.1186/1471-2105-6-S2-S4 Text en Copyright © 2006 Xie et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Proceedings Xie, Qian Ratnasinghe, Luke D Hong, Huixiao Perkins, Roger Tang, Ze-Zhong Hu, Nan Taylor, Philip R Tong, Weida Decision Forest Analysis of 61 Single Nucleotide Polymorphisms in a Case-Control Study of Esophageal Cancer; a novel method |
title | Decision Forest Analysis of 61 Single Nucleotide Polymorphisms in a Case-Control Study of Esophageal Cancer; a novel method |
title_full | Decision Forest Analysis of 61 Single Nucleotide Polymorphisms in a Case-Control Study of Esophageal Cancer; a novel method |
title_fullStr | Decision Forest Analysis of 61 Single Nucleotide Polymorphisms in a Case-Control Study of Esophageal Cancer; a novel method |
title_full_unstemmed | Decision Forest Analysis of 61 Single Nucleotide Polymorphisms in a Case-Control Study of Esophageal Cancer; a novel method |
title_short | Decision Forest Analysis of 61 Single Nucleotide Polymorphisms in a Case-Control Study of Esophageal Cancer; a novel method |
title_sort | decision forest analysis of 61 single nucleotide polymorphisms in a case-control study of esophageal cancer; a novel method |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1637030/ https://www.ncbi.nlm.nih.gov/pubmed/16026601 http://dx.doi.org/10.1186/1471-2105-6-S2-S4 |
work_keys_str_mv | AT xieqian decisionforestanalysisof61singlenucleotidepolymorphismsinacasecontrolstudyofesophagealcanceranovelmethod AT ratnasingheluked decisionforestanalysisof61singlenucleotidepolymorphismsinacasecontrolstudyofesophagealcanceranovelmethod AT honghuixiao decisionforestanalysisof61singlenucleotidepolymorphismsinacasecontrolstudyofesophagealcanceranovelmethod AT perkinsroger decisionforestanalysisof61singlenucleotidepolymorphismsinacasecontrolstudyofesophagealcanceranovelmethod AT tangzezhong decisionforestanalysisof61singlenucleotidepolymorphismsinacasecontrolstudyofesophagealcanceranovelmethod AT hunan decisionforestanalysisof61singlenucleotidepolymorphismsinacasecontrolstudyofesophagealcanceranovelmethod AT taylorphilipr decisionforestanalysisof61singlenucleotidepolymorphismsinacasecontrolstudyofesophagealcanceranovelmethod AT tongweida decisionforestanalysisof61singlenucleotidepolymorphismsinacasecontrolstudyofesophagealcanceranovelmethod |