Cargando…
A two-stage classification approach identifies seven susceptibility genes for a simulated complex disease
The simulated data set of the Genetic Analysis Workshop 15 provided affection status, four quantitative traits, and a covariate. After studying the relationship between these variables, linkage analysis was undertaken. Analyses were performed in the first replicate only and without any prior knowled...
Autor principal: | |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2007
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2367605/ https://www.ncbi.nlm.nih.gov/pubmed/18466528 |
_version_ | 1782154332154626048 |
---|---|
author | Pankratz, Nathan |
author_facet | Pankratz, Nathan |
author_sort | Pankratz, Nathan |
collection | PubMed |
description | The simulated data set of the Genetic Analysis Workshop 15 provided affection status, four quantitative traits, and a covariate. After studying the relationship between these variables, linkage analysis was undertaken. Analyses were performed in the first replicate only and without any prior knowledge of the underlying model. In addition to the main effect of the DR locus on chromosome 6, significant linkage was also identified on chromosomes 8, 9, 11, and 18. Notably, the power to detect linkage increased after transforming the skewed and kurtotic IgM and anti-CCP distributions. Moreover, genes on chromosome 11 could not be discerned from noise without the transformation, thus highlighting the need in real life situations for careful examination of the phenotypic data prior to genetic analysis. Significant association with one single-nucleotide polymorphism was identified for the regions on chromosome 11 and 18. Haplotype analyses were attempted for the other regions, but only the underlying variation of the DR locus could be identified. Two methods were then applied to predict classification using the factors identified so far. These methods – logistic regression and multifactor dimensionality reduction (MDR) – performed comparably for this data set. Those affected individuals that were misclassified as unaffected were then used in a genome-wide association analysis to identify additional susceptibility loci. Two additional loci were identified in this fashion, illustrating the usefulness of this two-stage classification approach. |
format | Text |
id | pubmed-2367605 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2007 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-23676052008-05-06 A two-stage classification approach identifies seven susceptibility genes for a simulated complex disease Pankratz, Nathan BMC Proc Proceedings The simulated data set of the Genetic Analysis Workshop 15 provided affection status, four quantitative traits, and a covariate. After studying the relationship between these variables, linkage analysis was undertaken. Analyses were performed in the first replicate only and without any prior knowledge of the underlying model. In addition to the main effect of the DR locus on chromosome 6, significant linkage was also identified on chromosomes 8, 9, 11, and 18. Notably, the power to detect linkage increased after transforming the skewed and kurtotic IgM and anti-CCP distributions. Moreover, genes on chromosome 11 could not be discerned from noise without the transformation, thus highlighting the need in real life situations for careful examination of the phenotypic data prior to genetic analysis. Significant association with one single-nucleotide polymorphism was identified for the regions on chromosome 11 and 18. Haplotype analyses were attempted for the other regions, but only the underlying variation of the DR locus could be identified. Two methods were then applied to predict classification using the factors identified so far. These methods – logistic regression and multifactor dimensionality reduction (MDR) – performed comparably for this data set. Those affected individuals that were misclassified as unaffected were then used in a genome-wide association analysis to identify additional susceptibility loci. Two additional loci were identified in this fashion, illustrating the usefulness of this two-stage classification approach. BioMed Central 2007-12-18 /pmc/articles/PMC2367605/ /pubmed/18466528 Text en Copyright © 2007 Pankratz; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Proceedings Pankratz, Nathan A two-stage classification approach identifies seven susceptibility genes for a simulated complex disease |
title | A two-stage classification approach identifies seven susceptibility genes for a simulated complex disease |
title_full | A two-stage classification approach identifies seven susceptibility genes for a simulated complex disease |
title_fullStr | A two-stage classification approach identifies seven susceptibility genes for a simulated complex disease |
title_full_unstemmed | A two-stage classification approach identifies seven susceptibility genes for a simulated complex disease |
title_short | A two-stage classification approach identifies seven susceptibility genes for a simulated complex disease |
title_sort | two-stage classification approach identifies seven susceptibility genes for a simulated complex disease |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2367605/ https://www.ncbi.nlm.nih.gov/pubmed/18466528 |
work_keys_str_mv | AT pankratznathan atwostageclassificationapproachidentifiessevensusceptibilitygenesforasimulatedcomplexdisease AT pankratznathan twostageclassificationapproachidentifiessevensusceptibilitygenesforasimulatedcomplexdisease |