Cargando…

A two-stage classification approach identifies seven susceptibility genes for a simulated complex disease

The simulated data set of the Genetic Analysis Workshop 15 provided affection status, four quantitative traits, and a covariate. After studying the relationship between these variables, linkage analysis was undertaken. Analyses were performed in the first replicate only and without any prior knowled...

Descripción completa

Detalles Bibliográficos
Autor principal: Pankratz, Nathan
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2367605/
https://www.ncbi.nlm.nih.gov/pubmed/18466528
_version_ 1782154332154626048
author Pankratz, Nathan
author_facet Pankratz, Nathan
author_sort Pankratz, Nathan
collection PubMed
description The simulated data set of the Genetic Analysis Workshop 15 provided affection status, four quantitative traits, and a covariate. After studying the relationship between these variables, linkage analysis was undertaken. Analyses were performed in the first replicate only and without any prior knowledge of the underlying model. In addition to the main effect of the DR locus on chromosome 6, significant linkage was also identified on chromosomes 8, 9, 11, and 18. Notably, the power to detect linkage increased after transforming the skewed and kurtotic IgM and anti-CCP distributions. Moreover, genes on chromosome 11 could not be discerned from noise without the transformation, thus highlighting the need in real life situations for careful examination of the phenotypic data prior to genetic analysis. Significant association with one single-nucleotide polymorphism was identified for the regions on chromosome 11 and 18. Haplotype analyses were attempted for the other regions, but only the underlying variation of the DR locus could be identified. Two methods were then applied to predict classification using the factors identified so far. These methods – logistic regression and multifactor dimensionality reduction (MDR) – performed comparably for this data set. Those affected individuals that were misclassified as unaffected were then used in a genome-wide association analysis to identify additional susceptibility loci. Two additional loci were identified in this fashion, illustrating the usefulness of this two-stage classification approach.
format Text
id pubmed-2367605
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-23676052008-05-06 A two-stage classification approach identifies seven susceptibility genes for a simulated complex disease Pankratz, Nathan BMC Proc Proceedings The simulated data set of the Genetic Analysis Workshop 15 provided affection status, four quantitative traits, and a covariate. After studying the relationship between these variables, linkage analysis was undertaken. Analyses were performed in the first replicate only and without any prior knowledge of the underlying model. In addition to the main effect of the DR locus on chromosome 6, significant linkage was also identified on chromosomes 8, 9, 11, and 18. Notably, the power to detect linkage increased after transforming the skewed and kurtotic IgM and anti-CCP distributions. Moreover, genes on chromosome 11 could not be discerned from noise without the transformation, thus highlighting the need in real life situations for careful examination of the phenotypic data prior to genetic analysis. Significant association with one single-nucleotide polymorphism was identified for the regions on chromosome 11 and 18. Haplotype analyses were attempted for the other regions, but only the underlying variation of the DR locus could be identified. Two methods were then applied to predict classification using the factors identified so far. These methods – logistic regression and multifactor dimensionality reduction (MDR) – performed comparably for this data set. Those affected individuals that were misclassified as unaffected were then used in a genome-wide association analysis to identify additional susceptibility loci. Two additional loci were identified in this fashion, illustrating the usefulness of this two-stage classification approach. BioMed Central 2007-12-18 /pmc/articles/PMC2367605/ /pubmed/18466528 Text en Copyright © 2007 Pankratz; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Pankratz, Nathan
A two-stage classification approach identifies seven susceptibility genes for a simulated complex disease
title A two-stage classification approach identifies seven susceptibility genes for a simulated complex disease
title_full A two-stage classification approach identifies seven susceptibility genes for a simulated complex disease
title_fullStr A two-stage classification approach identifies seven susceptibility genes for a simulated complex disease
title_full_unstemmed A two-stage classification approach identifies seven susceptibility genes for a simulated complex disease
title_short A two-stage classification approach identifies seven susceptibility genes for a simulated complex disease
title_sort two-stage classification approach identifies seven susceptibility genes for a simulated complex disease
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2367605/
https://www.ncbi.nlm.nih.gov/pubmed/18466528
work_keys_str_mv AT pankratznathan atwostageclassificationapproachidentifiessevensusceptibilitygenesforasimulatedcomplexdisease
AT pankratznathan twostageclassificationapproachidentifiessevensusceptibilitygenesforasimulatedcomplexdisease