
The effect of missing data on linkage disequilibrium mapping and haplotype association analysis in the GAW14 simulated datasets

We used our newly developed linkage disequilibrium (LD) plotting software, JLIN, to plot linkage disequilibrium between pairs of single-nucleotide polymorphisms (SNPs) for three chromosomes of the Genetic Analysis Workshop 14 Aipotu simulated population to assess the effect of missing data on LD cal...


Bibliographic Details
Main Authors: McCaskie, Pamela A; Carter, Kim W; McCaskie, Simon R; Palmer, Lyle J
Format: Text
Language: English
Published: BioMed Central 2005
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1866783/
https://www.ncbi.nlm.nih.gov/pubmed/16451612
http://dx.doi.org/10.1186/1471-2156-6-S1-S151
_version_ 1782133327233286144
author McCaskie, Pamela A
Carter, Kim W
McCaskie, Simon R
Palmer, Lyle J
author_facet McCaskie, Pamela A
Carter, Kim W
McCaskie, Simon R
Palmer, Lyle J
author_sort McCaskie, Pamela A
collection PubMed
description We used our newly developed linkage disequilibrium (LD) plotting software, JLIN, to plot linkage disequilibrium between pairs of single-nucleotide polymorphisms (SNPs) for three chromosomes of the Genetic Analysis Workshop 14 Aipotu simulated population to assess the effect of missing data on LD calculations. Our haplotype analysis program, SIMHAP, was used to assess the effect of missing data on haplotype-phenotype association. Genotype data was removed at random, at levels of 1%, 5%, and 10%, and the LD calculations and haplotype association results for these levels of missingness were compared to those for the complete dataset. It was concluded that ignoring individuals with missing data substantially affects the number of regions of LD detected which, in turn, could affect tagging SNPs chosen to generate haplotypes.
format Text
id pubmed-1866783
institution National Center for Biotechnology Information
language English
publishDate 2005
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-1866783 2007-05-11 The effect of missing data on linkage disequilibrium mapping and haplotype association analysis in the GAW14 simulated datasets McCaskie, Pamela A Carter, Kim W McCaskie, Simon R Palmer, Lyle J BMC Genet Proceedings We used our newly developed linkage disequilibrium (LD) plotting software, JLIN, to plot linkage disequilibrium between pairs of single-nucleotide polymorphisms (SNPs) for three chromosomes of the Genetic Analysis Workshop 14 Aipotu simulated population to assess the effect of missing data on LD calculations. Our haplotype analysis program, SIMHAP, was used to assess the effect of missing data on haplotype-phenotype association. Genotype data was removed at random, at levels of 1%, 5%, and 10%, and the LD calculations and haplotype association results for these levels of missingness were compared to those for the complete dataset. It was concluded that ignoring individuals with missing data substantially affects the number of regions of LD detected which, in turn, could affect tagging SNPs chosen to generate haplotypes. BioMed Central 2005-12-30 /pmc/articles/PMC1866783/ /pubmed/16451612 http://dx.doi.org/10.1186/1471-2156-6-S1-S151 Text en Copyright © 2005 McCaskie et al; licensee BioMed Central Ltd http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
McCaskie, Pamela A
Carter, Kim W
McCaskie, Simon R
Palmer, Lyle J
The effect of missing data on linkage disequilibrium mapping and haplotype association analysis in the GAW14 simulated datasets
title The effect of missing data on linkage disequilibrium mapping and haplotype association analysis in the GAW14 simulated datasets
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1866783/
https://www.ncbi.nlm.nih.gov/pubmed/16451612
http://dx.doi.org/10.1186/1471-2156-6-S1-S151