Cargando…

Precision and type I error rate in the presence of genotype errors and missing parental data: a comparison between the original transmission disequilibrium test (TDT) and TDTae statistics

BACKGROUND: Two factors impacting robustness of the original transmission disequilibrium test (TDT) are: i) missing parental genotypes and ii) undetected genotype errors. While it is known that independently these factors can inflate false-positive rates for the original TDT, no study has considered...

Descripción completa

Detalles Bibliográficos
Autores principales: Barral, Sandra, Haynes, Chad, Levenstien, Mark A, Gordon, Derek
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2005
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1866784/
https://www.ncbi.nlm.nih.gov/pubmed/16451611
http://dx.doi.org/10.1186/1471-2156-6-S1-S150
_version_ 1782133327589801984
author Barral, Sandra
Haynes, Chad
Levenstien, Mark A
Gordon, Derek
author_facet Barral, Sandra
Haynes, Chad
Levenstien, Mark A
Gordon, Derek
author_sort Barral, Sandra
collection PubMed
description BACKGROUND: Two factors impacting robustness of the original transmission disequilibrium test (TDT) are: i) missing parental genotypes and ii) undetected genotype errors. While it is known that independently these factors can inflate false-positive rates for the original TDT, no study has considered either the joint impact of these factors on false-positive rates or the precision score of TDT statistics regarding these factors. By precision score, we mean the absolute difference between disease gene position and the position of markers whose TDT statistic exceeds some threshold. METHODS: We apply our transmission disequilibrium test allowing for errors (TDTae) and the original TDT to phenotype and modified single-nucleotide polymorphism genotype simulation data from Genetic Analysis Workshop. We modify genotype data by randomly introducing genotype errors and removing a percentage of parental genotype data. We compute empirical distributions of each statistic's precision score for a chromosome harboring a simulated disease locus. We also consider inflation in type I error by studying markers on a chromosome harboring no disease locus. RESULTS: The TDTae shows median precision scores of approximately 13 cM, 2 cM, 0 cM, and 0 cM at the 5%, 1%, 0.1%, and 0.01% significance levels, respectively. By contrast, the original TDT shows median precision scores of approximately 23 cM, 21 cM, 15 cM, and 7 cM at the corresponding significance levels, respectively. For null chromosomes, the original TDT falsely rejects the null hypothesis for 28.8%, 14.8%, 5.4%, and 1.7% at the 5%, 1%, 0.1% and 0.01%, significance levels, respectively, while TDTae maintains the correct false-positive rate. CONCLUSION: Because missing parental genotypes and undetected genotype errors are unknown to the investigator, but are expected to be increasingly prevalent in multilocus datasets, we strongly recommend TDTae methods as a standard procedure, particularly where stricter significance levels are required.
format Text
id pubmed-1866784
institution National Center for Biotechnology Information
language English
publishDate 2005
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-18667842007-05-11 Precision and type I error rate in the presence of genotype errors and missing parental data: a comparison between the original transmission disequilibrium test (TDT) and TDTae statistics Barral, Sandra Haynes, Chad Levenstien, Mark A Gordon, Derek BMC Genet Proceedings BACKGROUND: Two factors impacting robustness of the original transmission disequilibrium test (TDT) are: i) missing parental genotypes and ii) undetected genotype errors. While it is known that independently these factors can inflate false-positive rates for the original TDT, no study has considered either the joint impact of these factors on false-positive rates or the precision score of TDT statistics regarding these factors. By precision score, we mean the absolute difference between disease gene position and the position of markers whose TDT statistic exceeds some threshold. METHODS: We apply our transmission disequilibrium test allowing for errors (TDTae) and the original TDT to phenotype and modified single-nucleotide polymorphism genotype simulation data from Genetic Analysis Workshop. We modify genotype data by randomly introducing genotype errors and removing a percentage of parental genotype data. We compute empirical distributions of each statistic's precision score for a chromosome harboring a simulated disease locus. We also consider inflation in type I error by studying markers on a chromosome harboring no disease locus. RESULTS: The TDTae shows median precision scores of approximately 13 cM, 2 cM, 0 cM, and 0 cM at the 5%, 1%, 0.1%, and 0.01% significance levels, respectively. By contrast, the original TDT shows median precision scores of approximately 23 cM, 21 cM, 15 cM, and 7 cM at the corresponding significance levels, respectively. For null chromosomes, the original TDT falsely rejects the null hypothesis for 28.8%, 14.8%, 5.4%, and 1.7% at the 5%, 1%, 0.1% and 0.01%, significance levels, respectively, while TDTae maintains the correct false-positive rate. CONCLUSION: Because missing parental genotypes and undetected genotype errors are unknown to the investigator, but are expected to be increasingly prevalent in multilocus datasets, we strongly recommend TDTae methods as a standard procedure, particularly where stricter significance levels are required. BioMed Central 2005-12-30 /pmc/articles/PMC1866784/ /pubmed/16451611 http://dx.doi.org/10.1186/1471-2156-6-S1-S150 Text en Copyright © 2005 Barral et al; licensee BioMed Central Ltd http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Barral, Sandra
Haynes, Chad
Levenstien, Mark A
Gordon, Derek
Precision and type I error rate in the presence of genotype errors and missing parental data: a comparison between the original transmission disequilibrium test (TDT) and TDTae statistics
title Precision and type I error rate in the presence of genotype errors and missing parental data: a comparison between the original transmission disequilibrium test (TDT) and TDTae statistics
title_full Precision and type I error rate in the presence of genotype errors and missing parental data: a comparison between the original transmission disequilibrium test (TDT) and TDTae statistics
title_fullStr Precision and type I error rate in the presence of genotype errors and missing parental data: a comparison between the original transmission disequilibrium test (TDT) and TDTae statistics
title_full_unstemmed Precision and type I error rate in the presence of genotype errors and missing parental data: a comparison between the original transmission disequilibrium test (TDT) and TDTae statistics
title_short Precision and type I error rate in the presence of genotype errors and missing parental data: a comparison between the original transmission disequilibrium test (TDT) and TDTae statistics
title_sort precision and type i error rate in the presence of genotype errors and missing parental data: a comparison between the original transmission disequilibrium test (tdt) and tdtae statistics
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1866784/
https://www.ncbi.nlm.nih.gov/pubmed/16451611
http://dx.doi.org/10.1186/1471-2156-6-S1-S150
work_keys_str_mv AT barralsandra precisionandtypeierrorrateinthepresenceofgenotypeerrorsandmissingparentaldataacomparisonbetweentheoriginaltransmissiondisequilibriumtesttdtandtdtaestatistics
AT hayneschad precisionandtypeierrorrateinthepresenceofgenotypeerrorsandmissingparentaldataacomparisonbetweentheoriginaltransmissiondisequilibriumtesttdtandtdtaestatistics
AT levenstienmarka precisionandtypeierrorrateinthepresenceofgenotypeerrorsandmissingparentaldataacomparisonbetweentheoriginaltransmissiondisequilibriumtesttdtandtdtaestatistics
AT gordonderek precisionandtypeierrorrateinthepresenceofgenotypeerrorsandmissingparentaldataacomparisonbetweentheoriginaltransmissiondisequilibriumtesttdtandtdtaestatistics