Cargando…

Haplotype Reconstruction Error as a Classical Misclassification Problem: Introducing Sensitivity and Specificity as Error Measures

BACKGROUND: Statistically reconstructing haplotypes from single nucleotide polymorphism (SNP) genotypes, can lead to falsely classified haplotypes. This can be an issue when interpreting haplotype association results or when selecting subjects with certain haplotypes for subsequent functional studie...

Descripción completa

Detalles Bibliográficos
Autores principales: Lamina, Claudia, Bongardt, Friedhelm, Küchenhoff, Helmut, Heid, Iris M.
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2267219/
https://www.ncbi.nlm.nih.gov/pubmed/18365008
http://dx.doi.org/10.1371/journal.pone.0001853
_version_ 1782151627337105408
author Lamina, Claudia
Bongardt, Friedhelm
Küchenhoff, Helmut
Heid, Iris M.
author_facet Lamina, Claudia
Bongardt, Friedhelm
Küchenhoff, Helmut
Heid, Iris M.
author_sort Lamina, Claudia
collection PubMed
description BACKGROUND: Statistically reconstructing haplotypes from single nucleotide polymorphism (SNP) genotypes, can lead to falsely classified haplotypes. This can be an issue when interpreting haplotype association results or when selecting subjects with certain haplotypes for subsequent functional studies. It was our aim to quantify haplotype reconstruction error and to provide tools for it. METHODS AND RESULTS: By numerous simulation scenarios, we systematically investigated several error measures, including discrepancy, error rate, and R(2), and introduced the sensitivity and specificity to this context. We exemplified several measures in the KORA study, a large population-based study from Southern Germany. We find that the specificity is slightly reduced only for common haplotypes, while the sensitivity was decreased for some, but not all rare haplotypes. The overall error rate was generally increasing with increasing number of loci, increasing minor allele frequency of SNPs, decreasing correlation between the alleles and increasing ambiguity. CONCLUSIONS: We conclude that, with the analytical approach presented here, haplotype-specific error measures can be computed to gain insight into the haplotype uncertainty. This method provides the information, if a specific risk haplotype can be expected to be reconstructed with rather no or high misclassification and thus on the magnitude of expected bias in association estimates. We also illustrate that sensitivity and specificity separate two dimensions of the haplotype reconstruction error, which completely describe the misclassification matrix and thus provide the prerequisite for methods accounting for misclassification.
format Text
id pubmed-2267219
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-22672192008-03-26 Haplotype Reconstruction Error as a Classical Misclassification Problem: Introducing Sensitivity and Specificity as Error Measures Lamina, Claudia Bongardt, Friedhelm Küchenhoff, Helmut Heid, Iris M. PLoS One Research Article BACKGROUND: Statistically reconstructing haplotypes from single nucleotide polymorphism (SNP) genotypes, can lead to falsely classified haplotypes. This can be an issue when interpreting haplotype association results or when selecting subjects with certain haplotypes for subsequent functional studies. It was our aim to quantify haplotype reconstruction error and to provide tools for it. METHODS AND RESULTS: By numerous simulation scenarios, we systematically investigated several error measures, including discrepancy, error rate, and R(2), and introduced the sensitivity and specificity to this context. We exemplified several measures in the KORA study, a large population-based study from Southern Germany. We find that the specificity is slightly reduced only for common haplotypes, while the sensitivity was decreased for some, but not all rare haplotypes. The overall error rate was generally increasing with increasing number of loci, increasing minor allele frequency of SNPs, decreasing correlation between the alleles and increasing ambiguity. CONCLUSIONS: We conclude that, with the analytical approach presented here, haplotype-specific error measures can be computed to gain insight into the haplotype uncertainty. This method provides the information, if a specific risk haplotype can be expected to be reconstructed with rather no or high misclassification and thus on the magnitude of expected bias in association estimates. We also illustrate that sensitivity and specificity separate two dimensions of the haplotype reconstruction error, which completely describe the misclassification matrix and thus provide the prerequisite for methods accounting for misclassification. Public Library of Science 2008-03-26 /pmc/articles/PMC2267219/ /pubmed/18365008 http://dx.doi.org/10.1371/journal.pone.0001853 Text en Lamina et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Lamina, Claudia
Bongardt, Friedhelm
Küchenhoff, Helmut
Heid, Iris M.
Haplotype Reconstruction Error as a Classical Misclassification Problem: Introducing Sensitivity and Specificity as Error Measures
title Haplotype Reconstruction Error as a Classical Misclassification Problem: Introducing Sensitivity and Specificity as Error Measures
title_full Haplotype Reconstruction Error as a Classical Misclassification Problem: Introducing Sensitivity and Specificity as Error Measures
title_fullStr Haplotype Reconstruction Error as a Classical Misclassification Problem: Introducing Sensitivity and Specificity as Error Measures
title_full_unstemmed Haplotype Reconstruction Error as a Classical Misclassification Problem: Introducing Sensitivity and Specificity as Error Measures
title_short Haplotype Reconstruction Error as a Classical Misclassification Problem: Introducing Sensitivity and Specificity as Error Measures
title_sort haplotype reconstruction error as a classical misclassification problem: introducing sensitivity and specificity as error measures
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2267219/
https://www.ncbi.nlm.nih.gov/pubmed/18365008
http://dx.doi.org/10.1371/journal.pone.0001853
work_keys_str_mv AT laminaclaudia haplotypereconstructionerrorasaclassicalmisclassificationproblemintroducingsensitivityandspecificityaserrormeasures
AT bongardtfriedhelm haplotypereconstructionerrorasaclassicalmisclassificationproblemintroducingsensitivityandspecificityaserrormeasures
AT kuchenhoffhelmut haplotypereconstructionerrorasaclassicalmisclassificationproblemintroducingsensitivityandspecificityaserrormeasures
AT heidirism haplotypereconstructionerrorasaclassicalmisclassificationproblemintroducingsensitivityandspecificityaserrormeasures