Cargando…

False-positive rates in two-point parametric linkage analysis

Two-point linkage analyses of whole genome sequence data are a promising approach to identify rare variants that segregate with complex diseases in large pedigrees because, in theory, the causal variants have been genotyped. We used whole genome sequence data and simulated traits provided by Genetic...

Descripción completa

Detalles Bibliográficos
Autores principales: Szymczak, Silke, Simpson, Claire L, Cropp, Cheryl D, Bailey-Wilson, Joan E
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4143621/
https://www.ncbi.nlm.nih.gov/pubmed/25519363
http://dx.doi.org/10.1186/1753-6561-8-S1-S110
_version_ 1782331924764688384
author Szymczak, Silke
Simpson, Claire L
Cropp, Cheryl D
Bailey-Wilson, Joan E
author_facet Szymczak, Silke
Simpson, Claire L
Cropp, Cheryl D
Bailey-Wilson, Joan E
author_sort Szymczak, Silke
collection PubMed
description Two-point linkage analyses of whole genome sequence data are a promising approach to identify rare variants that segregate with complex diseases in large pedigrees because, in theory, the causal variants have been genotyped. We used whole genome sequence data and simulated traits provided by Genetic Analysis Workshop 18 to evaluate the proportion of false-positive findings in a binary trait using classic two-point parametric linkage analysis. False-positive genome-wide significant log of odds (LOD) scores were identified in more than 80% of 50 replicates for a binary phenotype generated by dichotomizing a quantitative trait that was simulated with a polygenic component (that was not based on any of the provided whole genome sequence genotypes). In contrast, when the trait was truly nongenetic (created by randomly assigning affected-unaffected status), the number of false-positive results was well controlled. These results suggest that when using two-point linkage analyses on whole genome sequence data, one should carefully examine regions yielding significant two-point LOD scores with multipoint analysis and that a more stringent significance threshold may be needed.
format Online
Article
Text
id pubmed-4143621
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-41436212014-09-02 False-positive rates in two-point parametric linkage analysis Szymczak, Silke Simpson, Claire L Cropp, Cheryl D Bailey-Wilson, Joan E BMC Proc Proceedings Two-point linkage analyses of whole genome sequence data are a promising approach to identify rare variants that segregate with complex diseases in large pedigrees because, in theory, the causal variants have been genotyped. We used whole genome sequence data and simulated traits provided by Genetic Analysis Workshop 18 to evaluate the proportion of false-positive findings in a binary trait using classic two-point parametric linkage analysis. False-positive genome-wide significant log of odds (LOD) scores were identified in more than 80% of 50 replicates for a binary phenotype generated by dichotomizing a quantitative trait that was simulated with a polygenic component (that was not based on any of the provided whole genome sequence genotypes). In contrast, when the trait was truly nongenetic (created by randomly assigning affected-unaffected status), the number of false-positive results was well controlled. These results suggest that when using two-point linkage analyses on whole genome sequence data, one should carefully examine regions yielding significant two-point LOD scores with multipoint analysis and that a more stringent significance threshold may be needed. BioMed Central 2014-06-17 /pmc/articles/PMC4143621/ /pubmed/25519363 http://dx.doi.org/10.1186/1753-6561-8-S1-S110 Text en Copyright © 2014 Szymczak et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Proceedings
Szymczak, Silke
Simpson, Claire L
Cropp, Cheryl D
Bailey-Wilson, Joan E
False-positive rates in two-point parametric linkage analysis
title False-positive rates in two-point parametric linkage analysis
title_full False-positive rates in two-point parametric linkage analysis
title_fullStr False-positive rates in two-point parametric linkage analysis
title_full_unstemmed False-positive rates in two-point parametric linkage analysis
title_short False-positive rates in two-point parametric linkage analysis
title_sort false-positive rates in two-point parametric linkage analysis
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4143621/
https://www.ncbi.nlm.nih.gov/pubmed/25519363
http://dx.doi.org/10.1186/1753-6561-8-S1-S110
work_keys_str_mv AT szymczaksilke falsepositiveratesintwopointparametriclinkageanalysis
AT simpsonclairel falsepositiveratesintwopointparametriclinkageanalysis
AT croppcheryld falsepositiveratesintwopointparametriclinkageanalysis
AT baileywilsonjoane falsepositiveratesintwopointparametriclinkageanalysis