Cargando…

Design of experiments for fine-mapping quantitative trait loci in livestock populations

BACKGROUND: Single nucleotide polymorphisms (SNPs) which capture a significant impact on a trait can be identified with genome-wide association studies. High linkage disequilibrium (LD) among SNPs makes it difficult to identify causative variants correctly. Thus, often target regions instead of sing...

Descripción completa

Detalles Bibliográficos
Autores principales: Wittenburg, Dörte, Bonk, Sarah, Doschoris, Michael, Reyer, Henry
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7324978/
https://www.ncbi.nlm.nih.gov/pubmed/32600319
http://dx.doi.org/10.1186/s12863-020-00871-1
_version_ 1783552064123568128
author Wittenburg, Dörte
Bonk, Sarah
Doschoris, Michael
Reyer, Henry
author_facet Wittenburg, Dörte
Bonk, Sarah
Doschoris, Michael
Reyer, Henry
author_sort Wittenburg, Dörte
collection PubMed
description BACKGROUND: Single nucleotide polymorphisms (SNPs) which capture a significant impact on a trait can be identified with genome-wide association studies. High linkage disequilibrium (LD) among SNPs makes it difficult to identify causative variants correctly. Thus, often target regions instead of single SNPs are reported. Sample size has not only a crucial impact on the precision of parameter estimates, it also ensures that a desired level of statistical power can be reached. We study the design of experiments for fine-mapping of signals of a quantitative trait locus in such a target region. METHODS: A multi-locus model allows to identify causative variants simultaneously, to state their positions more precisely and to account for existing dependencies. Based on the commonly applied SNP-BLUP approach, we determine the z-score statistic for locally testing non-zero SNP effects and investigate its distribution under the alternative hypothesis. This quantity employs the theoretical instead of observed dependence between SNPs; it can be set up as a function of paternal and maternal LD for any given population structure. RESULTS: We simulated multiple paternal half-sib families and considered a target region of 1 Mbp. A bimodal distribution of estimated sample size was observed, particularly if more than two causative variants were assumed. The median of estimates constituted the final proposal of optimal sample size; it was consistently less than sample size estimated from single-SNP investigation which was used as a baseline approach. The second mode pointed to inflated sample sizes and could be explained by blocks of varying linkage phases leading to negative correlations between SNPs. Optimal sample size increased almost linearly with number of signals to be identified but depended much stronger on the assumption on heritability. For instance, three times as many samples were required if heritability was 0.1 compared to 0.3. An R package is provided that comprises all required tools. CONCLUSIONS: Our approach incorporates information about the population structure into the design of experiments. Compared to a conventional method, this leads to a reduced estimate of sample size enabling the resource-saving design of future experiments for fine-mapping of candidate variants.
format Online
Article
Text
id pubmed-7324978
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-73249782020-06-30 Design of experiments for fine-mapping quantitative trait loci in livestock populations Wittenburg, Dörte Bonk, Sarah Doschoris, Michael Reyer, Henry BMC Genet Methodology Article BACKGROUND: Single nucleotide polymorphisms (SNPs) which capture a significant impact on a trait can be identified with genome-wide association studies. High linkage disequilibrium (LD) among SNPs makes it difficult to identify causative variants correctly. Thus, often target regions instead of single SNPs are reported. Sample size has not only a crucial impact on the precision of parameter estimates, it also ensures that a desired level of statistical power can be reached. We study the design of experiments for fine-mapping of signals of a quantitative trait locus in such a target region. METHODS: A multi-locus model allows to identify causative variants simultaneously, to state their positions more precisely and to account for existing dependencies. Based on the commonly applied SNP-BLUP approach, we determine the z-score statistic for locally testing non-zero SNP effects and investigate its distribution under the alternative hypothesis. This quantity employs the theoretical instead of observed dependence between SNPs; it can be set up as a function of paternal and maternal LD for any given population structure. RESULTS: We simulated multiple paternal half-sib families and considered a target region of 1 Mbp. A bimodal distribution of estimated sample size was observed, particularly if more than two causative variants were assumed. The median of estimates constituted the final proposal of optimal sample size; it was consistently less than sample size estimated from single-SNP investigation which was used as a baseline approach. The second mode pointed to inflated sample sizes and could be explained by blocks of varying linkage phases leading to negative correlations between SNPs. Optimal sample size increased almost linearly with number of signals to be identified but depended much stronger on the assumption on heritability. For instance, three times as many samples were required if heritability was 0.1 compared to 0.3. An R package is provided that comprises all required tools. CONCLUSIONS: Our approach incorporates information about the population structure into the design of experiments. Compared to a conventional method, this leads to a reduced estimate of sample size enabling the resource-saving design of future experiments for fine-mapping of candidate variants. BioMed Central 2020-06-29 /pmc/articles/PMC7324978/ /pubmed/32600319 http://dx.doi.org/10.1186/s12863-020-00871-1 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visithttp://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Methodology Article
Wittenburg, Dörte
Bonk, Sarah
Doschoris, Michael
Reyer, Henry
Design of experiments for fine-mapping quantitative trait loci in livestock populations
title Design of experiments for fine-mapping quantitative trait loci in livestock populations
title_full Design of experiments for fine-mapping quantitative trait loci in livestock populations
title_fullStr Design of experiments for fine-mapping quantitative trait loci in livestock populations
title_full_unstemmed Design of experiments for fine-mapping quantitative trait loci in livestock populations
title_short Design of experiments for fine-mapping quantitative trait loci in livestock populations
title_sort design of experiments for fine-mapping quantitative trait loci in livestock populations
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7324978/
https://www.ncbi.nlm.nih.gov/pubmed/32600319
http://dx.doi.org/10.1186/s12863-020-00871-1
work_keys_str_mv AT wittenburgdorte designofexperimentsforfinemappingquantitativetraitlociinlivestockpopulations
AT bonksarah designofexperimentsforfinemappingquantitativetraitlociinlivestockpopulations
AT doschorismichael designofexperimentsforfinemappingquantitativetraitlociinlivestockpopulations
AT reyerhenry designofexperimentsforfinemappingquantitativetraitlociinlivestockpopulations