Cargando…
Frequentist p-values for large-scale-single step genome-wide association, with an application to birth weight in American Angus cattle
BACKGROUND: Single-step genomic best linear unbiased prediction (SSGBLUP) is a comprehensive method for genomic prediction. Point estimates of marker effects from SSGBLUP are often used for genome-wide association studies (GWAS) without a formal framework of hypothesis testing. Our objective was to...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6584984/ https://www.ncbi.nlm.nih.gov/pubmed/31221101 http://dx.doi.org/10.1186/s12711-019-0469-3 |
_version_ | 1783428613127798784 |
---|---|
author | Aguilar, Ignacio Legarra, Andres Cardoso, Fernando Masuda, Yutaka Lourenco, Daniela Misztal, Ignacy |
author_facet | Aguilar, Ignacio Legarra, Andres Cardoso, Fernando Masuda, Yutaka Lourenco, Daniela Misztal, Ignacy |
author_sort | Aguilar, Ignacio |
collection | PubMed |
description | BACKGROUND: Single-step genomic best linear unbiased prediction (SSGBLUP) is a comprehensive method for genomic prediction. Point estimates of marker effects from SSGBLUP are often used for genome-wide association studies (GWAS) without a formal framework of hypothesis testing. Our objective was to implement p-values for single-marker GWAS studies within the single-step GWAS (SSGWAS) framework by deriving computational algorithms and procedures, and by applying these to a large beef cattle population. METHODS: P-values were obtained based on the prediction error (co)variances for single nucleotide polymorphisms (SNPs), which were obtained from the prediction error (co)variances of genomic predictions based on the inverse of the coefficient matrix and formulas to estimate SNP effects. RESULTS: Computation of p-values took a negligible time for a dataset with almost 2 million animals in the pedigree and 1424 genotyped sires, and no inflation of statistics was observed. The SNPs that passed the Bonferroni threshold of 10(−5.9) were the same as those that explained the highest proportion of additive genetic variance, but even at the same significance levels and effects, some of them explained less genetic variance due to lower allele frequency. CONCLUSIONS: The use of a p-value for SSGWAS is a very general and efficient strategy to identify quantitative trait loci (QTL). It can be used for complex datasets such as those used in animal breeding, where only a proportion of the pedigreed animals are genotyped. |
format | Online Article Text |
id | pubmed-6584984 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-65849842019-06-27 Frequentist p-values for large-scale-single step genome-wide association, with an application to birth weight in American Angus cattle Aguilar, Ignacio Legarra, Andres Cardoso, Fernando Masuda, Yutaka Lourenco, Daniela Misztal, Ignacy Genet Sel Evol Short Communication BACKGROUND: Single-step genomic best linear unbiased prediction (SSGBLUP) is a comprehensive method for genomic prediction. Point estimates of marker effects from SSGBLUP are often used for genome-wide association studies (GWAS) without a formal framework of hypothesis testing. Our objective was to implement p-values for single-marker GWAS studies within the single-step GWAS (SSGWAS) framework by deriving computational algorithms and procedures, and by applying these to a large beef cattle population. METHODS: P-values were obtained based on the prediction error (co)variances for single nucleotide polymorphisms (SNPs), which were obtained from the prediction error (co)variances of genomic predictions based on the inverse of the coefficient matrix and formulas to estimate SNP effects. RESULTS: Computation of p-values took a negligible time for a dataset with almost 2 million animals in the pedigree and 1424 genotyped sires, and no inflation of statistics was observed. The SNPs that passed the Bonferroni threshold of 10(−5.9) were the same as those that explained the highest proportion of additive genetic variance, but even at the same significance levels and effects, some of them explained less genetic variance due to lower allele frequency. CONCLUSIONS: The use of a p-value for SSGWAS is a very general and efficient strategy to identify quantitative trait loci (QTL). It can be used for complex datasets such as those used in animal breeding, where only a proportion of the pedigreed animals are genotyped. BioMed Central 2019-06-20 /pmc/articles/PMC6584984/ /pubmed/31221101 http://dx.doi.org/10.1186/s12711-019-0469-3 Text en © The Author(s) 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Short Communication Aguilar, Ignacio Legarra, Andres Cardoso, Fernando Masuda, Yutaka Lourenco, Daniela Misztal, Ignacy Frequentist p-values for large-scale-single step genome-wide association, with an application to birth weight in American Angus cattle |
title | Frequentist p-values for large-scale-single step genome-wide association, with an application to birth weight in American Angus cattle |
title_full | Frequentist p-values for large-scale-single step genome-wide association, with an application to birth weight in American Angus cattle |
title_fullStr | Frequentist p-values for large-scale-single step genome-wide association, with an application to birth weight in American Angus cattle |
title_full_unstemmed | Frequentist p-values for large-scale-single step genome-wide association, with an application to birth weight in American Angus cattle |
title_short | Frequentist p-values for large-scale-single step genome-wide association, with an application to birth weight in American Angus cattle |
title_sort | frequentist p-values for large-scale-single step genome-wide association, with an application to birth weight in american angus cattle |
topic | Short Communication |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6584984/ https://www.ncbi.nlm.nih.gov/pubmed/31221101 http://dx.doi.org/10.1186/s12711-019-0469-3 |
work_keys_str_mv | AT aguilarignacio frequentistpvaluesforlargescalesinglestepgenomewideassociationwithanapplicationtobirthweightinamericananguscattle AT legarraandres frequentistpvaluesforlargescalesinglestepgenomewideassociationwithanapplicationtobirthweightinamericananguscattle AT cardosofernando frequentistpvaluesforlargescalesinglestepgenomewideassociationwithanapplicationtobirthweightinamericananguscattle AT masudayutaka frequentistpvaluesforlargescalesinglestepgenomewideassociationwithanapplicationtobirthweightinamericananguscattle AT lourencodaniela frequentistpvaluesforlargescalesinglestepgenomewideassociationwithanapplicationtobirthweightinamericananguscattle AT misztalignacy frequentistpvaluesforlargescalesinglestepgenomewideassociationwithanapplicationtobirthweightinamericananguscattle |