Cargando…

Frequentist p-values for large-scale-single step genome-wide association, with an application to birth weight in American Angus cattle

BACKGROUND: Single-step genomic best linear unbiased prediction (SSGBLUP) is a comprehensive method for genomic prediction. Point estimates of marker effects from SSGBLUP are often used for genome-wide association studies (GWAS) without a formal framework of hypothesis testing. Our objective was to...

Descripción completa

Detalles Bibliográficos
Autores principales: Aguilar, Ignacio, Legarra, Andres, Cardoso, Fernando, Masuda, Yutaka, Lourenco, Daniela, Misztal, Ignacy
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6584984/
https://www.ncbi.nlm.nih.gov/pubmed/31221101
http://dx.doi.org/10.1186/s12711-019-0469-3
_version_ 1783428613127798784
author Aguilar, Ignacio
Legarra, Andres
Cardoso, Fernando
Masuda, Yutaka
Lourenco, Daniela
Misztal, Ignacy
author_facet Aguilar, Ignacio
Legarra, Andres
Cardoso, Fernando
Masuda, Yutaka
Lourenco, Daniela
Misztal, Ignacy
author_sort Aguilar, Ignacio
collection PubMed
description BACKGROUND: Single-step genomic best linear unbiased prediction (SSGBLUP) is a comprehensive method for genomic prediction. Point estimates of marker effects from SSGBLUP are often used for genome-wide association studies (GWAS) without a formal framework of hypothesis testing. Our objective was to implement p-values for single-marker GWAS studies within the single-step GWAS (SSGWAS) framework by deriving computational algorithms and procedures, and by applying these to a large beef cattle population. METHODS: P-values were obtained based on the prediction error (co)variances for single nucleotide polymorphisms (SNPs), which were obtained from the prediction error (co)variances of genomic predictions based on the inverse of the coefficient matrix and formulas to estimate SNP effects. RESULTS: Computation of p-values took a negligible time for a dataset with almost 2 million animals in the pedigree and 1424 genotyped sires, and no inflation of statistics was observed. The SNPs that passed the Bonferroni threshold of 10(−5.9) were the same as those that explained the highest proportion of additive genetic variance, but even at the same significance levels and effects, some of them explained less genetic variance due to lower allele frequency. CONCLUSIONS: The use of a p-value for SSGWAS is a very general and efficient strategy to identify quantitative trait loci (QTL). It can be used for complex datasets such as those used in animal breeding, where only a proportion of the pedigreed animals are genotyped.
format Online
Article
Text
id pubmed-6584984
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-65849842019-06-27 Frequentist p-values for large-scale-single step genome-wide association, with an application to birth weight in American Angus cattle Aguilar, Ignacio Legarra, Andres Cardoso, Fernando Masuda, Yutaka Lourenco, Daniela Misztal, Ignacy Genet Sel Evol Short Communication BACKGROUND: Single-step genomic best linear unbiased prediction (SSGBLUP) is a comprehensive method for genomic prediction. Point estimates of marker effects from SSGBLUP are often used for genome-wide association studies (GWAS) without a formal framework of hypothesis testing. Our objective was to implement p-values for single-marker GWAS studies within the single-step GWAS (SSGWAS) framework by deriving computational algorithms and procedures, and by applying these to a large beef cattle population. METHODS: P-values were obtained based on the prediction error (co)variances for single nucleotide polymorphisms (SNPs), which were obtained from the prediction error (co)variances of genomic predictions based on the inverse of the coefficient matrix and formulas to estimate SNP effects. RESULTS: Computation of p-values took a negligible time for a dataset with almost 2 million animals in the pedigree and 1424 genotyped sires, and no inflation of statistics was observed. The SNPs that passed the Bonferroni threshold of 10(−5.9) were the same as those that explained the highest proportion of additive genetic variance, but even at the same significance levels and effects, some of them explained less genetic variance due to lower allele frequency. CONCLUSIONS: The use of a p-value for SSGWAS is a very general and efficient strategy to identify quantitative trait loci (QTL). It can be used for complex datasets such as those used in animal breeding, where only a proportion of the pedigreed animals are genotyped. BioMed Central 2019-06-20 /pmc/articles/PMC6584984/ /pubmed/31221101 http://dx.doi.org/10.1186/s12711-019-0469-3 Text en © The Author(s) 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Short Communication
Aguilar, Ignacio
Legarra, Andres
Cardoso, Fernando
Masuda, Yutaka
Lourenco, Daniela
Misztal, Ignacy
Frequentist p-values for large-scale-single step genome-wide association, with an application to birth weight in American Angus cattle
title Frequentist p-values for large-scale-single step genome-wide association, with an application to birth weight in American Angus cattle
title_full Frequentist p-values for large-scale-single step genome-wide association, with an application to birth weight in American Angus cattle
title_fullStr Frequentist p-values for large-scale-single step genome-wide association, with an application to birth weight in American Angus cattle
title_full_unstemmed Frequentist p-values for large-scale-single step genome-wide association, with an application to birth weight in American Angus cattle
title_short Frequentist p-values for large-scale-single step genome-wide association, with an application to birth weight in American Angus cattle
title_sort frequentist p-values for large-scale-single step genome-wide association, with an application to birth weight in american angus cattle
topic Short Communication
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6584984/
https://www.ncbi.nlm.nih.gov/pubmed/31221101
http://dx.doi.org/10.1186/s12711-019-0469-3
work_keys_str_mv AT aguilarignacio frequentistpvaluesforlargescalesinglestepgenomewideassociationwithanapplicationtobirthweightinamericananguscattle
AT legarraandres frequentistpvaluesforlargescalesinglestepgenomewideassociationwithanapplicationtobirthweightinamericananguscattle
AT cardosofernando frequentistpvaluesforlargescalesinglestepgenomewideassociationwithanapplicationtobirthweightinamericananguscattle
AT masudayutaka frequentistpvaluesforlargescalesinglestepgenomewideassociationwithanapplicationtobirthweightinamericananguscattle
AT lourencodaniela frequentistpvaluesforlargescalesinglestepgenomewideassociationwithanapplicationtobirthweightinamericananguscattle
AT misztalignacy frequentistpvaluesforlargescalesinglestepgenomewideassociationwithanapplicationtobirthweightinamericananguscattle