Cargando…

QTL-mapping and genomic prediction for bovine respiratory disease in U.S. Holsteins using sequence imputation and feature selection

BACKGROUND: National genetic evaluations for disease resistance do not exist, precluding the genetic improvement of cattle for these traits. We imputed BovineHD genotypes to whole genome sequence for 2703 Holsteins that were cases or controls for Bovine Respiratory Disease and sampled from either Ca...

Descripción completa

Detalles Bibliográficos
Autores principales: Hoff, Jesse L., Decker, Jared E., Schnabel, Robert D., Seabury, Christopher M., Neibergs, Holly L., Taylor, Jeremy F.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6612181/
https://www.ncbi.nlm.nih.gov/pubmed/31277567
http://dx.doi.org/10.1186/s12864-019-5941-5
_version_ 1783432842162733056
author Hoff, Jesse L.
Decker, Jared E.
Schnabel, Robert D.
Seabury, Christopher M.
Neibergs, Holly L.
Taylor, Jeremy F.
author_facet Hoff, Jesse L.
Decker, Jared E.
Schnabel, Robert D.
Seabury, Christopher M.
Neibergs, Holly L.
Taylor, Jeremy F.
author_sort Hoff, Jesse L.
collection PubMed
description BACKGROUND: National genetic evaluations for disease resistance do not exist, precluding the genetic improvement of cattle for these traits. We imputed BovineHD genotypes to whole genome sequence for 2703 Holsteins that were cases or controls for Bovine Respiratory Disease and sampled from either California or New Mexico to construct and compare genomic prediction models. The sequence variation reference dataset comprised variants called for 1578 animals from Run 5 of the 1000 Bull Genomes Project, including 450 Holsteins and 29 animals sequenced from this study population. Genotypes for 9,282,726 variants with minor allele frequencies ≥5% were imputed and used to obtain genomic predictions in GEMMA using a Bayesian Sparse Linear Mixed Model. RESULTS: Variation explained by markers increased from 13.6% using BovineHD data to 14.4% using imputed whole genome sequence data and the resolution of genomic regions detected as harbouring QTL substantially increased. Explained variation in the analysis of the combined California and New Mexico data was less than when data for each state were separately analysed and the estimated genetic correlation between risk of Bovine Respiratory Disease in California and New Mexico Holsteins was − 0.36. Consequently, genomic predictions trained using the data from one state did not accurately predict disease risk in the other state. To determine if a prediction model could be developed with utility in both states, we selected variants within genomic regions harbouring: 1) genes involved in the normal immune response to infection by pathogens responsible for Bovine Respiratory Disease detected by RNA-Seq analysis, and/or 2) QTL identified in the association analysis of the imputed sequence variants. The model based on QTL selected variants is biased but when trained in one state generated BRD risk predictions with positive accuracies in the other state. CONCLUSIONS: We demonstrate the utility of sequence-based and biology-driven model development for genomic selection. Disease phenotypes cannot be routinely recorded in most livestock species and the observed phenotypes may vary in their genomic architecture due to variation in the pathogen composition across environments. Elucidation of trait biology and genetic architecture may guide the development of prediction models with utility across breeds and environments. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-019-5941-5) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6612181
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-66121812019-07-16 QTL-mapping and genomic prediction for bovine respiratory disease in U.S. Holsteins using sequence imputation and feature selection Hoff, Jesse L. Decker, Jared E. Schnabel, Robert D. Seabury, Christopher M. Neibergs, Holly L. Taylor, Jeremy F. BMC Genomics Research Article BACKGROUND: National genetic evaluations for disease resistance do not exist, precluding the genetic improvement of cattle for these traits. We imputed BovineHD genotypes to whole genome sequence for 2703 Holsteins that were cases or controls for Bovine Respiratory Disease and sampled from either California or New Mexico to construct and compare genomic prediction models. The sequence variation reference dataset comprised variants called for 1578 animals from Run 5 of the 1000 Bull Genomes Project, including 450 Holsteins and 29 animals sequenced from this study population. Genotypes for 9,282,726 variants with minor allele frequencies ≥5% were imputed and used to obtain genomic predictions in GEMMA using a Bayesian Sparse Linear Mixed Model. RESULTS: Variation explained by markers increased from 13.6% using BovineHD data to 14.4% using imputed whole genome sequence data and the resolution of genomic regions detected as harbouring QTL substantially increased. Explained variation in the analysis of the combined California and New Mexico data was less than when data for each state were separately analysed and the estimated genetic correlation between risk of Bovine Respiratory Disease in California and New Mexico Holsteins was − 0.36. Consequently, genomic predictions trained using the data from one state did not accurately predict disease risk in the other state. To determine if a prediction model could be developed with utility in both states, we selected variants within genomic regions harbouring: 1) genes involved in the normal immune response to infection by pathogens responsible for Bovine Respiratory Disease detected by RNA-Seq analysis, and/or 2) QTL identified in the association analysis of the imputed sequence variants. The model based on QTL selected variants is biased but when trained in one state generated BRD risk predictions with positive accuracies in the other state. CONCLUSIONS: We demonstrate the utility of sequence-based and biology-driven model development for genomic selection. Disease phenotypes cannot be routinely recorded in most livestock species and the observed phenotypes may vary in their genomic architecture due to variation in the pathogen composition across environments. Elucidation of trait biology and genetic architecture may guide the development of prediction models with utility across breeds and environments. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-019-5941-5) contains supplementary material, which is available to authorized users. BioMed Central 2019-07-05 /pmc/articles/PMC6612181/ /pubmed/31277567 http://dx.doi.org/10.1186/s12864-019-5941-5 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Hoff, Jesse L.
Decker, Jared E.
Schnabel, Robert D.
Seabury, Christopher M.
Neibergs, Holly L.
Taylor, Jeremy F.
QTL-mapping and genomic prediction for bovine respiratory disease in U.S. Holsteins using sequence imputation and feature selection
title QTL-mapping and genomic prediction for bovine respiratory disease in U.S. Holsteins using sequence imputation and feature selection
title_full QTL-mapping and genomic prediction for bovine respiratory disease in U.S. Holsteins using sequence imputation and feature selection
title_fullStr QTL-mapping and genomic prediction for bovine respiratory disease in U.S. Holsteins using sequence imputation and feature selection
title_full_unstemmed QTL-mapping and genomic prediction for bovine respiratory disease in U.S. Holsteins using sequence imputation and feature selection
title_short QTL-mapping and genomic prediction for bovine respiratory disease in U.S. Holsteins using sequence imputation and feature selection
title_sort qtl-mapping and genomic prediction for bovine respiratory disease in u.s. holsteins using sequence imputation and feature selection
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6612181/
https://www.ncbi.nlm.nih.gov/pubmed/31277567
http://dx.doi.org/10.1186/s12864-019-5941-5
work_keys_str_mv AT hoffjessel qtlmappingandgenomicpredictionforbovinerespiratorydiseaseinusholsteinsusingsequenceimputationandfeatureselection
AT deckerjarede qtlmappingandgenomicpredictionforbovinerespiratorydiseaseinusholsteinsusingsequenceimputationandfeatureselection
AT schnabelrobertd qtlmappingandgenomicpredictionforbovinerespiratorydiseaseinusholsteinsusingsequenceimputationandfeatureselection
AT seaburychristopherm qtlmappingandgenomicpredictionforbovinerespiratorydiseaseinusholsteinsusingsequenceimputationandfeatureselection
AT neibergshollyl qtlmappingandgenomicpredictionforbovinerespiratorydiseaseinusholsteinsusingsequenceimputationandfeatureselection
AT taylorjeremyf qtlmappingandgenomicpredictionforbovinerespiratorydiseaseinusholsteinsusingsequenceimputationandfeatureselection