Cargando…

Improvement of genomic prediction by integrating additional single nucleotide polymorphisms selected from imputed whole genome sequencing data

The availability of whole genome sequencing (WGS) data enables the discovery of causative single nucleotide polymorphisms (SNPs) or SNPs in high linkage disequilibrium with causative SNPs. This study investigated effects of integrating SNPs selected from imputed WGS data into the data of 54K chip on...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Aoxing, Lund, Mogens Sandø, Boichard, Didier, Karaman, Emre, Fritz, Sebastien, Aamand, Gert Pedersen, Nielsen, Ulrik Sander, Wang, Yachun, Su, Guosheng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6906477/
https://www.ncbi.nlm.nih.gov/pubmed/31278370
http://dx.doi.org/10.1038/s41437-019-0246-7
_version_ 1783478353325457408
author Liu, Aoxing
Lund, Mogens Sandø
Boichard, Didier
Karaman, Emre
Fritz, Sebastien
Aamand, Gert Pedersen
Nielsen, Ulrik Sander
Wang, Yachun
Su, Guosheng
author_facet Liu, Aoxing
Lund, Mogens Sandø
Boichard, Didier
Karaman, Emre
Fritz, Sebastien
Aamand, Gert Pedersen
Nielsen, Ulrik Sander
Wang, Yachun
Su, Guosheng
author_sort Liu, Aoxing
collection PubMed
description The availability of whole genome sequencing (WGS) data enables the discovery of causative single nucleotide polymorphisms (SNPs) or SNPs in high linkage disequilibrium with causative SNPs. This study investigated effects of integrating SNPs selected from imputed WGS data into the data of 54K chip on genomic prediction in Danish Jersey. The WGS SNPs, mainly including peaks of quantitative trait loci, structure variants, regulatory regions of genes, and SNPs within genes with strong effects predicted with variant effect predictor, were selected in previous analyses for dairy breeds in Denmark–Finland–Sweden (DFS) and France (FRA). Animals genotyped with 54K chip, standard LD chip, and customized LD chip which covered selected WGS SNPs and SNPs in the standard LD chip, were imputed to 54K together with DFS and FRA SNPs. Genomic best linear unbiased prediction (GBLUP) and Bayesian four-distribution mixture models considering 54K and selected WGS SNPs as one (a one-component model) or two separate genetic components (a two-component model) were used to predict breeding values. For milk production traits and mastitis, both DFS (0.025) and FRA (0.029) sets of additional WGS SNPs improved reliabilities, and inclusions of all selected WGS SNPs generally achieved highest improvements of reliabilities (0.034). A Bayesian four-distribution model yielded higher reliabilities than a GBLUP model for milk and protein, but extra gains in reliabilities from using selected WGS SNPs were smaller for a Bayesian four-distribution model than a GBLUP model. Generally, no significant difference was observed between one-component and two-component models, except for using GBLUP models for milk.
format Online
Article
Text
id pubmed-6906477
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-69064772019-12-12 Improvement of genomic prediction by integrating additional single nucleotide polymorphisms selected from imputed whole genome sequencing data Liu, Aoxing Lund, Mogens Sandø Boichard, Didier Karaman, Emre Fritz, Sebastien Aamand, Gert Pedersen Nielsen, Ulrik Sander Wang, Yachun Su, Guosheng Heredity (Edinb) Article The availability of whole genome sequencing (WGS) data enables the discovery of causative single nucleotide polymorphisms (SNPs) or SNPs in high linkage disequilibrium with causative SNPs. This study investigated effects of integrating SNPs selected from imputed WGS data into the data of 54K chip on genomic prediction in Danish Jersey. The WGS SNPs, mainly including peaks of quantitative trait loci, structure variants, regulatory regions of genes, and SNPs within genes with strong effects predicted with variant effect predictor, were selected in previous analyses for dairy breeds in Denmark–Finland–Sweden (DFS) and France (FRA). Animals genotyped with 54K chip, standard LD chip, and customized LD chip which covered selected WGS SNPs and SNPs in the standard LD chip, were imputed to 54K together with DFS and FRA SNPs. Genomic best linear unbiased prediction (GBLUP) and Bayesian four-distribution mixture models considering 54K and selected WGS SNPs as one (a one-component model) or two separate genetic components (a two-component model) were used to predict breeding values. For milk production traits and mastitis, both DFS (0.025) and FRA (0.029) sets of additional WGS SNPs improved reliabilities, and inclusions of all selected WGS SNPs generally achieved highest improvements of reliabilities (0.034). A Bayesian four-distribution model yielded higher reliabilities than a GBLUP model for milk and protein, but extra gains in reliabilities from using selected WGS SNPs were smaller for a Bayesian four-distribution model than a GBLUP model. Generally, no significant difference was observed between one-component and two-component models, except for using GBLUP models for milk. Springer International Publishing 2019-07-05 2020-01 /pmc/articles/PMC6906477/ /pubmed/31278370 http://dx.doi.org/10.1038/s41437-019-0246-7 Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Liu, Aoxing
Lund, Mogens Sandø
Boichard, Didier
Karaman, Emre
Fritz, Sebastien
Aamand, Gert Pedersen
Nielsen, Ulrik Sander
Wang, Yachun
Su, Guosheng
Improvement of genomic prediction by integrating additional single nucleotide polymorphisms selected from imputed whole genome sequencing data
title Improvement of genomic prediction by integrating additional single nucleotide polymorphisms selected from imputed whole genome sequencing data
title_full Improvement of genomic prediction by integrating additional single nucleotide polymorphisms selected from imputed whole genome sequencing data
title_fullStr Improvement of genomic prediction by integrating additional single nucleotide polymorphisms selected from imputed whole genome sequencing data
title_full_unstemmed Improvement of genomic prediction by integrating additional single nucleotide polymorphisms selected from imputed whole genome sequencing data
title_short Improvement of genomic prediction by integrating additional single nucleotide polymorphisms selected from imputed whole genome sequencing data
title_sort improvement of genomic prediction by integrating additional single nucleotide polymorphisms selected from imputed whole genome sequencing data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6906477/
https://www.ncbi.nlm.nih.gov/pubmed/31278370
http://dx.doi.org/10.1038/s41437-019-0246-7
work_keys_str_mv AT liuaoxing improvementofgenomicpredictionbyintegratingadditionalsinglenucleotidepolymorphismsselectedfromimputedwholegenomesequencingdata
AT lundmogenssandø improvementofgenomicpredictionbyintegratingadditionalsinglenucleotidepolymorphismsselectedfromimputedwholegenomesequencingdata
AT boicharddidier improvementofgenomicpredictionbyintegratingadditionalsinglenucleotidepolymorphismsselectedfromimputedwholegenomesequencingdata
AT karamanemre improvementofgenomicpredictionbyintegratingadditionalsinglenucleotidepolymorphismsselectedfromimputedwholegenomesequencingdata
AT fritzsebastien improvementofgenomicpredictionbyintegratingadditionalsinglenucleotidepolymorphismsselectedfromimputedwholegenomesequencingdata
AT aamandgertpedersen improvementofgenomicpredictionbyintegratingadditionalsinglenucleotidepolymorphismsselectedfromimputedwholegenomesequencingdata
AT nielsenulriksander improvementofgenomicpredictionbyintegratingadditionalsinglenucleotidepolymorphismsselectedfromimputedwholegenomesequencingdata
AT wangyachun improvementofgenomicpredictionbyintegratingadditionalsinglenucleotidepolymorphismsselectedfromimputedwholegenomesequencingdata
AT suguosheng improvementofgenomicpredictionbyintegratingadditionalsinglenucleotidepolymorphismsselectedfromimputedwholegenomesequencingdata