Cargando…

Increasing accuracy of genomic selection in presence of high density marker panels through the prioritization of relevant polymorphisms

BACKGROUND: It becomes clear that the increase in the density of marker panels and even the use of sequence data didn’t result in any meaningful increase in the accuracy of genomic selection (GS) using either regression (RM) or variance component (VC) approaches. This is in part due to the limitatio...

Descripción completa

Detalles Bibliográficos
Autores principales:	Chang, Ling-Yun, Toghiani, Sajjad, Aggrey, Samuel E., Rekaya, Romdhane
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2019
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6387489/ https://www.ncbi.nlm.nih.gov/pubmed/30795734 http://dx.doi.org/10.1186/s12863-019-0720-5

_version_	1783397593374523392
author	Chang, Ling-Yun Toghiani, Sajjad Aggrey, Samuel E. Rekaya, Romdhane
author_facet	Chang, Ling-Yun Toghiani, Sajjad Aggrey, Samuel E. Rekaya, Romdhane
author_sort	Chang, Ling-Yun
collection	PubMed
description	BACKGROUND: It becomes clear that the increase in the density of marker panels and even the use of sequence data didn’t result in any meaningful increase in the accuracy of genomic selection (GS) using either regression (RM) or variance component (VC) approaches. This is in part due to the limitations of current methods. Association model are well over-parameterized and suffer from severe co-linearity and lack of statistical power. Even when the variant effects are not directly estimated using VC based approaches, the genomic relationships didn’t improve after the marker density exceeded a certain threshold. SNP prioritization-based fixation index (F(ST)) scores were used to track the majority of significant QTL and to reduce the dimensionality of the association model. RESULTS: Two populations with average LD between adjacent markers of 0.3 (P1) and 0.7 (P2) were simulated. In both populations, the genomic data consisted of 400 K SNP markers distributed on 10 chromosomes. The density of simulated genomic data mimics roughly 1.2 million SNP markers in the bovine genome. The genomic relationship matrix (G) was calculated for each set of selected SNPs based on their F(ST) score and similar numbers of SNPs were selected randomly for comparison. Using all 400 K SNPs, 46% of the off-diagonal elements (OD) were between − 0.01 and 0.01. The same portion was 31, 23 and 16% when 80 K, 40 K and 20 K SNPs were selected based on F(ST) scores. For randomly selected 20 K SNP subsets, around 33% of the OD fell within the same range. Genomic similarity computed using SNPs selected based on F(ST) scores was always higher than using the same number of SNPs selected randomly. Maximum accuracies of 0.741 and 0.828 were achieved when 20 and 10 K SNPs were selected based on F(ST) scores in P(1) and P(2), respectively. CONCLUSIONS: Genomic similarity could be maximized by the decrease in the number of selected SNPs, but it also leads to a decrease in the percentage of genetic variation explained by the selected markers. Finding the balance between these two parameters could optimize the accuracy of GS in the presence of high density marker panels. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12863-019-0720-5) contains supplementary material, which is available to authorized users.
format	Online Article Text
id	pubmed-6387489
institution	National Center for Biotechnology Information
language	English
publishDate	2019
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-63874892019-03-04 Increasing accuracy of genomic selection in presence of high density marker panels through the prioritization of relevant polymorphisms Chang, Ling-Yun Toghiani, Sajjad Aggrey, Samuel E. Rekaya, Romdhane BMC Genet Research Article BACKGROUND: It becomes clear that the increase in the density of marker panels and even the use of sequence data didn’t result in any meaningful increase in the accuracy of genomic selection (GS) using either regression (RM) or variance component (VC) approaches. This is in part due to the limitations of current methods. Association model are well over-parameterized and suffer from severe co-linearity and lack of statistical power. Even when the variant effects are not directly estimated using VC based approaches, the genomic relationships didn’t improve after the marker density exceeded a certain threshold. SNP prioritization-based fixation index (F(ST)) scores were used to track the majority of significant QTL and to reduce the dimensionality of the association model. RESULTS: Two populations with average LD between adjacent markers of 0.3 (P1) and 0.7 (P2) were simulated. In both populations, the genomic data consisted of 400 K SNP markers distributed on 10 chromosomes. The density of simulated genomic data mimics roughly 1.2 million SNP markers in the bovine genome. The genomic relationship matrix (G) was calculated for each set of selected SNPs based on their F(ST) score and similar numbers of SNPs were selected randomly for comparison. Using all 400 K SNPs, 46% of the off-diagonal elements (OD) were between − 0.01 and 0.01. The same portion was 31, 23 and 16% when 80 K, 40 K and 20 K SNPs were selected based on F(ST) scores. For randomly selected 20 K SNP subsets, around 33% of the OD fell within the same range. Genomic similarity computed using SNPs selected based on F(ST) scores was always higher than using the same number of SNPs selected randomly. Maximum accuracies of 0.741 and 0.828 were achieved when 20 and 10 K SNPs were selected based on F(ST) scores in P(1) and P(2), respectively. CONCLUSIONS: Genomic similarity could be maximized by the decrease in the number of selected SNPs, but it also leads to a decrease in the percentage of genetic variation explained by the selected markers. Finding the balance between these two parameters could optimize the accuracy of GS in the presence of high density marker panels. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12863-019-0720-5) contains supplementary material, which is available to authorized users. BioMed Central 2019-02-22 /pmc/articles/PMC6387489/ /pubmed/30795734 http://dx.doi.org/10.1186/s12863-019-0720-5 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle	Research Article Chang, Ling-Yun Toghiani, Sajjad Aggrey, Samuel E. Rekaya, Romdhane Increasing accuracy of genomic selection in presence of high density marker panels through the prioritization of relevant polymorphisms
title	Increasing accuracy of genomic selection in presence of high density marker panels through the prioritization of relevant polymorphisms
title_full	Increasing accuracy of genomic selection in presence of high density marker panels through the prioritization of relevant polymorphisms
title_fullStr	Increasing accuracy of genomic selection in presence of high density marker panels through the prioritization of relevant polymorphisms
title_full_unstemmed	Increasing accuracy of genomic selection in presence of high density marker panels through the prioritization of relevant polymorphisms
title_short	Increasing accuracy of genomic selection in presence of high density marker panels through the prioritization of relevant polymorphisms
title_sort	increasing accuracy of genomic selection in presence of high density marker panels through the prioritization of relevant polymorphisms
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6387489/ https://www.ncbi.nlm.nih.gov/pubmed/30795734 http://dx.doi.org/10.1186/s12863-019-0720-5
work_keys_str_mv	AT changlingyun increasingaccuracyofgenomicselectioninpresenceofhighdensitymarkerpanelsthroughtheprioritizationofrelevantpolymorphisms AT toghianisajjad increasingaccuracyofgenomicselectioninpresenceofhighdensitymarkerpanelsthroughtheprioritizationofrelevantpolymorphisms AT aggreysamuele increasingaccuracyofgenomicselectioninpresenceofhighdensitymarkerpanelsthroughtheprioritizationofrelevantpolymorphisms AT rekayaromdhane increasingaccuracyofgenomicselectioninpresenceofhighdensitymarkerpanelsthroughtheprioritizationofrelevantpolymorphisms

Increasing accuracy of genomic selection in presence of high density marker panels through the prioritization of relevant polymorphisms

Ejemplares similares