Cargando…

Pre-selection of most significant SNPS for the estimation of genomic breeding values

The availability of a large amount of SNP markers throughout the genome of different livestock species offers the opportunity to estimate genomic breeding values (GEBVs). However, the estimation of many effects in a data set of limited size represent a severe statistical problem. A pre-selection of...

Descripción completa

Detalles Bibliográficos
Autores principales: Macciotta, Nicolò PP, Gaspa, Giustino, Steri, Roberto, Pieramati, Camillo, Carnier, Paolo, Dimauro, Corrado
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2654495/
https://www.ncbi.nlm.nih.gov/pubmed/19278540
_version_ 1782165375042977792
author Macciotta, Nicolò PP
Gaspa, Giustino
Steri, Roberto
Pieramati, Camillo
Carnier, Paolo
Dimauro, Corrado
author_facet Macciotta, Nicolò PP
Gaspa, Giustino
Steri, Roberto
Pieramati, Camillo
Carnier, Paolo
Dimauro, Corrado
author_sort Macciotta, Nicolò PP
collection PubMed
description The availability of a large amount of SNP markers throughout the genome of different livestock species offers the opportunity to estimate genomic breeding values (GEBVs). However, the estimation of many effects in a data set of limited size represent a severe statistical problem. A pre-selection of SNPS based on single regression may provide a reasonable compromise between accuracy of results, number of independent variables to be considered and computing requirements. A total of 595 and 618 SNPS were pre-selected using a simple linear regression for each SNP, based on phenotypes or polygenic EBVs, respectively, with an average distance of 9–10 cM between them. Chromosome four had the largest frequency of selected SNPS. Average correlations between GEBVs and TBVs were about 0.82 and 0.73 for the TRAINING generations when phenotypes or polygenic EBVs were considered as dependent variable, whereas they tend to decrease to 0.66 and 0.54 for the PREDICTION generations. The pre-selection of SNPs using the phenotypes as dependent variable together with a BLUP estimation of marker genotype effects using a variance contribution of each marker equal to σ(2)(a)/n(snps )resulted in a remarkable accuracy of GEBV estimation (0.77) in the PREDICTION generations.
format Text
id pubmed-2654495
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26544952009-03-13 Pre-selection of most significant SNPS for the estimation of genomic breeding values Macciotta, Nicolò PP Gaspa, Giustino Steri, Roberto Pieramati, Camillo Carnier, Paolo Dimauro, Corrado BMC Proc Proceedings The availability of a large amount of SNP markers throughout the genome of different livestock species offers the opportunity to estimate genomic breeding values (GEBVs). However, the estimation of many effects in a data set of limited size represent a severe statistical problem. A pre-selection of SNPS based on single regression may provide a reasonable compromise between accuracy of results, number of independent variables to be considered and computing requirements. A total of 595 and 618 SNPS were pre-selected using a simple linear regression for each SNP, based on phenotypes or polygenic EBVs, respectively, with an average distance of 9–10 cM between them. Chromosome four had the largest frequency of selected SNPS. Average correlations between GEBVs and TBVs were about 0.82 and 0.73 for the TRAINING generations when phenotypes or polygenic EBVs were considered as dependent variable, whereas they tend to decrease to 0.66 and 0.54 for the PREDICTION generations. The pre-selection of SNPs using the phenotypes as dependent variable together with a BLUP estimation of marker genotype effects using a variance contribution of each marker equal to σ(2)(a)/n(snps )resulted in a remarkable accuracy of GEBV estimation (0.77) in the PREDICTION generations. BioMed Central 2009-02-23 /pmc/articles/PMC2654495/ /pubmed/19278540 Text en Copyright © 2009 Macciotta et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Macciotta, Nicolò PP
Gaspa, Giustino
Steri, Roberto
Pieramati, Camillo
Carnier, Paolo
Dimauro, Corrado
Pre-selection of most significant SNPS for the estimation of genomic breeding values
title Pre-selection of most significant SNPS for the estimation of genomic breeding values
title_full Pre-selection of most significant SNPS for the estimation of genomic breeding values
title_fullStr Pre-selection of most significant SNPS for the estimation of genomic breeding values
title_full_unstemmed Pre-selection of most significant SNPS for the estimation of genomic breeding values
title_short Pre-selection of most significant SNPS for the estimation of genomic breeding values
title_sort pre-selection of most significant snps for the estimation of genomic breeding values
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2654495/
https://www.ncbi.nlm.nih.gov/pubmed/19278540
work_keys_str_mv AT macciottanicolopp preselectionofmostsignificantsnpsfortheestimationofgenomicbreedingvalues
AT gaspagiustino preselectionofmostsignificantsnpsfortheestimationofgenomicbreedingvalues
AT steriroberto preselectionofmostsignificantsnpsfortheestimationofgenomicbreedingvalues
AT pieramaticamillo preselectionofmostsignificantsnpsfortheestimationofgenomicbreedingvalues
AT carnierpaolo preselectionofmostsignificantsnpsfortheestimationofgenomicbreedingvalues
AT dimaurocorrado preselectionofmostsignificantsnpsfortheestimationofgenomicbreedingvalues