Cargando…

Use of partial least squares regression to impute SNP genotypes in Italian Cattle breeds

BACKGROUND: The objective of the present study was to test the ability of the partial least squares regression technique to impute genotypes from low density single nucleotide polymorphisms (SNP) panels i.e. 3K or 7K to a high density panel with 50K SNP. No pedigree information was used. METHODS: Da...

Descripción completa

Detalles Bibliográficos
Autores principales: Dimauro, Corrado, Cellesi, Massimo, Gaspa, Giustino, Ajmone-Marsan, Paolo, Steri, Roberto, Marras, Gabriele, Macciotta, Nicolò PP
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3716726/
https://www.ncbi.nlm.nih.gov/pubmed/23738947
http://dx.doi.org/10.1186/1297-9686-45-15
_version_ 1782277585252646912
author Dimauro, Corrado
Cellesi, Massimo
Gaspa, Giustino
Ajmone-Marsan, Paolo
Steri, Roberto
Marras, Gabriele
Macciotta, Nicolò PP
author_facet Dimauro, Corrado
Cellesi, Massimo
Gaspa, Giustino
Ajmone-Marsan, Paolo
Steri, Roberto
Marras, Gabriele
Macciotta, Nicolò PP
author_sort Dimauro, Corrado
collection PubMed
description BACKGROUND: The objective of the present study was to test the ability of the partial least squares regression technique to impute genotypes from low density single nucleotide polymorphisms (SNP) panels i.e. 3K or 7K to a high density panel with 50K SNP. No pedigree information was used. METHODS: Data consisted of 2093 Holstein, 749 Brown Swiss and 479 Simmental bulls genotyped with the Illumina 50K Beadchip. First, a single-breed approach was applied by using only data from Holstein animals. Then, to enlarge the training population, data from the three breeds were combined and a multi-breed analysis was performed. Accuracies of genotypes imputed using the partial least squares regression method were compared with those obtained by using the Beagle software. The impact of genotype imputation on breeding value prediction was evaluated for milk yield, fat content and protein content. RESULTS: In the single-breed approach, the accuracy of imputation using partial least squares regression was around 90 and 94% for the 3K and 7K platforms, respectively; corresponding accuracies obtained with Beagle were around 85% and 90%. Moreover, computing time required by the partial least squares regression method was on average around 10 times lower than computing time required by Beagle. Using the partial least squares regression method in the multi-breed resulted in lower imputation accuracies than using single-breed data. The impact of the SNP-genotype imputation on the accuracy of direct genomic breeding values was small. The correlation between estimates of genetic merit obtained by using imputed versus actual genotypes was around 0.96 for the 7K chip. CONCLUSIONS: Results of the present work suggested that the partial least squares regression imputation method could be useful to impute SNP genotypes when pedigree information is not available.
format Online
Article
Text
id pubmed-3716726
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-37167262013-07-22 Use of partial least squares regression to impute SNP genotypes in Italian Cattle breeds Dimauro, Corrado Cellesi, Massimo Gaspa, Giustino Ajmone-Marsan, Paolo Steri, Roberto Marras, Gabriele Macciotta, Nicolò PP Genet Sel Evol Research BACKGROUND: The objective of the present study was to test the ability of the partial least squares regression technique to impute genotypes from low density single nucleotide polymorphisms (SNP) panels i.e. 3K or 7K to a high density panel with 50K SNP. No pedigree information was used. METHODS: Data consisted of 2093 Holstein, 749 Brown Swiss and 479 Simmental bulls genotyped with the Illumina 50K Beadchip. First, a single-breed approach was applied by using only data from Holstein animals. Then, to enlarge the training population, data from the three breeds were combined and a multi-breed analysis was performed. Accuracies of genotypes imputed using the partial least squares regression method were compared with those obtained by using the Beagle software. The impact of genotype imputation on breeding value prediction was evaluated for milk yield, fat content and protein content. RESULTS: In the single-breed approach, the accuracy of imputation using partial least squares regression was around 90 and 94% for the 3K and 7K platforms, respectively; corresponding accuracies obtained with Beagle were around 85% and 90%. Moreover, computing time required by the partial least squares regression method was on average around 10 times lower than computing time required by Beagle. Using the partial least squares regression method in the multi-breed resulted in lower imputation accuracies than using single-breed data. The impact of the SNP-genotype imputation on the accuracy of direct genomic breeding values was small. The correlation between estimates of genetic merit obtained by using imputed versus actual genotypes was around 0.96 for the 7K chip. CONCLUSIONS: Results of the present work suggested that the partial least squares regression imputation method could be useful to impute SNP genotypes when pedigree information is not available. BioMed Central 2013-06-05 /pmc/articles/PMC3716726/ /pubmed/23738947 http://dx.doi.org/10.1186/1297-9686-45-15 Text en Copyright © 2013 Dimauro et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Dimauro, Corrado
Cellesi, Massimo
Gaspa, Giustino
Ajmone-Marsan, Paolo
Steri, Roberto
Marras, Gabriele
Macciotta, Nicolò PP
Use of partial least squares regression to impute SNP genotypes in Italian Cattle breeds
title Use of partial least squares regression to impute SNP genotypes in Italian Cattle breeds
title_full Use of partial least squares regression to impute SNP genotypes in Italian Cattle breeds
title_fullStr Use of partial least squares regression to impute SNP genotypes in Italian Cattle breeds
title_full_unstemmed Use of partial least squares regression to impute SNP genotypes in Italian Cattle breeds
title_short Use of partial least squares regression to impute SNP genotypes in Italian Cattle breeds
title_sort use of partial least squares regression to impute snp genotypes in italian cattle breeds
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3716726/
https://www.ncbi.nlm.nih.gov/pubmed/23738947
http://dx.doi.org/10.1186/1297-9686-45-15
work_keys_str_mv AT dimaurocorrado useofpartialleastsquaresregressiontoimputesnpgenotypesinitaliancattlebreeds
AT cellesimassimo useofpartialleastsquaresregressiontoimputesnpgenotypesinitaliancattlebreeds
AT gaspagiustino useofpartialleastsquaresregressiontoimputesnpgenotypesinitaliancattlebreeds
AT ajmonemarsanpaolo useofpartialleastsquaresregressiontoimputesnpgenotypesinitaliancattlebreeds
AT steriroberto useofpartialleastsquaresregressiontoimputesnpgenotypesinitaliancattlebreeds
AT marrasgabriele useofpartialleastsquaresregressiontoimputesnpgenotypesinitaliancattlebreeds
AT macciottanicolopp useofpartialleastsquaresregressiontoimputesnpgenotypesinitaliancattlebreeds