Cargando…
Use of partial least squares regression to impute SNP genotypes in Italian Cattle breeds
BACKGROUND: The objective of the present study was to test the ability of the partial least squares regression technique to impute genotypes from low density single nucleotide polymorphisms (SNP) panels i.e. 3K or 7K to a high density panel with 50K SNP. No pedigree information was used. METHODS: Da...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3716726/ https://www.ncbi.nlm.nih.gov/pubmed/23738947 http://dx.doi.org/10.1186/1297-9686-45-15 |
_version_ | 1782277585252646912 |
---|---|
author | Dimauro, Corrado Cellesi, Massimo Gaspa, Giustino Ajmone-Marsan, Paolo Steri, Roberto Marras, Gabriele Macciotta, Nicolò PP |
author_facet | Dimauro, Corrado Cellesi, Massimo Gaspa, Giustino Ajmone-Marsan, Paolo Steri, Roberto Marras, Gabriele Macciotta, Nicolò PP |
author_sort | Dimauro, Corrado |
collection | PubMed |
description | BACKGROUND: The objective of the present study was to test the ability of the partial least squares regression technique to impute genotypes from low density single nucleotide polymorphisms (SNP) panels i.e. 3K or 7K to a high density panel with 50K SNP. No pedigree information was used. METHODS: Data consisted of 2093 Holstein, 749 Brown Swiss and 479 Simmental bulls genotyped with the Illumina 50K Beadchip. First, a single-breed approach was applied by using only data from Holstein animals. Then, to enlarge the training population, data from the three breeds were combined and a multi-breed analysis was performed. Accuracies of genotypes imputed using the partial least squares regression method were compared with those obtained by using the Beagle software. The impact of genotype imputation on breeding value prediction was evaluated for milk yield, fat content and protein content. RESULTS: In the single-breed approach, the accuracy of imputation using partial least squares regression was around 90 and 94% for the 3K and 7K platforms, respectively; corresponding accuracies obtained with Beagle were around 85% and 90%. Moreover, computing time required by the partial least squares regression method was on average around 10 times lower than computing time required by Beagle. Using the partial least squares regression method in the multi-breed resulted in lower imputation accuracies than using single-breed data. The impact of the SNP-genotype imputation on the accuracy of direct genomic breeding values was small. The correlation between estimates of genetic merit obtained by using imputed versus actual genotypes was around 0.96 for the 7K chip. CONCLUSIONS: Results of the present work suggested that the partial least squares regression imputation method could be useful to impute SNP genotypes when pedigree information is not available. |
format | Online Article Text |
id | pubmed-3716726 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-37167262013-07-22 Use of partial least squares regression to impute SNP genotypes in Italian Cattle breeds Dimauro, Corrado Cellesi, Massimo Gaspa, Giustino Ajmone-Marsan, Paolo Steri, Roberto Marras, Gabriele Macciotta, Nicolò PP Genet Sel Evol Research BACKGROUND: The objective of the present study was to test the ability of the partial least squares regression technique to impute genotypes from low density single nucleotide polymorphisms (SNP) panels i.e. 3K or 7K to a high density panel with 50K SNP. No pedigree information was used. METHODS: Data consisted of 2093 Holstein, 749 Brown Swiss and 479 Simmental bulls genotyped with the Illumina 50K Beadchip. First, a single-breed approach was applied by using only data from Holstein animals. Then, to enlarge the training population, data from the three breeds were combined and a multi-breed analysis was performed. Accuracies of genotypes imputed using the partial least squares regression method were compared with those obtained by using the Beagle software. The impact of genotype imputation on breeding value prediction was evaluated for milk yield, fat content and protein content. RESULTS: In the single-breed approach, the accuracy of imputation using partial least squares regression was around 90 and 94% for the 3K and 7K platforms, respectively; corresponding accuracies obtained with Beagle were around 85% and 90%. Moreover, computing time required by the partial least squares regression method was on average around 10 times lower than computing time required by Beagle. Using the partial least squares regression method in the multi-breed resulted in lower imputation accuracies than using single-breed data. The impact of the SNP-genotype imputation on the accuracy of direct genomic breeding values was small. The correlation between estimates of genetic merit obtained by using imputed versus actual genotypes was around 0.96 for the 7K chip. CONCLUSIONS: Results of the present work suggested that the partial least squares regression imputation method could be useful to impute SNP genotypes when pedigree information is not available. BioMed Central 2013-06-05 /pmc/articles/PMC3716726/ /pubmed/23738947 http://dx.doi.org/10.1186/1297-9686-45-15 Text en Copyright © 2013 Dimauro et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Dimauro, Corrado Cellesi, Massimo Gaspa, Giustino Ajmone-Marsan, Paolo Steri, Roberto Marras, Gabriele Macciotta, Nicolò PP Use of partial least squares regression to impute SNP genotypes in Italian Cattle breeds |
title | Use of partial least squares regression to impute SNP genotypes in Italian Cattle breeds |
title_full | Use of partial least squares regression to impute SNP genotypes in Italian Cattle breeds |
title_fullStr | Use of partial least squares regression to impute SNP genotypes in Italian Cattle breeds |
title_full_unstemmed | Use of partial least squares regression to impute SNP genotypes in Italian Cattle breeds |
title_short | Use of partial least squares regression to impute SNP genotypes in Italian Cattle breeds |
title_sort | use of partial least squares regression to impute snp genotypes in italian cattle breeds |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3716726/ https://www.ncbi.nlm.nih.gov/pubmed/23738947 http://dx.doi.org/10.1186/1297-9686-45-15 |
work_keys_str_mv | AT dimaurocorrado useofpartialleastsquaresregressiontoimputesnpgenotypesinitaliancattlebreeds AT cellesimassimo useofpartialleastsquaresregressiontoimputesnpgenotypesinitaliancattlebreeds AT gaspagiustino useofpartialleastsquaresregressiontoimputesnpgenotypesinitaliancattlebreeds AT ajmonemarsanpaolo useofpartialleastsquaresregressiontoimputesnpgenotypesinitaliancattlebreeds AT steriroberto useofpartialleastsquaresregressiontoimputesnpgenotypesinitaliancattlebreeds AT marrasgabriele useofpartialleastsquaresregressiontoimputesnpgenotypesinitaliancattlebreeds AT macciottanicolopp useofpartialleastsquaresregressiontoimputesnpgenotypesinitaliancattlebreeds |