Cargando…
Linear models for breeding values prediction in haplotype-assisted selection - an analysis of QTL-MAS Workshop 2011 Data
BACKGROUND: The aim of this study was to estimate haplotype effects and then to predict breeding values using linear models. The haplotype based analysis enables avoidance of loosing information due to linkage disequilibrium between single markers. There are also less explanatory variables in the li...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3363153/ https://www.ncbi.nlm.nih.gov/pubmed/22640464 http://dx.doi.org/10.1186/1753-6561-6-S2-S11 |
_version_ | 1782234304614498304 |
---|---|
author | Mucha, Anna Wierzbicki, Heliodor |
author_facet | Mucha, Anna Wierzbicki, Heliodor |
author_sort | Mucha, Anna |
collection | PubMed |
description | BACKGROUND: The aim of this study was to estimate haplotype effects and then to predict breeding values using linear models. The haplotype based analysis enables avoidance of loosing information due to linkage disequilibrium between single markers. There are also less explanatory variables in the linear model which makes the estimation more reliable. METHODS: Different methods and criteria for marker and haplotype selection were considered. First, markers with MAF lower than 5% where excluded from the data set. Then, SNPs in complete linkage disequilibrium where selected. Next step was to construct haplotypes and to estimate their frequencies basing on selected SNPs. The haplotypes with a frequency lower than 1% were not considered in further analysis. Chosen haplotypes were used as the explanatory variables in the linear models for breeding values prediction. Linear models with fixed and random haplotype effects as well as animal model were tested. RESULTS: The number of markers was limited to 1206, 1189, 1249, 1288 and 1167 for chromosome 1, 2, 3, 4 and 5, respectively due to MAF criterion. In total 409 subsets of SNPs with r(2)=1 were found. 1476 haplotypes with different lengths were inferred. The frequencies of 817 haplotypes were higher than 1% - 184 for the first chromosome, 172 for the second, 131 for the third, 146 for the forth and 184 haplotypes for the fifth chromosome. The haplotype effects estimated using random models were comparable and more precise in prediction for individuals with unknown phenotypes. A few haplotypes with large effects were found when their effects were defined as fixed in the linear model . The correlations of the predicted breeding values with true breeding values were not that high. This could be brought about by selection criteria imposed on the genotype data which led to substantial reduction of number of markers. CONCLUSIONS: Although not many markers were considered in the study, the results obtained show that the implemented approach can be considered as quite promising. The haplotype approach let to avoid high dimensional models as compared with single SNPs models. |
format | Online Article Text |
id | pubmed-3363153 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-33631532012-06-01 Linear models for breeding values prediction in haplotype-assisted selection - an analysis of QTL-MAS Workshop 2011 Data Mucha, Anna Wierzbicki, Heliodor BMC Proc Proceedings BACKGROUND: The aim of this study was to estimate haplotype effects and then to predict breeding values using linear models. The haplotype based analysis enables avoidance of loosing information due to linkage disequilibrium between single markers. There are also less explanatory variables in the linear model which makes the estimation more reliable. METHODS: Different methods and criteria for marker and haplotype selection were considered. First, markers with MAF lower than 5% where excluded from the data set. Then, SNPs in complete linkage disequilibrium where selected. Next step was to construct haplotypes and to estimate their frequencies basing on selected SNPs. The haplotypes with a frequency lower than 1% were not considered in further analysis. Chosen haplotypes were used as the explanatory variables in the linear models for breeding values prediction. Linear models with fixed and random haplotype effects as well as animal model were tested. RESULTS: The number of markers was limited to 1206, 1189, 1249, 1288 and 1167 for chromosome 1, 2, 3, 4 and 5, respectively due to MAF criterion. In total 409 subsets of SNPs with r(2)=1 were found. 1476 haplotypes with different lengths were inferred. The frequencies of 817 haplotypes were higher than 1% - 184 for the first chromosome, 172 for the second, 131 for the third, 146 for the forth and 184 haplotypes for the fifth chromosome. The haplotype effects estimated using random models were comparable and more precise in prediction for individuals with unknown phenotypes. A few haplotypes with large effects were found when their effects were defined as fixed in the linear model . The correlations of the predicted breeding values with true breeding values were not that high. This could be brought about by selection criteria imposed on the genotype data which led to substantial reduction of number of markers. CONCLUSIONS: Although not many markers were considered in the study, the results obtained show that the implemented approach can be considered as quite promising. The haplotype approach let to avoid high dimensional models as compared with single SNPs models. BioMed Central 2012-05-21 /pmc/articles/PMC3363153/ /pubmed/22640464 http://dx.doi.org/10.1186/1753-6561-6-S2-S11 Text en Copyright ©2012 Mucha and Wierzbicki; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Proceedings Mucha, Anna Wierzbicki, Heliodor Linear models for breeding values prediction in haplotype-assisted selection - an analysis of QTL-MAS Workshop 2011 Data |
title | Linear models for breeding values prediction in haplotype-assisted selection - an analysis of QTL-MAS Workshop 2011 Data |
title_full | Linear models for breeding values prediction in haplotype-assisted selection - an analysis of QTL-MAS Workshop 2011 Data |
title_fullStr | Linear models for breeding values prediction in haplotype-assisted selection - an analysis of QTL-MAS Workshop 2011 Data |
title_full_unstemmed | Linear models for breeding values prediction in haplotype-assisted selection - an analysis of QTL-MAS Workshop 2011 Data |
title_short | Linear models for breeding values prediction in haplotype-assisted selection - an analysis of QTL-MAS Workshop 2011 Data |
title_sort | linear models for breeding values prediction in haplotype-assisted selection - an analysis of qtl-mas workshop 2011 data |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3363153/ https://www.ncbi.nlm.nih.gov/pubmed/22640464 http://dx.doi.org/10.1186/1753-6561-6-S2-S11 |
work_keys_str_mv | AT muchaanna linearmodelsforbreedingvaluespredictioninhaplotypeassistedselectionananalysisofqtlmasworkshop2011data AT wierzbickiheliodor linearmodelsforbreedingvaluespredictioninhaplotypeassistedselectionananalysisofqtlmasworkshop2011data |