
Linear models for breeding values prediction in haplotype-assisted selection - an analysis of QTL-MAS Workshop 2011 Data

BACKGROUND: The aim of this study was to estimate haplotype effects and then to predict breeding values using linear models. Haplotype-based analysis avoids losing information due to linkage disequilibrium between single markers. There are also fewer explanatory variables in the li...


Bibliographic Details
Main Authors: Mucha, Anna, Wierzbicki, Heliodor
Format: Online Article Text
Language: English
Published: BioMed Central 2012
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3363153/
https://www.ncbi.nlm.nih.gov/pubmed/22640464
http://dx.doi.org/10.1186/1753-6561-6-S2-S11
_version_ 1782234304614498304
author Mucha, Anna
Wierzbicki, Heliodor
author_facet Mucha, Anna
Wierzbicki, Heliodor
author_sort Mucha, Anna
collection PubMed
description BACKGROUND: The aim of this study was to estimate haplotype effects and then to predict breeding values using linear models. Haplotype-based analysis avoids losing information due to linkage disequilibrium between single markers. There are also fewer explanatory variables in the linear model, which makes the estimation more reliable. METHODS: Different methods and criteria for marker and haplotype selection were considered. First, markers with MAF lower than 5% were excluded from the data set. Then, SNPs in complete linkage disequilibrium were selected. The next step was to construct haplotypes and to estimate their frequencies based on the selected SNPs. Haplotypes with a frequency lower than 1% were not considered in further analysis. The chosen haplotypes were used as the explanatory variables in the linear models for breeding value prediction. Linear models with fixed and random haplotype effects, as well as an animal model, were tested. RESULTS: The number of markers was limited to 1206, 1189, 1249, 1288 and 1167 for chromosomes 1, 2, 3, 4 and 5, respectively, due to the MAF criterion. In total, 409 subsets of SNPs with r² = 1 were found, and 1476 haplotypes of different lengths were inferred. The frequencies of 817 haplotypes were higher than 1%: 184 for the first chromosome, 172 for the second, 131 for the third, 146 for the fourth and 184 for the fifth. The haplotype effects estimated using random models were comparable and more precise in prediction for individuals with unknown phenotypes. A few haplotypes with large effects were found when their effects were defined as fixed in the linear model. The correlations of the predicted breeding values with the true breeding values were not high. This could be a result of the selection criteria imposed on the genotype data, which led to a substantial reduction in the number of markers.
CONCLUSIONS: Although not many markers were considered in the study, the results obtained show that the implemented approach can be considered quite promising. The haplotype approach makes it possible to avoid high-dimensional models, as compared with single-SNP models.
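The two frequency filters named in METHODS (markers with MAF below 5% excluded, haplotypes with frequency below 1% dropped) can be illustrated with a minimal Python sketch. This is not the authors' code. It assumes genotypes coded as 0/1/2 allele counts and haplotypes that are already phased and given as string labels, whereas the abstract only says that haplotype frequencies were estimated. The function names and the toy data are hypothetical.

```python
from collections import Counter

import numpy as np


def minor_allele_frequency(genotypes):
    """Per-SNP MAF for an (individuals x SNPs) array of 0/1/2 allele counts.

    The 0/1/2 coding is an assumption of this sketch, not stated in the abstract.
    """
    allele_freq = genotypes.mean(axis=0) / 2.0
    return np.minimum(allele_freq, 1.0 - allele_freq)


def filter_by_maf(genotypes, threshold=0.05):
    """Drop SNPs whose MAF is lower than the threshold (5% in the abstract)."""
    keep = minor_allele_frequency(genotypes) >= threshold
    return genotypes[:, keep], keep


def frequent_haplotypes(haplotypes, threshold=0.01):
    """Return {haplotype: frequency}, dropping haplotypes below the threshold.

    Assumes one label per phased chromosome copy (hypothetical input format).
    """
    counts = Counter(haplotypes)
    total = len(haplotypes)
    return {h: c / total for h, c in counts.items() if c / total >= threshold}


# Toy data: 4 individuals, 3 SNPs; the first SNP is monomorphic.
toy_genotypes = np.array([[0, 1, 2],
                          [0, 1, 2],
                          [0, 0, 2],
                          [0, 1, 1]])
filtered, kept = filter_by_maf(toy_genotypes)

# Toy haplotype labels: "aB" occurs once in 200 copies (0.5%) and is dropped.
toy_haplotypes = ["AB"] * 99 + ["Ab"] * 100 + ["aB"]
common = frequent_haplotypes(toy_haplotypes)
```

In this sketch the monomorphic SNP is removed and only the two common haplotypes survive; the retained haplotypes would then be the explanatory variables of the linear model.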
format Online
Article
Text
id pubmed-3363153
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-3363153 2012-06-01 Linear models for breeding values prediction in haplotype-assisted selection - an analysis of QTL-MAS Workshop 2011 Data Mucha, Anna Wierzbicki, Heliodor BMC Proc Proceedings BACKGROUND: The aim of this study was to estimate haplotype effects and then to predict breeding values using linear models. Haplotype-based analysis avoids losing information due to linkage disequilibrium between single markers. There are also fewer explanatory variables in the linear model, which makes the estimation more reliable. METHODS: Different methods and criteria for marker and haplotype selection were considered. First, markers with MAF lower than 5% were excluded from the data set. Then, SNPs in complete linkage disequilibrium were selected. The next step was to construct haplotypes and to estimate their frequencies based on the selected SNPs. Haplotypes with a frequency lower than 1% were not considered in further analysis. The chosen haplotypes were used as the explanatory variables in the linear models for breeding value prediction. Linear models with fixed and random haplotype effects, as well as an animal model, were tested. RESULTS: The number of markers was limited to 1206, 1189, 1249, 1288 and 1167 for chromosomes 1, 2, 3, 4 and 5, respectively, due to the MAF criterion. In total, 409 subsets of SNPs with r² = 1 were found, and 1476 haplotypes of different lengths were inferred. The frequencies of 817 haplotypes were higher than 1%: 184 for the first chromosome, 172 for the second, 131 for the third, 146 for the fourth and 184 for the fifth. The haplotype effects estimated using random models were comparable and more precise in prediction for individuals with unknown phenotypes. A few haplotypes with large effects were found when their effects were defined as fixed in the linear model. The correlations of the predicted breeding values with the true breeding values were not high.
This could be a result of the selection criteria imposed on the genotype data, which led to a substantial reduction in the number of markers. CONCLUSIONS: Although not many markers were considered in the study, the results obtained show that the implemented approach can be considered quite promising. The haplotype approach makes it possible to avoid high-dimensional models, as compared with single-SNP models. BioMed Central 2012-05-21 /pmc/articles/PMC3363153/ /pubmed/22640464 http://dx.doi.org/10.1186/1753-6561-6-S2-S11 Text en Copyright ©2012 Mucha and Wierzbicki; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Mucha, Anna
Wierzbicki, Heliodor
Linear models for breeding values prediction in haplotype-assisted selection - an analysis of QTL-MAS Workshop 2011 Data
title Linear models for breeding values prediction in haplotype-assisted selection - an analysis of QTL-MAS Workshop 2011 Data
title_full Linear models for breeding values prediction in haplotype-assisted selection - an analysis of QTL-MAS Workshop 2011 Data
title_fullStr Linear models for breeding values prediction in haplotype-assisted selection - an analysis of QTL-MAS Workshop 2011 Data
title_full_unstemmed Linear models for breeding values prediction in haplotype-assisted selection - an analysis of QTL-MAS Workshop 2011 Data
title_short Linear models for breeding values prediction in haplotype-assisted selection - an analysis of QTL-MAS Workshop 2011 Data
title_sort linear models for breeding values prediction in haplotype-assisted selection - an analysis of qtl-mas workshop 2011 data
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3363153/
https://www.ncbi.nlm.nih.gov/pubmed/22640464
http://dx.doi.org/10.1186/1753-6561-6-S2-S11
work_keys_str_mv AT muchaanna linearmodelsforbreedingvaluespredictioninhaplotypeassistedselectionananalysisofqtlmasworkshop2011data
AT wierzbickiheliodor linearmodelsforbreedingvaluespredictioninhaplotypeassistedselectionananalysisofqtlmasworkshop2011data