Cargando…

Incorporating Genetic Heterogeneity in Whole-Genome Regressions Using Interactions

Naturally and artificially selected populations usually exhibit some degree of stratification. In Genome-Wide Association Studies and in Whole-Genome Regressions (WGR) analyses, population stratification has been either ignored or dealt with as a potential confounder. However, systematic differences...

Descripción completa

Detalles Bibliográficos
Autores principales: de los Campos, Gustavo, Veturi, Yogasudha, Vazquez, Ana I., Lehermeier, Christina, Pérez-Rodríguez, Paulino
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4666286/
https://www.ncbi.nlm.nih.gov/pubmed/26660276
http://dx.doi.org/10.1007/s13253-015-0222-5
_version_ 1782403688900329472
author de los Campos, Gustavo
Veturi, Yogasudha
Vazquez, Ana I.
Lehermeier, Christina
Pérez-Rodríguez, Paulino
author_facet de los Campos, Gustavo
Veturi, Yogasudha
Vazquez, Ana I.
Lehermeier, Christina
Pérez-Rodríguez, Paulino
author_sort de los Campos, Gustavo
collection PubMed
description Naturally and artificially selected populations usually exhibit some degree of stratification. In Genome-Wide Association Studies and in Whole-Genome Regressions (WGR) analyses, population stratification has been either ignored or dealt with as a potential confounder. However, systematic differences in allele frequency and in patterns of linkage disequilibrium can induce sub-population-specific effects. From this perspective, structure acts as an effect modifier rather than as a confounder. In this article, we extend WGR models commonly used in plant and animal breeding to allow for sub-population-specific effects. This is achieved by decomposing marker effects into main effects and interaction components that describe group-specific deviations. The model can be used both with variable selection and shrinkage methods and can be implemented using existing software for genomic selection. Using a wheat and a pig breeding data set, we compare parameter estimates and the prediction accuracy of the interaction WGR model with WGR analysis ignoring population stratification (across-group analysis) and with a stratified (i.e., within-sub-population) WGR analysis. The interaction model renders trait-specific estimates of the average correlation of effects between sub-populations; we find that such correlation not only depends on the extent of genetic differentiation in allele frequencies between groups but also varies among traits. The evaluation of prediction accuracy shows a modest superiority of the interaction model relative to the other two approaches. This superiority is the result of better stability in performance of the interaction models across data sets and traits; indeed, in almost all cases, the interaction model was either the best performing model or it performed close to the best performing model. ELECTRONIC SUPPLEMENTARY MATERIAL: Supplementary materials for this article are available at 10.1007/s13253-015-0222-5.
format Online
Article
Text
id pubmed-4666286
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-46662862015-12-09 Incorporating Genetic Heterogeneity in Whole-Genome Regressions Using Interactions de los Campos, Gustavo Veturi, Yogasudha Vazquez, Ana I. Lehermeier, Christina Pérez-Rodríguez, Paulino J Agric Biol Environ Stat Article Naturally and artificially selected populations usually exhibit some degree of stratification. In Genome-Wide Association Studies and in Whole-Genome Regressions (WGR) analyses, population stratification has been either ignored or dealt with as a potential confounder. However, systematic differences in allele frequency and in patterns of linkage disequilibrium can induce sub-population-specific effects. From this perspective, structure acts as an effect modifier rather than as a confounder. In this article, we extend WGR models commonly used in plant and animal breeding to allow for sub-population-specific effects. This is achieved by decomposing marker effects into main effects and interaction components that describe group-specific deviations. The model can be used both with variable selection and shrinkage methods and can be implemented using existing software for genomic selection. Using a wheat and a pig breeding data set, we compare parameter estimates and the prediction accuracy of the interaction WGR model with WGR analysis ignoring population stratification (across-group analysis) and with a stratified (i.e., within-sub-population) WGR analysis. The interaction model renders trait-specific estimates of the average correlation of effects between sub-populations; we find that such correlation not only depends on the extent of genetic differentiation in allele frequencies between groups but also varies among traits. The evaluation of prediction accuracy shows a modest superiority of the interaction model relative to the other two approaches. This superiority is the result of better stability in performance of the interaction models across data sets and traits; indeed, in almost all cases, the interaction model was either the best performing model or it performed close to the best performing model. ELECTRONIC SUPPLEMENTARY MATERIAL: Supplementary materials for this article are available at 10.1007/s13253-015-0222-5. Springer US 2015-11-09 2015 /pmc/articles/PMC4666286/ /pubmed/26660276 http://dx.doi.org/10.1007/s13253-015-0222-5 Text en © The Author(s) 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle Article
de los Campos, Gustavo
Veturi, Yogasudha
Vazquez, Ana I.
Lehermeier, Christina
Pérez-Rodríguez, Paulino
Incorporating Genetic Heterogeneity in Whole-Genome Regressions Using Interactions
title Incorporating Genetic Heterogeneity in Whole-Genome Regressions Using Interactions
title_full Incorporating Genetic Heterogeneity in Whole-Genome Regressions Using Interactions
title_fullStr Incorporating Genetic Heterogeneity in Whole-Genome Regressions Using Interactions
title_full_unstemmed Incorporating Genetic Heterogeneity in Whole-Genome Regressions Using Interactions
title_short Incorporating Genetic Heterogeneity in Whole-Genome Regressions Using Interactions
title_sort incorporating genetic heterogeneity in whole-genome regressions using interactions
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4666286/
https://www.ncbi.nlm.nih.gov/pubmed/26660276
http://dx.doi.org/10.1007/s13253-015-0222-5
work_keys_str_mv AT deloscamposgustavo incorporatinggeneticheterogeneityinwholegenomeregressionsusinginteractions
AT veturiyogasudha incorporatinggeneticheterogeneityinwholegenomeregressionsusinginteractions
AT vazquezanai incorporatinggeneticheterogeneityinwholegenomeregressionsusinginteractions
AT lehermeierchristina incorporatinggeneticheterogeneityinwholegenomeregressionsusinginteractions
AT perezrodriguezpaulino incorporatinggeneticheterogeneityinwholegenomeregressionsusinginteractions