Cargando…

The Use of Targeted Marker Subsets to Account for Population Structure and Relatedness in Genome-Wide Association Studies of Maize (Zea mays L.)

A typical plant genome-wide association study (GWAS) uses a mixed linear model (MLM) that includes a trait as the response variable, a marker as an explanatory variable, and fixed and random effect covariates accounting for population structure and relatedness. Although effective in controlling for...

Descripción completa

Detalles Bibliográficos
Autores principales:	Chen, Angela H., Lipka, Alexander E.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Genetics Society of America 2016
Materias:	Investigations
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4978891/ https://www.ncbi.nlm.nih.gov/pubmed/27233668 http://dx.doi.org/10.1534/g3.116.029090

_version_	1782447234266169344
author	Chen, Angela H. Lipka, Alexander E.
author_facet	Chen, Angela H. Lipka, Alexander E.
author_sort	Chen, Angela H.
collection	PubMed
description	A typical plant genome-wide association study (GWAS) uses a mixed linear model (MLM) that includes a trait as the response variable, a marker as an explanatory variable, and fixed and random effect covariates accounting for population structure and relatedness. Although effective in controlling for false positive signals, this model typically fails to detect signals that are correlated with population structure or are located in high linkage disequilibrium (LD) genomic regions. This result likely arises from each tested marker being used to estimate population structure and relatedness. Previous work has demonstrated that it is possible to increase the power of the MLM by estimating relatedness (i.e., kinship) with markers that are not located on the chromosome where the tested marker resides. To quantify the amount of additional significant signals one can expect using this so-called K_chr model, we reanalyzed Mendelian, polygenic, and complex traits in two maize (Zea mays L.) diversity panels that have been previously assessed using the traditional MLM. We demonstrated that the K_chr model could find more significant associations, especially in high LD regions. This finding is underscored by our identification of novel genomic signals proximal to the tocochromanol biosynthetic pathway gene ZmVTE1 that are associated with a ratio of tocotrienols. We conclude that the K_chr model can detect more intricate sources of allelic variation underlying agronomically important traits, and should therefore become more widely used for GWAS. To facilitate the implementation of the K_chr model, we provide code written in the R programming language.
format	Online Article Text
id	pubmed-4978891
institution	National Center for Biotechnology Information
language	English
publishDate	2016
publisher	Genetics Society of America
record_format	MEDLINE/PubMed
spelling	pubmed-49788912016-08-18 The Use of Targeted Marker Subsets to Account for Population Structure and Relatedness in Genome-Wide Association Studies of Maize (Zea mays L.) Chen, Angela H. Lipka, Alexander E. G3 (Bethesda) Investigations A typical plant genome-wide association study (GWAS) uses a mixed linear model (MLM) that includes a trait as the response variable, a marker as an explanatory variable, and fixed and random effect covariates accounting for population structure and relatedness. Although effective in controlling for false positive signals, this model typically fails to detect signals that are correlated with population structure or are located in high linkage disequilibrium (LD) genomic regions. This result likely arises from each tested marker being used to estimate population structure and relatedness. Previous work has demonstrated that it is possible to increase the power of the MLM by estimating relatedness (i.e., kinship) with markers that are not located on the chromosome where the tested marker resides. To quantify the amount of additional significant signals one can expect using this so-called K_chr model, we reanalyzed Mendelian, polygenic, and complex traits in two maize (Zea mays L.) diversity panels that have been previously assessed using the traditional MLM. We demonstrated that the K_chr model could find more significant associations, especially in high LD regions. This finding is underscored by our identification of novel genomic signals proximal to the tocochromanol biosynthetic pathway gene ZmVTE1 that are associated with a ratio of tocotrienols. We conclude that the K_chr model can detect more intricate sources of allelic variation underlying agronomically important traits, and should therefore become more widely used for GWAS. To facilitate the implementation of the K_chr model, we provide code written in the R programming language. Genetics Society of America 2016-05-26 /pmc/articles/PMC4978891/ /pubmed/27233668 http://dx.doi.org/10.1534/g3.116.029090 Text en Copyright © 2016 Chen and Lipka http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Investigations Chen, Angela H. Lipka, Alexander E. The Use of Targeted Marker Subsets to Account for Population Structure and Relatedness in Genome-Wide Association Studies of Maize (Zea mays L.)
title	The Use of Targeted Marker Subsets to Account for Population Structure and Relatedness in Genome-Wide Association Studies of Maize (Zea mays L.)
title_full	The Use of Targeted Marker Subsets to Account for Population Structure and Relatedness in Genome-Wide Association Studies of Maize (Zea mays L.)
title_fullStr	The Use of Targeted Marker Subsets to Account for Population Structure and Relatedness in Genome-Wide Association Studies of Maize (Zea mays L.)
title_full_unstemmed	The Use of Targeted Marker Subsets to Account for Population Structure and Relatedness in Genome-Wide Association Studies of Maize (Zea mays L.)
title_short	The Use of Targeted Marker Subsets to Account for Population Structure and Relatedness in Genome-Wide Association Studies of Maize (Zea mays L.)
title_sort	use of targeted marker subsets to account for population structure and relatedness in genome-wide association studies of maize (zea mays l.)
topic	Investigations
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4978891/ https://www.ncbi.nlm.nih.gov/pubmed/27233668 http://dx.doi.org/10.1534/g3.116.029090
work_keys_str_mv	AT chenangelah theuseoftargetedmarkersubsetstoaccountforpopulationstructureandrelatednessingenomewideassociationstudiesofmaizezeamaysl AT lipkaalexandere theuseoftargetedmarkersubsetstoaccountforpopulationstructureandrelatednessingenomewideassociationstudiesofmaizezeamaysl AT chenangelah useoftargetedmarkersubsetstoaccountforpopulationstructureandrelatednessingenomewideassociationstudiesofmaizezeamaysl AT lipkaalexandere useoftargetedmarkersubsetstoaccountforpopulationstructureandrelatednessingenomewideassociationstudiesofmaizezeamaysl

The Use of Targeted Marker Subsets to Account for Population Structure and Relatedness in Genome-Wide Association Studies of Maize (Zea mays L.)

Ejemplares similares