Cargando…

An Arabidopsis Example of Association Mapping in Structured Samples

A potentially serious disadvantage of association mapping is the fact that marker-trait associations may arise from confounding population structure as well as from linkage to causative polymorphisms. Using genome-wide marker data, we have previously demonstrated that the problem can be severe in a...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhao, Keyan, Aranzana, María José, Kim, Sung, Lister, Clare, Shindo, Chikako, Tang, Chunlao, Toomajian, Christopher, Zheng, Honggang, Dean, Caroline, Marjoram, Paul, Nordborg, Magnus
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1779303/
https://www.ncbi.nlm.nih.gov/pubmed/17238287
http://dx.doi.org/10.1371/journal.pgen.0030004
_version_ 1782131748844339200
author Zhao, Keyan
Aranzana, María José
Kim, Sung
Lister, Clare
Shindo, Chikako
Tang, Chunlao
Toomajian, Christopher
Zheng, Honggang
Dean, Caroline
Marjoram, Paul
Nordborg, Magnus
author_facet Zhao, Keyan
Aranzana, María José
Kim, Sung
Lister, Clare
Shindo, Chikako
Tang, Chunlao
Toomajian, Christopher
Zheng, Honggang
Dean, Caroline
Marjoram, Paul
Nordborg, Magnus
author_sort Zhao, Keyan
collection PubMed
description A potentially serious disadvantage of association mapping is the fact that marker-trait associations may arise from confounding population structure as well as from linkage to causative polymorphisms. Using genome-wide marker data, we have previously demonstrated that the problem can be severe in a global sample of 95 Arabidopsis thaliana accessions, and that established methods for controlling for population structure are generally insufficient. Here, we use the same sample together with a number of flowering-related phenotypes and data-perturbation simulations to evaluate a wider range of methods for controlling for population structure. We find that, in terms of reducing the false-positive rate while maintaining statistical power, a recently introduced mixed-model approach that takes genome-wide differences in relatedness into account via estimated pairwise kinship coefficients generally performs best. By combining the association results with results from linkage mapping in F2 crosses, we identify one previously known true positive and several promising new associations, but also demonstrate the existence of both false positives and false negatives. Our results illustrate the potential of genome-wide association scans as a tool for dissecting the genetics of natural variation, while at the same time highlighting the pitfalls. The importance of study design is clear; our study is severely under-powered both in terms of sample size and marker density. Our results also provide a striking demonstration of confounding by population structure. While statistical methods can be used to ameliorate this problem, they cannot always be effective and are certainly not a substitute for independent evidence, such as that obtained via crosses or transgenic experiments. Ultimately, association mapping is a powerful tool for identifying a list of candidates that is short enough to permit further genetic study.
format Text
id pubmed-1779303
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-17793032007-01-20 An Arabidopsis Example of Association Mapping in Structured Samples Zhao, Keyan Aranzana, María José Kim, Sung Lister, Clare Shindo, Chikako Tang, Chunlao Toomajian, Christopher Zheng, Honggang Dean, Caroline Marjoram, Paul Nordborg, Magnus PLoS Genet Research Article A potentially serious disadvantage of association mapping is the fact that marker-trait associations may arise from confounding population structure as well as from linkage to causative polymorphisms. Using genome-wide marker data, we have previously demonstrated that the problem can be severe in a global sample of 95 Arabidopsis thaliana accessions, and that established methods for controlling for population structure are generally insufficient. Here, we use the same sample together with a number of flowering-related phenotypes and data-perturbation simulations to evaluate a wider range of methods for controlling for population structure. We find that, in terms of reducing the false-positive rate while maintaining statistical power, a recently introduced mixed-model approach that takes genome-wide differences in relatedness into account via estimated pairwise kinship coefficients generally performs best. By combining the association results with results from linkage mapping in F2 crosses, we identify one previously known true positive and several promising new associations, but also demonstrate the existence of both false positives and false negatives. Our results illustrate the potential of genome-wide association scans as a tool for dissecting the genetics of natural variation, while at the same time highlighting the pitfalls. The importance of study design is clear; our study is severely under-powered both in terms of sample size and marker density. Our results also provide a striking demonstration of confounding by population structure. While statistical methods can be used to ameliorate this problem, they cannot always be effective and are certainly not a substitute for independent evidence, such as that obtained via crosses or transgenic experiments. Ultimately, association mapping is a powerful tool for identifying a list of candidates that is short enough to permit further genetic study. Public Library of Science 2007-01 2007-01-19 /pmc/articles/PMC1779303/ /pubmed/17238287 http://dx.doi.org/10.1371/journal.pgen.0030004 Text en © 2007 Zhao et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Zhao, Keyan
Aranzana, María José
Kim, Sung
Lister, Clare
Shindo, Chikako
Tang, Chunlao
Toomajian, Christopher
Zheng, Honggang
Dean, Caroline
Marjoram, Paul
Nordborg, Magnus
An Arabidopsis Example of Association Mapping in Structured Samples
title An Arabidopsis Example of Association Mapping in Structured Samples
title_full An Arabidopsis Example of Association Mapping in Structured Samples
title_fullStr An Arabidopsis Example of Association Mapping in Structured Samples
title_full_unstemmed An Arabidopsis Example of Association Mapping in Structured Samples
title_short An Arabidopsis Example of Association Mapping in Structured Samples
title_sort arabidopsis example of association mapping in structured samples
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1779303/
https://www.ncbi.nlm.nih.gov/pubmed/17238287
http://dx.doi.org/10.1371/journal.pgen.0030004
work_keys_str_mv AT zhaokeyan anarabidopsisexampleofassociationmappinginstructuredsamples
AT aranzanamariajose anarabidopsisexampleofassociationmappinginstructuredsamples
AT kimsung anarabidopsisexampleofassociationmappinginstructuredsamples
AT listerclare anarabidopsisexampleofassociationmappinginstructuredsamples
AT shindochikako anarabidopsisexampleofassociationmappinginstructuredsamples
AT tangchunlao anarabidopsisexampleofassociationmappinginstructuredsamples
AT toomajianchristopher anarabidopsisexampleofassociationmappinginstructuredsamples
AT zhenghonggang anarabidopsisexampleofassociationmappinginstructuredsamples
AT deancaroline anarabidopsisexampleofassociationmappinginstructuredsamples
AT marjorampaul anarabidopsisexampleofassociationmappinginstructuredsamples
AT nordborgmagnus anarabidopsisexampleofassociationmappinginstructuredsamples
AT zhaokeyan arabidopsisexampleofassociationmappinginstructuredsamples
AT aranzanamariajose arabidopsisexampleofassociationmappinginstructuredsamples
AT kimsung arabidopsisexampleofassociationmappinginstructuredsamples
AT listerclare arabidopsisexampleofassociationmappinginstructuredsamples
AT shindochikako arabidopsisexampleofassociationmappinginstructuredsamples
AT tangchunlao arabidopsisexampleofassociationmappinginstructuredsamples
AT toomajianchristopher arabidopsisexampleofassociationmappinginstructuredsamples
AT zhenghonggang arabidopsisexampleofassociationmappinginstructuredsamples
AT deancaroline arabidopsisexampleofassociationmappinginstructuredsamples
AT marjorampaul arabidopsisexampleofassociationmappinginstructuredsamples
AT nordborgmagnus arabidopsisexampleofassociationmappinginstructuredsamples