Cargando…
An Arabidopsis Example of Association Mapping in Structured Samples
A potentially serious disadvantage of association mapping is the fact that marker-trait associations may arise from confounding population structure as well as from linkage to causative polymorphisms. Using genome-wide marker data, we have previously demonstrated that the problem can be severe in a...
Autores principales: | , , , , , , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2007
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1779303/ https://www.ncbi.nlm.nih.gov/pubmed/17238287 http://dx.doi.org/10.1371/journal.pgen.0030004 |
_version_ | 1782131748844339200 |
---|---|
author | Zhao, Keyan Aranzana, María José Kim, Sung Lister, Clare Shindo, Chikako Tang, Chunlao Toomajian, Christopher Zheng, Honggang Dean, Caroline Marjoram, Paul Nordborg, Magnus |
author_facet | Zhao, Keyan Aranzana, María José Kim, Sung Lister, Clare Shindo, Chikako Tang, Chunlao Toomajian, Christopher Zheng, Honggang Dean, Caroline Marjoram, Paul Nordborg, Magnus |
author_sort | Zhao, Keyan |
collection | PubMed |
description | A potentially serious disadvantage of association mapping is the fact that marker-trait associations may arise from confounding population structure as well as from linkage to causative polymorphisms. Using genome-wide marker data, we have previously demonstrated that the problem can be severe in a global sample of 95 Arabidopsis thaliana accessions, and that established methods for controlling for population structure are generally insufficient. Here, we use the same sample together with a number of flowering-related phenotypes and data-perturbation simulations to evaluate a wider range of methods for controlling for population structure. We find that, in terms of reducing the false-positive rate while maintaining statistical power, a recently introduced mixed-model approach that takes genome-wide differences in relatedness into account via estimated pairwise kinship coefficients generally performs best. By combining the association results with results from linkage mapping in F2 crosses, we identify one previously known true positive and several promising new associations, but also demonstrate the existence of both false positives and false negatives. Our results illustrate the potential of genome-wide association scans as a tool for dissecting the genetics of natural variation, while at the same time highlighting the pitfalls. The importance of study design is clear; our study is severely under-powered both in terms of sample size and marker density. Our results also provide a striking demonstration of confounding by population structure. While statistical methods can be used to ameliorate this problem, they cannot always be effective and are certainly not a substitute for independent evidence, such as that obtained via crosses or transgenic experiments. Ultimately, association mapping is a powerful tool for identifying a list of candidates that is short enough to permit further genetic study. |
format | Text |
id | pubmed-1779303 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2007 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-17793032007-01-20 An Arabidopsis Example of Association Mapping in Structured Samples Zhao, Keyan Aranzana, María José Kim, Sung Lister, Clare Shindo, Chikako Tang, Chunlao Toomajian, Christopher Zheng, Honggang Dean, Caroline Marjoram, Paul Nordborg, Magnus PLoS Genet Research Article A potentially serious disadvantage of association mapping is the fact that marker-trait associations may arise from confounding population structure as well as from linkage to causative polymorphisms. Using genome-wide marker data, we have previously demonstrated that the problem can be severe in a global sample of 95 Arabidopsis thaliana accessions, and that established methods for controlling for population structure are generally insufficient. Here, we use the same sample together with a number of flowering-related phenotypes and data-perturbation simulations to evaluate a wider range of methods for controlling for population structure. We find that, in terms of reducing the false-positive rate while maintaining statistical power, a recently introduced mixed-model approach that takes genome-wide differences in relatedness into account via estimated pairwise kinship coefficients generally performs best. By combining the association results with results from linkage mapping in F2 crosses, we identify one previously known true positive and several promising new associations, but also demonstrate the existence of both false positives and false negatives. Our results illustrate the potential of genome-wide association scans as a tool for dissecting the genetics of natural variation, while at the same time highlighting the pitfalls. The importance of study design is clear; our study is severely under-powered both in terms of sample size and marker density. Our results also provide a striking demonstration of confounding by population structure. While statistical methods can be used to ameliorate this problem, they cannot always be effective and are certainly not a substitute for independent evidence, such as that obtained via crosses or transgenic experiments. Ultimately, association mapping is a powerful tool for identifying a list of candidates that is short enough to permit further genetic study. Public Library of Science 2007-01 2007-01-19 /pmc/articles/PMC1779303/ /pubmed/17238287 http://dx.doi.org/10.1371/journal.pgen.0030004 Text en © 2007 Zhao et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Zhao, Keyan Aranzana, María José Kim, Sung Lister, Clare Shindo, Chikako Tang, Chunlao Toomajian, Christopher Zheng, Honggang Dean, Caroline Marjoram, Paul Nordborg, Magnus An Arabidopsis Example of Association Mapping in Structured Samples |
title | An Arabidopsis Example of Association Mapping in Structured Samples |
title_full | An Arabidopsis Example of Association Mapping in Structured Samples |
title_fullStr | An Arabidopsis Example of Association Mapping in Structured Samples |
title_full_unstemmed | An Arabidopsis Example of Association Mapping in Structured Samples |
title_short | An Arabidopsis Example of Association Mapping in Structured Samples |
title_sort | arabidopsis example of association mapping in structured samples |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1779303/ https://www.ncbi.nlm.nih.gov/pubmed/17238287 http://dx.doi.org/10.1371/journal.pgen.0030004 |
work_keys_str_mv | AT zhaokeyan anarabidopsisexampleofassociationmappinginstructuredsamples AT aranzanamariajose anarabidopsisexampleofassociationmappinginstructuredsamples AT kimsung anarabidopsisexampleofassociationmappinginstructuredsamples AT listerclare anarabidopsisexampleofassociationmappinginstructuredsamples AT shindochikako anarabidopsisexampleofassociationmappinginstructuredsamples AT tangchunlao anarabidopsisexampleofassociationmappinginstructuredsamples AT toomajianchristopher anarabidopsisexampleofassociationmappinginstructuredsamples AT zhenghonggang anarabidopsisexampleofassociationmappinginstructuredsamples AT deancaroline anarabidopsisexampleofassociationmappinginstructuredsamples AT marjorampaul anarabidopsisexampleofassociationmappinginstructuredsamples AT nordborgmagnus anarabidopsisexampleofassociationmappinginstructuredsamples AT zhaokeyan arabidopsisexampleofassociationmappinginstructuredsamples AT aranzanamariajose arabidopsisexampleofassociationmappinginstructuredsamples AT kimsung arabidopsisexampleofassociationmappinginstructuredsamples AT listerclare arabidopsisexampleofassociationmappinginstructuredsamples AT shindochikako arabidopsisexampleofassociationmappinginstructuredsamples AT tangchunlao arabidopsisexampleofassociationmappinginstructuredsamples AT toomajianchristopher arabidopsisexampleofassociationmappinginstructuredsamples AT zhenghonggang arabidopsisexampleofassociationmappinginstructuredsamples AT deancaroline arabidopsisexampleofassociationmappinginstructuredsamples AT marjorampaul arabidopsisexampleofassociationmappinginstructuredsamples AT nordborgmagnus arabidopsisexampleofassociationmappinginstructuredsamples |