Cargando…
Generalized Analysis of Molecular Variance
Many studies in the fields of genetic epidemiology and applied population genetics are predicated on, or require, an assessment of the genetic background diversity of the individuals chosen for study. A number of strategies have been developed for assessing genetic background diversity. These strate...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2007
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1847693/ https://www.ncbi.nlm.nih.gov/pubmed/17411342 http://dx.doi.org/10.1371/journal.pgen.0030051 |
_version_ | 1782132912601169920 |
---|---|
author | Nievergelt, Caroline M Libiger, Ondrej Schork, Nicholas J |
author_facet | Nievergelt, Caroline M Libiger, Ondrej Schork, Nicholas J |
author_sort | Nievergelt, Caroline M |
collection | PubMed |
description | Many studies in the fields of genetic epidemiology and applied population genetics are predicated on, or require, an assessment of the genetic background diversity of the individuals chosen for study. A number of strategies have been developed for assessing genetic background diversity. These strategies typically focus on genotype data collected on the individuals in the study, based on a panel of DNA markers. However, many of these strategies are either rooted in cluster analysis techniques, and hence suffer from problems inherent to the assignment of the biological and statistical meaning to resulting clusters, or have formulations that do not permit easy and intuitive extensions. We describe a very general approach to the problem of assessing genetic background diversity that extends the analysis of molecular variance (AMOVA) strategy introduced by Excoffier and colleagues some time ago. As in the original AMOVA strategy, the proposed approach, termed generalized AMOVA (GAMOVA), requires a genetic similarity matrix constructed from the allelic profiles of individuals under study and/or allele frequency summaries of the populations from which the individuals have been sampled. The proposed strategy can be used to either estimate the fraction of genetic variation explained by grouping factors such as country of origin, race, or ethnicity, or to quantify the strength of the relationship of the observed genetic background variation to quantitative measures collected on the subjects, such as blood pressure levels or anthropometric measures. Since the formulation of our test statistic is rooted in multivariate linear models, sets of variables can be related to genetic background in multiple regression-like contexts. GAMOVA can also be used to complement graphical representations of genetic diversity such as tree diagrams (dendrograms) or heatmaps. We examine features, advantages, and power of the proposed procedure and showcase its flexibility by using it to analyze a wide variety of published data sets, including data from the Human Genome Diversity Project, classical anthropometry data collected by Howells, and the International HapMap Project. |
format | Text |
id | pubmed-1847693 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2007 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-18476932007-04-06 Generalized Analysis of Molecular Variance Nievergelt, Caroline M Libiger, Ondrej Schork, Nicholas J PLoS Genet Research Article Many studies in the fields of genetic epidemiology and applied population genetics are predicated on, or require, an assessment of the genetic background diversity of the individuals chosen for study. A number of strategies have been developed for assessing genetic background diversity. These strategies typically focus on genotype data collected on the individuals in the study, based on a panel of DNA markers. However, many of these strategies are either rooted in cluster analysis techniques, and hence suffer from problems inherent to the assignment of the biological and statistical meaning to resulting clusters, or have formulations that do not permit easy and intuitive extensions. We describe a very general approach to the problem of assessing genetic background diversity that extends the analysis of molecular variance (AMOVA) strategy introduced by Excoffier and colleagues some time ago. As in the original AMOVA strategy, the proposed approach, termed generalized AMOVA (GAMOVA), requires a genetic similarity matrix constructed from the allelic profiles of individuals under study and/or allele frequency summaries of the populations from which the individuals have been sampled. The proposed strategy can be used to either estimate the fraction of genetic variation explained by grouping factors such as country of origin, race, or ethnicity, or to quantify the strength of the relationship of the observed genetic background variation to quantitative measures collected on the subjects, such as blood pressure levels or anthropometric measures. Since the formulation of our test statistic is rooted in multivariate linear models, sets of variables can be related to genetic background in multiple regression-like contexts. GAMOVA can also be used to complement graphical representations of genetic diversity such as tree diagrams (dendrograms) or heatmaps. We examine features, advantages, and power of the proposed procedure and showcase its flexibility by using it to analyze a wide variety of published data sets, including data from the Human Genome Diversity Project, classical anthropometry data collected by Howells, and the International HapMap Project. Public Library of Science 2007-04 2007-04-06 /pmc/articles/PMC1847693/ /pubmed/17411342 http://dx.doi.org/10.1371/journal.pgen.0030051 Text en © 2007 Nievergelt et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Nievergelt, Caroline M Libiger, Ondrej Schork, Nicholas J Generalized Analysis of Molecular Variance |
title | Generalized Analysis of Molecular Variance |
title_full | Generalized Analysis of Molecular Variance |
title_fullStr | Generalized Analysis of Molecular Variance |
title_full_unstemmed | Generalized Analysis of Molecular Variance |
title_short | Generalized Analysis of Molecular Variance |
title_sort | generalized analysis of molecular variance |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1847693/ https://www.ncbi.nlm.nih.gov/pubmed/17411342 http://dx.doi.org/10.1371/journal.pgen.0030051 |
work_keys_str_mv | AT nievergeltcarolinem generalizedanalysisofmolecularvariance AT libigerondrej generalizedanalysisofmolecularvariance AT schorknicholasj generalizedanalysisofmolecularvariance |