
Generalized Analysis of Molecular Variance

Bibliographic Details
Main authors: Nievergelt, Caroline M, Libiger, Ondrej, Schork, Nicholas J
Format: Text
Language: English
Published: Public Library of Science 2007
Subjects: Research Article
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1847693/
https://www.ncbi.nlm.nih.gov/pubmed/17411342
http://dx.doi.org/10.1371/journal.pgen.0030051
author Nievergelt, Caroline M
Libiger, Ondrej
Schork, Nicholas J
collection PubMed
description Many studies in the fields of genetic epidemiology and applied population genetics are predicated on, or require, an assessment of the genetic background diversity of the individuals chosen for study. A number of strategies have been developed for assessing genetic background diversity. These strategies typically focus on genotype data collected on the individuals in the study, based on a panel of DNA markers. However, many of these strategies are either rooted in cluster analysis techniques, and hence suffer from problems inherent to the assignment of the biological and statistical meaning to resulting clusters, or have formulations that do not permit easy and intuitive extensions. We describe a very general approach to the problem of assessing genetic background diversity that extends the analysis of molecular variance (AMOVA) strategy introduced by Excoffier and colleagues some time ago. As in the original AMOVA strategy, the proposed approach, termed generalized AMOVA (GAMOVA), requires a genetic similarity matrix constructed from the allelic profiles of individuals under study and/or allele frequency summaries of the populations from which the individuals have been sampled. The proposed strategy can be used to either estimate the fraction of genetic variation explained by grouping factors such as country of origin, race, or ethnicity, or to quantify the strength of the relationship of the observed genetic background variation to quantitative measures collected on the subjects, such as blood pressure levels or anthropometric measures. Since the formulation of our test statistic is rooted in multivariate linear models, sets of variables can be related to genetic background in multiple regression-like contexts. GAMOVA can also be used to complement graphical representations of genetic diversity such as tree diagrams (dendrograms) or heatmaps. 
We examine features, advantages, and power of the proposed procedure and showcase its flexibility by using it to analyze a wide variety of published data sets, including data from the Human Genome Diversity Project, classical anthropometry data collected by Howells, and the International HapMap Project.
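The description above says the test statistic is rooted in multivariate linear models and operates on a matrix built from pairwise genetic similarities, but this record does not give the formula. The following is a minimal sketch, assuming the statistic takes the familiar pseudo-F form used in distance-based multivariate regression: the squared distances are Gower-centered, projected onto the predictors with a hat matrix, and assessed by permutation. The function names `gower_center` and `pseudo_f` are hypothetical and are not taken from the article.

```python
import numpy as np


def gower_center(dist):
    """Convert an n x n distance matrix into a Gower-centered matrix."""
    n = dist.shape[0]
    a = -0.5 * dist ** 2                      # element-wise transform of distances
    c = np.eye(n) - np.ones((n, n)) / n       # centering matrix
    return c @ a @ c


def pseudo_f(dist, predictors, n_perm=999, seed=0):
    """Return (pseudo-F, fraction of variation explained, permutation p-value).

    Assumed form: F = [tr(HGH) / (m - 1)] / [tr((I-H)G(I-H)) / (n - m)],
    where H is the hat matrix of the predictors plus an intercept.
    """
    n = dist.shape[0]
    g = gower_center(dist)
    x = np.column_stack([np.ones(n), predictors])   # add intercept column
    m = np.linalg.matrix_rank(x)
    h = x @ np.linalg.pinv(x)                       # projection onto columns of x
    i_minus_h = np.eye(n) - h

    def stat(gm):
        explained = np.trace(h @ gm @ h)
        residual = np.trace(i_minus_h @ gm @ i_minus_h)
        return (explained / (m - 1)) / (residual / (n - m))

    f_obs = stat(g)
    r2 = np.trace(h @ g @ h) / np.trace(g)          # fraction of variation explained

    # Permute subjects (rows and columns of G together) to build a null distribution.
    rng = np.random.default_rng(seed)
    hits = 0
    for _ in range(n_perm):
        p = rng.permutation(n)
        if stat(g[np.ix_(p, p)]) >= f_obs:
            hits += 1
    p_value = (hits + 1) / (n_perm + 1)
    return f_obs, r2, p_value
```

Under these assumptions, a grouping factor such as country of origin would be passed as dummy-coded columns, and a quantitative measure such as blood pressure as a single numeric column. Several columns together give the multiple regression-like setting the description mentions.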
format Text
id pubmed-1847693
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-1847693 2007-04-06 Generalized Analysis of Molecular Variance Nievergelt, Caroline M Libiger, Ondrej Schork, Nicholas J PLoS Genet Research Article Public Library of Science 2007-04 2007-04-06 /pmc/articles/PMC1847693/ /pubmed/17411342 http://dx.doi.org/10.1371/journal.pgen.0030051 Text en © 2007 Nievergelt et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
title Generalized Analysis of Molecular Variance
topic Research Article