Cargando…

Network-based hierarchical population structure analysis for large genomic data sets

Analysis of population structure in natural populations using genetic data is a common practice in ecological and evolutionary studies. With large genomic data sets of populations now appearing more frequently across the taxonomic spectrum, it is becoming increasingly possible to reveal many hierarc...

Descripción completa

Detalles Bibliográficos
Autores principales: Greenbaum, Gili, Rubin, Amir, Templeton, Alan R., Rosenberg, Noah A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6886512/
https://www.ncbi.nlm.nih.gov/pubmed/31694865
http://dx.doi.org/10.1101/gr.250092.119
_version_ 1783474888281948160
author Greenbaum, Gili
Rubin, Amir
Templeton, Alan R.
Rosenberg, Noah A.
author_facet Greenbaum, Gili
Rubin, Amir
Templeton, Alan R.
Rosenberg, Noah A.
author_sort Greenbaum, Gili
collection PubMed
description Analysis of population structure in natural populations using genetic data is a common practice in ecological and evolutionary studies. With large genomic data sets of populations now appearing more frequently across the taxonomic spectrum, it is becoming increasingly possible to reveal many hierarchical levels of structure, including fine-scale genetic clusters. To analyze these data sets, methods need to be appropriately suited to the challenges of extracting multilevel structure from whole-genome data. Here, we present a network-based approach for constructing population structure representations from genetic data. The use of community-detection algorithms from network theory generates a natural hierarchical perspective on the representation that the method produces. The method is computationally efficient, and it requires relatively few assumptions regarding the biological processes that underlie the data. We show the approach by analyzing population structure in the model plant species Arabidopsis thaliana and in human populations. These examples illustrate how network-based approaches for population structure analysis are well-suited to extracting valuable ecological and evolutionary information in the era of large genomic data sets.
format Online
Article
Text
id pubmed-6886512
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-68865122020-06-01 Network-based hierarchical population structure analysis for large genomic data sets Greenbaum, Gili Rubin, Amir Templeton, Alan R. Rosenberg, Noah A. Genome Res Method Analysis of population structure in natural populations using genetic data is a common practice in ecological and evolutionary studies. With large genomic data sets of populations now appearing more frequently across the taxonomic spectrum, it is becoming increasingly possible to reveal many hierarchical levels of structure, including fine-scale genetic clusters. To analyze these data sets, methods need to be appropriately suited to the challenges of extracting multilevel structure from whole-genome data. Here, we present a network-based approach for constructing population structure representations from genetic data. The use of community-detection algorithms from network theory generates a natural hierarchical perspective on the representation that the method produces. The method is computationally efficient, and it requires relatively few assumptions regarding the biological processes that underlie the data. We show the approach by analyzing population structure in the model plant species Arabidopsis thaliana and in human populations. These examples illustrate how network-based approaches for population structure analysis are well-suited to extracting valuable ecological and evolutionary information in the era of large genomic data sets. Cold Spring Harbor Laboratory Press 2019-12 /pmc/articles/PMC6886512/ /pubmed/31694865 http://dx.doi.org/10.1101/gr.250092.119 Text en © 2019 Greenbaum et al.; Published by Cold Spring Harbor Laboratory Press http://creativecommons.org/licenses/by-nc/4.0/ This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.
spellingShingle Method
Greenbaum, Gili
Rubin, Amir
Templeton, Alan R.
Rosenberg, Noah A.
Network-based hierarchical population structure analysis for large genomic data sets
title Network-based hierarchical population structure analysis for large genomic data sets
title_full Network-based hierarchical population structure analysis for large genomic data sets
title_fullStr Network-based hierarchical population structure analysis for large genomic data sets
title_full_unstemmed Network-based hierarchical population structure analysis for large genomic data sets
title_short Network-based hierarchical population structure analysis for large genomic data sets
title_sort network-based hierarchical population structure analysis for large genomic data sets
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6886512/
https://www.ncbi.nlm.nih.gov/pubmed/31694865
http://dx.doi.org/10.1101/gr.250092.119
work_keys_str_mv AT greenbaumgili networkbasedhierarchicalpopulationstructureanalysisforlargegenomicdatasets
AT rubinamir networkbasedhierarchicalpopulationstructureanalysisforlargegenomicdatasets
AT templetonalanr networkbasedhierarchicalpopulationstructureanalysisforlargegenomicdatasets
AT rosenbergnoaha networkbasedhierarchicalpopulationstructureanalysisforlargegenomicdatasets