Cargando…

Identification of key contributors in complex population structures

Evaluating the genetic contribution of individuals to population structure is essential to select informative individuals for genome sequencing, genotype imputation and to ascertain complex population structures. Existing methods for the selection of informative individuals for genomic imputation so...

Descripción completa

Detalles Bibliográficos
Autores principales:	Neuditschko, Markus, Raadsma, Herman W., Khatkar, Mehar S., Jonas, Elisabeth, Steinig, Eike J., Flury, Christine, Signer-Hasler, Heidi, Frischknecht, Mirjam, von Niederhäusern, Ruedi, Leeb, Tosso, Rieder, Stefan
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2017
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5433729/ https://www.ncbi.nlm.nih.gov/pubmed/28520805 http://dx.doi.org/10.1371/journal.pone.0177638

_version_	1783236909820018688
author	Neuditschko, Markus Raadsma, Herman W. Khatkar, Mehar S. Jonas, Elisabeth Steinig, Eike J. Flury, Christine Signer-Hasler, Heidi Frischknecht, Mirjam von Niederhäusern, Ruedi Leeb, Tosso Rieder, Stefan
author_facet	Neuditschko, Markus Raadsma, Herman W. Khatkar, Mehar S. Jonas, Elisabeth Steinig, Eike J. Flury, Christine Signer-Hasler, Heidi Frischknecht, Mirjam von Niederhäusern, Ruedi Leeb, Tosso Rieder, Stefan
author_sort	Neuditschko, Markus
collection	PubMed
description	Evaluating the genetic contribution of individuals to population structure is essential to select informative individuals for genome sequencing, genotype imputation and to ascertain complex population structures. Existing methods for the selection of informative individuals for genomic imputation solely focus on the identification of key ancestors, which can lead to a loss of phasing accuracy of the reference population. Currently many methods are independently applied to investigate complex population structures. Based on the Eigenvalue Decomposition (EVD) of a genomic relationship matrix we describe a novel approach to evaluate the genetic contribution of individuals to population structure. We combined the identification of key contributors with model-based clustering and population network visualization into an integrated three-step approach, which allows identification of high-resolution population structures and substructures around such key contributors. The approach was applied and validated in four disparate datasets including a simulated population (5,100 individuals and 10,000 SNPs), a highly structured experimental sheep population (1,421 individuals and 44,693 SNPs) and two large complex pedigree populations namely horse (1,077 individuals and 38,124 SNPs) and cattle (2,457 individuals and 45,765 SNPs). In the simulated and experimental sheep dataset, our method, which is unsupervised, successfully identified all known key contributors. Applying our three-step approach to the horse and cattle populations, we observed high-resolution population substructures including the absence of obvious important key contributors. Furthermore, we show that compared to commonly applied strategies to select informative individuals for genotype imputation including the computation of marginal gene contributions (Pedig) and the optimization of genetic relatedness (Rel), the selection of key contributors provided the highest phasing accuracies within the selected reference populations. The presented approach opens new perspectives in the characterization and informed management of populations in general, and in areas such as conservation genetics and selective animal breeding in particular, where assessing the genetic contribution of influential and admixed individuals is crucial for research and management applications. As such, this method provides a valuable complement to common applied tools to visualize complex population structures and to select individuals for re-sequencing.
format	Online Article Text
id	pubmed-5433729
institution	National Center for Biotechnology Information
language	English
publishDate	2017
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-54337292017-05-26 Identification of key contributors in complex population structures Neuditschko, Markus Raadsma, Herman W. Khatkar, Mehar S. Jonas, Elisabeth Steinig, Eike J. Flury, Christine Signer-Hasler, Heidi Frischknecht, Mirjam von Niederhäusern, Ruedi Leeb, Tosso Rieder, Stefan PLoS One Research Article Evaluating the genetic contribution of individuals to population structure is essential to select informative individuals for genome sequencing, genotype imputation and to ascertain complex population structures. Existing methods for the selection of informative individuals for genomic imputation solely focus on the identification of key ancestors, which can lead to a loss of phasing accuracy of the reference population. Currently many methods are independently applied to investigate complex population structures. Based on the Eigenvalue Decomposition (EVD) of a genomic relationship matrix we describe a novel approach to evaluate the genetic contribution of individuals to population structure. We combined the identification of key contributors with model-based clustering and population network visualization into an integrated three-step approach, which allows identification of high-resolution population structures and substructures around such key contributors. The approach was applied and validated in four disparate datasets including a simulated population (5,100 individuals and 10,000 SNPs), a highly structured experimental sheep population (1,421 individuals and 44,693 SNPs) and two large complex pedigree populations namely horse (1,077 individuals and 38,124 SNPs) and cattle (2,457 individuals and 45,765 SNPs). In the simulated and experimental sheep dataset, our method, which is unsupervised, successfully identified all known key contributors. Applying our three-step approach to the horse and cattle populations, we observed high-resolution population substructures including the absence of obvious important key contributors. Furthermore, we show that compared to commonly applied strategies to select informative individuals for genotype imputation including the computation of marginal gene contributions (Pedig) and the optimization of genetic relatedness (Rel), the selection of key contributors provided the highest phasing accuracies within the selected reference populations. The presented approach opens new perspectives in the characterization and informed management of populations in general, and in areas such as conservation genetics and selective animal breeding in particular, where assessing the genetic contribution of influential and admixed individuals is crucial for research and management applications. As such, this method provides a valuable complement to common applied tools to visualize complex population structures and to select individuals for re-sequencing. Public Library of Science 2017-05-16 /pmc/articles/PMC5433729/ /pubmed/28520805 http://dx.doi.org/10.1371/journal.pone.0177638 Text en © 2017 Neuditschko et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Research Article Neuditschko, Markus Raadsma, Herman W. Khatkar, Mehar S. Jonas, Elisabeth Steinig, Eike J. Flury, Christine Signer-Hasler, Heidi Frischknecht, Mirjam von Niederhäusern, Ruedi Leeb, Tosso Rieder, Stefan Identification of key contributors in complex population structures
title	Identification of key contributors in complex population structures
title_full	Identification of key contributors in complex population structures
title_fullStr	Identification of key contributors in complex population structures
title_full_unstemmed	Identification of key contributors in complex population structures
title_short	Identification of key contributors in complex population structures
title_sort	identification of key contributors in complex population structures
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5433729/ https://www.ncbi.nlm.nih.gov/pubmed/28520805 http://dx.doi.org/10.1371/journal.pone.0177638
work_keys_str_mv	AT neuditschkomarkus identificationofkeycontributorsincomplexpopulationstructures AT raadsmahermanw identificationofkeycontributorsincomplexpopulationstructures AT khatkarmehars identificationofkeycontributorsincomplexpopulationstructures AT jonaselisabeth identificationofkeycontributorsincomplexpopulationstructures AT steinigeikej identificationofkeycontributorsincomplexpopulationstructures AT flurychristine identificationofkeycontributorsincomplexpopulationstructures AT signerhaslerheidi identificationofkeycontributorsincomplexpopulationstructures AT frischknechtmirjam identificationofkeycontributorsincomplexpopulationstructures AT vonniederhausernruedi identificationofkeycontributorsincomplexpopulationstructures AT leebtosso identificationofkeycontributorsincomplexpopulationstructures AT riederstefan identificationofkeycontributorsincomplexpopulationstructures

Identification of key contributors in complex population structures

Ejemplares similares