Cargando…

Major inconsistencies of inferred population genetic structure estimated in a large set of domestic horse breeds using microsatellites

STRUCTURE remains the most applied software aimed at recovering the true, but unknown, population structure from microsatellite or other genetic markers. About 30% of structure‐based studies could not be reproduced (Molecular Ecology, 21, 2012, 4925). Here we use a large set of data from 2,323 horse...

Descripción completa

Detalles Bibliográficos
Autores principales: Funk, Stephan Michael, Guedaoura, Sonya, Juras, Rytis, Raziq, Absul, Landolsi, Faouzi, Luís, Cristina, Martínez, Amparo Martínez, Musa Mayaki, Abubakar, Mujica, Fernando, Oom, Maria do Mar, Ouragh, Lahoussine, Stranger, Yves‐Marie, Vega‐Pla, Jose Luis, Cothran, Ernest Gus
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7246218/
https://www.ncbi.nlm.nih.gov/pubmed/32489595
http://dx.doi.org/10.1002/ece3.6195
_version_ 1783537897187573760
author Funk, Stephan Michael
Guedaoura, Sonya
Juras, Rytis
Raziq, Absul
Landolsi, Faouzi
Luís, Cristina
Martínez, Amparo Martínez
Musa Mayaki, Abubakar
Mujica, Fernando
Oom, Maria do Mar
Ouragh, Lahoussine
Stranger, Yves‐Marie
Vega‐Pla, Jose Luis
Cothran, Ernest Gus
author_facet Funk, Stephan Michael
Guedaoura, Sonya
Juras, Rytis
Raziq, Absul
Landolsi, Faouzi
Luís, Cristina
Martínez, Amparo Martínez
Musa Mayaki, Abubakar
Mujica, Fernando
Oom, Maria do Mar
Ouragh, Lahoussine
Stranger, Yves‐Marie
Vega‐Pla, Jose Luis
Cothran, Ernest Gus
author_sort Funk, Stephan Michael
collection PubMed
description STRUCTURE remains the most applied software aimed at recovering the true, but unknown, population structure from microsatellite or other genetic markers. About 30% of structure‐based studies could not be reproduced (Molecular Ecology, 21, 2012, 4925). Here we use a large set of data from 2,323 horses from 93 domestic breeds plus the Przewalski horse, typed at 15 microsatellites, to evaluate how program settings impact the estimation of the optimal number of population clusters K (opt) that best describe the observed data. Domestic horses are suited as a test case as there is extensive background knowledge on the history of many breeds and extensive phylogenetic analyses. Different methods based on different genetic assumptions and statistical procedures (dapc, flock, PCoA, and structure with different run scenarios) all revealed general, broad‐scale breed relationships that largely reflect known breed histories but diverged how they characterized small‐scale patterns. structure failed to consistently identify K (opt) using the most widespread approach, the ΔK method, despite very large numbers of MCMC iterations (3,000,000) and replicates (100). The interpretation of breed structure over increasing numbers of K, without assuming a K (opt), was consistent with known breed histories. The over‐reliance on K (opt) should be replaced by a qualitative description of clustering over increasing K, which is scientifically more honest and has the advantage of being much faster and less computer intensive as lower numbers of MCMC iterations and repetitions suffice for stable results. Very large data sets are highly challenging for cluster analyses, especially when populations with complex genetic histories are investigated.
format Online
Article
Text
id pubmed-7246218
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-72462182020-06-01 Major inconsistencies of inferred population genetic structure estimated in a large set of domestic horse breeds using microsatellites Funk, Stephan Michael Guedaoura, Sonya Juras, Rytis Raziq, Absul Landolsi, Faouzi Luís, Cristina Martínez, Amparo Martínez Musa Mayaki, Abubakar Mujica, Fernando Oom, Maria do Mar Ouragh, Lahoussine Stranger, Yves‐Marie Vega‐Pla, Jose Luis Cothran, Ernest Gus Ecol Evol Original Research STRUCTURE remains the most applied software aimed at recovering the true, but unknown, population structure from microsatellite or other genetic markers. About 30% of structure‐based studies could not be reproduced (Molecular Ecology, 21, 2012, 4925). Here we use a large set of data from 2,323 horses from 93 domestic breeds plus the Przewalski horse, typed at 15 microsatellites, to evaluate how program settings impact the estimation of the optimal number of population clusters K (opt) that best describe the observed data. Domestic horses are suited as a test case as there is extensive background knowledge on the history of many breeds and extensive phylogenetic analyses. Different methods based on different genetic assumptions and statistical procedures (dapc, flock, PCoA, and structure with different run scenarios) all revealed general, broad‐scale breed relationships that largely reflect known breed histories but diverged how they characterized small‐scale patterns. structure failed to consistently identify K (opt) using the most widespread approach, the ΔK method, despite very large numbers of MCMC iterations (3,000,000) and replicates (100). The interpretation of breed structure over increasing numbers of K, without assuming a K (opt), was consistent with known breed histories. The over‐reliance on K (opt) should be replaced by a qualitative description of clustering over increasing K, which is scientifically more honest and has the advantage of being much faster and less computer intensive as lower numbers of MCMC iterations and repetitions suffice for stable results. Very large data sets are highly challenging for cluster analyses, especially when populations with complex genetic histories are investigated. John Wiley and Sons Inc. 2020-04-12 /pmc/articles/PMC7246218/ /pubmed/32489595 http://dx.doi.org/10.1002/ece3.6195 Text en © 2020 The Authors. Ecology and Evolution published by John Wiley & Sons Ltd. This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Research
Funk, Stephan Michael
Guedaoura, Sonya
Juras, Rytis
Raziq, Absul
Landolsi, Faouzi
Luís, Cristina
Martínez, Amparo Martínez
Musa Mayaki, Abubakar
Mujica, Fernando
Oom, Maria do Mar
Ouragh, Lahoussine
Stranger, Yves‐Marie
Vega‐Pla, Jose Luis
Cothran, Ernest Gus
Major inconsistencies of inferred population genetic structure estimated in a large set of domestic horse breeds using microsatellites
title Major inconsistencies of inferred population genetic structure estimated in a large set of domestic horse breeds using microsatellites
title_full Major inconsistencies of inferred population genetic structure estimated in a large set of domestic horse breeds using microsatellites
title_fullStr Major inconsistencies of inferred population genetic structure estimated in a large set of domestic horse breeds using microsatellites
title_full_unstemmed Major inconsistencies of inferred population genetic structure estimated in a large set of domestic horse breeds using microsatellites
title_short Major inconsistencies of inferred population genetic structure estimated in a large set of domestic horse breeds using microsatellites
title_sort major inconsistencies of inferred population genetic structure estimated in a large set of domestic horse breeds using microsatellites
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7246218/
https://www.ncbi.nlm.nih.gov/pubmed/32489595
http://dx.doi.org/10.1002/ece3.6195
work_keys_str_mv AT funkstephanmichael majorinconsistenciesofinferredpopulationgeneticstructureestimatedinalargesetofdomestichorsebreedsusingmicrosatellites
AT guedaourasonya majorinconsistenciesofinferredpopulationgeneticstructureestimatedinalargesetofdomestichorsebreedsusingmicrosatellites
AT jurasrytis majorinconsistenciesofinferredpopulationgeneticstructureestimatedinalargesetofdomestichorsebreedsusingmicrosatellites
AT raziqabsul majorinconsistenciesofinferredpopulationgeneticstructureestimatedinalargesetofdomestichorsebreedsusingmicrosatellites
AT landolsifaouzi majorinconsistenciesofinferredpopulationgeneticstructureestimatedinalargesetofdomestichorsebreedsusingmicrosatellites
AT luiscristina majorinconsistenciesofinferredpopulationgeneticstructureestimatedinalargesetofdomestichorsebreedsusingmicrosatellites
AT martinezamparomartinez majorinconsistenciesofinferredpopulationgeneticstructureestimatedinalargesetofdomestichorsebreedsusingmicrosatellites
AT musamayakiabubakar majorinconsistenciesofinferredpopulationgeneticstructureestimatedinalargesetofdomestichorsebreedsusingmicrosatellites
AT mujicafernando majorinconsistenciesofinferredpopulationgeneticstructureestimatedinalargesetofdomestichorsebreedsusingmicrosatellites
AT oommariadomar majorinconsistenciesofinferredpopulationgeneticstructureestimatedinalargesetofdomestichorsebreedsusingmicrosatellites
AT ouraghlahoussine majorinconsistenciesofinferredpopulationgeneticstructureestimatedinalargesetofdomestichorsebreedsusingmicrosatellites
AT strangeryvesmarie majorinconsistenciesofinferredpopulationgeneticstructureestimatedinalargesetofdomestichorsebreedsusingmicrosatellites
AT vegaplajoseluis majorinconsistenciesofinferredpopulationgeneticstructureestimatedinalargesetofdomestichorsebreedsusingmicrosatellites
AT cothranernestgus majorinconsistenciesofinferredpopulationgeneticstructureestimatedinalargesetofdomestichorsebreedsusingmicrosatellites