Cargando…

Core Hunter: an algorithm for sampling genetic resources based on multiple genetic measures

BACKGROUND: Existing algorithms and methods for forming diverse core subsets currently address either allele representativeness (breeder's preference) or allele richness (taxonomist's preference). The main objective of this paper is to propose a powerful yet flexible algorithm capable of s...

Descripción completa

Detalles Bibliográficos
Autores principales: Thachuk, Chris, Crossa, José, Franco, Jorge, Dreisigacker, Susanne, Warburton, Marilyn, Davenport, Guy F
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2734557/
https://www.ncbi.nlm.nih.gov/pubmed/19660135
http://dx.doi.org/10.1186/1471-2105-10-243
_version_ 1782171157483487232
author Thachuk, Chris
Crossa, José
Franco, Jorge
Dreisigacker, Susanne
Warburton, Marilyn
Davenport, Guy F
author_facet Thachuk, Chris
Crossa, José
Franco, Jorge
Dreisigacker, Susanne
Warburton, Marilyn
Davenport, Guy F
author_sort Thachuk, Chris
collection PubMed
description BACKGROUND: Existing algorithms and methods for forming diverse core subsets currently address either allele representativeness (breeder's preference) or allele richness (taxonomist's preference). The main objective of this paper is to propose a powerful yet flexible algorithm capable of selecting core subsets that have high average genetic distance between accessions, or rich genetic diversity overall, or a combination of both. RESULTS: We present Core Hunter, an advanced stochastic local search algorithm for selecting core subsets. Core Hunter is able to find core subsets having more genetic diversity and better average genetic distance than the current state-of-the-art algorithms for all genetic distance and diversity measures we evaluated. Furthermore, Core Hunter can attempt to optimize any number of genetic measures simultaneously, based on the preference of the user. Notably, Core Hunter is able to select significantly smaller core subsets, which retain all unique alleles from a reference collection, than state-of-the-art algorithms. CONCLUSION: Core Hunter is a highly effective and flexible tool for sampling genetic resources and establishing core subsets. Our implementation, documentation, and source code for Core Hunter is available at
format Text
id pubmed-2734557
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-27345572009-08-29 Core Hunter: an algorithm for sampling genetic resources based on multiple genetic measures Thachuk, Chris Crossa, José Franco, Jorge Dreisigacker, Susanne Warburton, Marilyn Davenport, Guy F BMC Bioinformatics Research Article BACKGROUND: Existing algorithms and methods for forming diverse core subsets currently address either allele representativeness (breeder's preference) or allele richness (taxonomist's preference). The main objective of this paper is to propose a powerful yet flexible algorithm capable of selecting core subsets that have high average genetic distance between accessions, or rich genetic diversity overall, or a combination of both. RESULTS: We present Core Hunter, an advanced stochastic local search algorithm for selecting core subsets. Core Hunter is able to find core subsets having more genetic diversity and better average genetic distance than the current state-of-the-art algorithms for all genetic distance and diversity measures we evaluated. Furthermore, Core Hunter can attempt to optimize any number of genetic measures simultaneously, based on the preference of the user. Notably, Core Hunter is able to select significantly smaller core subsets, which retain all unique alleles from a reference collection, than state-of-the-art algorithms. CONCLUSION: Core Hunter is a highly effective and flexible tool for sampling genetic resources and establishing core subsets. Our implementation, documentation, and source code for Core Hunter is available at BioMed Central 2009-08-06 /pmc/articles/PMC2734557/ /pubmed/19660135 http://dx.doi.org/10.1186/1471-2105-10-243 Text en Copyright © 2009 Thachuk et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Thachuk, Chris
Crossa, José
Franco, Jorge
Dreisigacker, Susanne
Warburton, Marilyn
Davenport, Guy F
Core Hunter: an algorithm for sampling genetic resources based on multiple genetic measures
title Core Hunter: an algorithm for sampling genetic resources based on multiple genetic measures
title_full Core Hunter: an algorithm for sampling genetic resources based on multiple genetic measures
title_fullStr Core Hunter: an algorithm for sampling genetic resources based on multiple genetic measures
title_full_unstemmed Core Hunter: an algorithm for sampling genetic resources based on multiple genetic measures
title_short Core Hunter: an algorithm for sampling genetic resources based on multiple genetic measures
title_sort core hunter: an algorithm for sampling genetic resources based on multiple genetic measures
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2734557/
https://www.ncbi.nlm.nih.gov/pubmed/19660135
http://dx.doi.org/10.1186/1471-2105-10-243
work_keys_str_mv AT thachukchris corehunteranalgorithmforsamplinggeneticresourcesbasedonmultiplegeneticmeasures
AT crossajose corehunteranalgorithmforsamplinggeneticresourcesbasedonmultiplegeneticmeasures
AT francojorge corehunteranalgorithmforsamplinggeneticresourcesbasedonmultiplegeneticmeasures
AT dreisigackersusanne corehunteranalgorithmforsamplinggeneticresourcesbasedonmultiplegeneticmeasures
AT warburtonmarilyn corehunteranalgorithmforsamplinggeneticresourcesbasedonmultiplegeneticmeasures
AT davenportguyf corehunteranalgorithmforsamplinggeneticresourcesbasedonmultiplegeneticmeasures