Cargando…

Haplotype inference from unphased SNP data in heterozygous polyploids based on SAT

BACKGROUND: Haplotype inference based on unphased SNP markers is an important task in population genetics. Although there are different approaches to the inference of haplotypes in diploid species, the existing software is not suitable for inferring haplotypes from unphased SNP data in polyploid spe...

Descripción completa

Detalles Bibliográficos
Autores principales: Neigenfind, Jost, Gyetvai, Gabor, Basekow, Rico, Diehl, Svenja, Achenbach, Ute, Gebhardt, Christiane, Selbig, Joachim, Kersten, Birgit
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2566320/
https://www.ncbi.nlm.nih.gov/pubmed/18667059
http://dx.doi.org/10.1186/1471-2164-9-356
_version_ 1782159943623770112
author Neigenfind, Jost
Gyetvai, Gabor
Basekow, Rico
Diehl, Svenja
Achenbach, Ute
Gebhardt, Christiane
Selbig, Joachim
Kersten, Birgit
author_facet Neigenfind, Jost
Gyetvai, Gabor
Basekow, Rico
Diehl, Svenja
Achenbach, Ute
Gebhardt, Christiane
Selbig, Joachim
Kersten, Birgit
author_sort Neigenfind, Jost
collection PubMed
description BACKGROUND: Haplotype inference based on unphased SNP markers is an important task in population genetics. Although there are different approaches to the inference of haplotypes in diploid species, the existing software is not suitable for inferring haplotypes from unphased SNP data in polyploid species, such as the cultivated potato (Solanum tuberosum). Potato species are tetraploid and highly heterozygous. RESULTS: Here we present the software SATlotyper which is able to handle polyploid and polyallelic data. SATlo-typer uses the Boolean satisfiability problem to formulate Haplotype Inference by Pure Parsimony. The software excludes existing haplotype inferences, thus allowing for calculation of alternative inferences. As it is not known which of the multiple haplotype inferences are best supported by the given unphased data set, we use a bootstrapping procedure that allows for scoring of alternative inferences. Finally, by means of the bootstrapping scores, it is possible to optimise the phased genotypes belonging to a given haplotype inference. The program is evaluated with simulated and experimental SNP data generated for heterozygous tetraploid populations of potato. We show that, instead of taking the first haplotype inference reported by the program, we can significantly improve the quality of the final result by applying additional methods that include scoring of the alternative haplotype inferences and genotype optimisation. For a sub-population of nineteen individuals, the predicted results computed by SATlotyper were directly compared with results obtained by experimental haplotype inference via sequencing of cloned amplicons. Prediction and experiment gave similar results regarding the inferred haplotypes and phased genotypes. CONCLUSION: Our results suggest that Haplotype Inference by Pure Parsimony can be solved efficiently by the SAT approach, even for data sets of unphased SNP from heterozygous polyploids. SATlotyper is freeware and is distributed as a Java JAR file. The software can be downloaded from the webpage of the GABI Primary Database at . The application of SATlotyper will provide haplotype information, which can be used in haplotype association mapping studies of polyploid plants.
format Text
id pubmed-2566320
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-25663202008-10-15 Haplotype inference from unphased SNP data in heterozygous polyploids based on SAT Neigenfind, Jost Gyetvai, Gabor Basekow, Rico Diehl, Svenja Achenbach, Ute Gebhardt, Christiane Selbig, Joachim Kersten, Birgit BMC Genomics Software BACKGROUND: Haplotype inference based on unphased SNP markers is an important task in population genetics. Although there are different approaches to the inference of haplotypes in diploid species, the existing software is not suitable for inferring haplotypes from unphased SNP data in polyploid species, such as the cultivated potato (Solanum tuberosum). Potato species are tetraploid and highly heterozygous. RESULTS: Here we present the software SATlotyper which is able to handle polyploid and polyallelic data. SATlo-typer uses the Boolean satisfiability problem to formulate Haplotype Inference by Pure Parsimony. The software excludes existing haplotype inferences, thus allowing for calculation of alternative inferences. As it is not known which of the multiple haplotype inferences are best supported by the given unphased data set, we use a bootstrapping procedure that allows for scoring of alternative inferences. Finally, by means of the bootstrapping scores, it is possible to optimise the phased genotypes belonging to a given haplotype inference. The program is evaluated with simulated and experimental SNP data generated for heterozygous tetraploid populations of potato. We show that, instead of taking the first haplotype inference reported by the program, we can significantly improve the quality of the final result by applying additional methods that include scoring of the alternative haplotype inferences and genotype optimisation. For a sub-population of nineteen individuals, the predicted results computed by SATlotyper were directly compared with results obtained by experimental haplotype inference via sequencing of cloned amplicons. Prediction and experiment gave similar results regarding the inferred haplotypes and phased genotypes. CONCLUSION: Our results suggest that Haplotype Inference by Pure Parsimony can be solved efficiently by the SAT approach, even for data sets of unphased SNP from heterozygous polyploids. SATlotyper is freeware and is distributed as a Java JAR file. The software can be downloaded from the webpage of the GABI Primary Database at . The application of SATlotyper will provide haplotype information, which can be used in haplotype association mapping studies of polyploid plants. BioMed Central 2008-07-30 /pmc/articles/PMC2566320/ /pubmed/18667059 http://dx.doi.org/10.1186/1471-2164-9-356 Text en Copyright © 2008 Neigenfind et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Neigenfind, Jost
Gyetvai, Gabor
Basekow, Rico
Diehl, Svenja
Achenbach, Ute
Gebhardt, Christiane
Selbig, Joachim
Kersten, Birgit
Haplotype inference from unphased SNP data in heterozygous polyploids based on SAT
title Haplotype inference from unphased SNP data in heterozygous polyploids based on SAT
title_full Haplotype inference from unphased SNP data in heterozygous polyploids based on SAT
title_fullStr Haplotype inference from unphased SNP data in heterozygous polyploids based on SAT
title_full_unstemmed Haplotype inference from unphased SNP data in heterozygous polyploids based on SAT
title_short Haplotype inference from unphased SNP data in heterozygous polyploids based on SAT
title_sort haplotype inference from unphased snp data in heterozygous polyploids based on sat
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2566320/
https://www.ncbi.nlm.nih.gov/pubmed/18667059
http://dx.doi.org/10.1186/1471-2164-9-356
work_keys_str_mv AT neigenfindjost haplotypeinferencefromunphasedsnpdatainheterozygouspolyploidsbasedonsat
AT gyetvaigabor haplotypeinferencefromunphasedsnpdatainheterozygouspolyploidsbasedonsat
AT basekowrico haplotypeinferencefromunphasedsnpdatainheterozygouspolyploidsbasedonsat
AT diehlsvenja haplotypeinferencefromunphasedsnpdatainheterozygouspolyploidsbasedonsat
AT achenbachute haplotypeinferencefromunphasedsnpdatainheterozygouspolyploidsbasedonsat
AT gebhardtchristiane haplotypeinferencefromunphasedsnpdatainheterozygouspolyploidsbasedonsat
AT selbigjoachim haplotypeinferencefromunphasedsnpdatainheterozygouspolyploidsbasedonsat
AT kerstenbirgit haplotypeinferencefromunphasedsnpdatainheterozygouspolyploidsbasedonsat