Cargando…

Maximum parsimony xor haplotyping by sparse dictionary selection

BACKGROUND: Xor-genotype is a cost-effective alternative to the genotype sequence of an individual. Recent methods developed for haplotype inference have aimed at finding the solution based on xor-genotype data. Given the xor-genotypes of a group of unrelated individuals, it is possible to infer the...

Descripción completa

Detalles Bibliográficos
Autores principales: Elmas, Abdulkadir, Jajamovich, Guido H, Wang, Xiaodong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3852077/
https://www.ncbi.nlm.nih.gov/pubmed/24059285
http://dx.doi.org/10.1186/1471-2164-14-645
_version_ 1782478603940790272
author Elmas, Abdulkadir
Jajamovich, Guido H
Wang, Xiaodong
author_facet Elmas, Abdulkadir
Jajamovich, Guido H
Wang, Xiaodong
author_sort Elmas, Abdulkadir
collection PubMed
description BACKGROUND: Xor-genotype is a cost-effective alternative to the genotype sequence of an individual. Recent methods developed for haplotype inference have aimed at finding the solution based on xor-genotype data. Given the xor-genotypes of a group of unrelated individuals, it is possible to infer the haplotype pairs for each individual with the aid of a small number of regular genotypes. RESULTS: We propose a framework of maximum parsimony inference of haplotypes based on the search of a sparse dictionary, and we present a greedy method that can effectively infer the haplotype pairs given a set of xor-genotypes augmented by a small number of regular genotypes. We test the performance of the proposed approach on synthetic data sets with different number of individuals and SNPs, and compare the performances with the state-of-the-art xor-haplotyping methods PPXH and XOR-HAPLOGEN. CONCLUSIONS: Experimental results show good inference qualities for the proposed method under all circumstances, especially on large data sets. Results on a real database, CFTR, also demonstrate significantly better performance. The proposed algorithm is also capable of finding accurate solutions with missing data and/or typing errors.
format Online
Article
Text
id pubmed-3852077
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-38520772013-12-20 Maximum parsimony xor haplotyping by sparse dictionary selection Elmas, Abdulkadir Jajamovich, Guido H Wang, Xiaodong BMC Genomics Research Article BACKGROUND: Xor-genotype is a cost-effective alternative to the genotype sequence of an individual. Recent methods developed for haplotype inference have aimed at finding the solution based on xor-genotype data. Given the xor-genotypes of a group of unrelated individuals, it is possible to infer the haplotype pairs for each individual with the aid of a small number of regular genotypes. RESULTS: We propose a framework of maximum parsimony inference of haplotypes based on the search of a sparse dictionary, and we present a greedy method that can effectively infer the haplotype pairs given a set of xor-genotypes augmented by a small number of regular genotypes. We test the performance of the proposed approach on synthetic data sets with different number of individuals and SNPs, and compare the performances with the state-of-the-art xor-haplotyping methods PPXH and XOR-HAPLOGEN. CONCLUSIONS: Experimental results show good inference qualities for the proposed method under all circumstances, especially on large data sets. Results on a real database, CFTR, also demonstrate significantly better performance. The proposed algorithm is also capable of finding accurate solutions with missing data and/or typing errors. BioMed Central 2013-09-23 /pmc/articles/PMC3852077/ /pubmed/24059285 http://dx.doi.org/10.1186/1471-2164-14-645 Text en Copyright © 2013 Elmas et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Elmas, Abdulkadir
Jajamovich, Guido H
Wang, Xiaodong
Maximum parsimony xor haplotyping by sparse dictionary selection
title Maximum parsimony xor haplotyping by sparse dictionary selection
title_full Maximum parsimony xor haplotyping by sparse dictionary selection
title_fullStr Maximum parsimony xor haplotyping by sparse dictionary selection
title_full_unstemmed Maximum parsimony xor haplotyping by sparse dictionary selection
title_short Maximum parsimony xor haplotyping by sparse dictionary selection
title_sort maximum parsimony xor haplotyping by sparse dictionary selection
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3852077/
https://www.ncbi.nlm.nih.gov/pubmed/24059285
http://dx.doi.org/10.1186/1471-2164-14-645
work_keys_str_mv AT elmasabdulkadir maximumparsimonyxorhaplotypingbysparsedictionaryselection
AT jajamovichguidoh maximumparsimonyxorhaplotypingbysparsedictionaryselection
AT wangxiaodong maximumparsimonyxorhaplotypingbysparsedictionaryselection