Cargando…

WinHAP2: an extremely fast haplotype phasing program for long genotype sequences

BACKGROUND: The haplotype phasing problem tries to screen for phenotype associated genomic variations from millions of candidate data. Most of the current computer programs handle this problem with high requirements of computing power and memory. By replacing the computation-intensive step of constr...

Descripción completa

Detalles Bibliográficos
Autores principales: Pan, Weihua, Zhao, Yanan, Xu, Yun, Zhou, Fengfeng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4094983/
https://www.ncbi.nlm.nih.gov/pubmed/24884701
http://dx.doi.org/10.1186/1471-2105-15-164
_version_ 1782325934089568256
author Pan, Weihua
Zhao, Yanan
Xu, Yun
Zhou, Fengfeng
author_facet Pan, Weihua
Zhao, Yanan
Xu, Yun
Zhou, Fengfeng
author_sort Pan, Weihua
collection PubMed
description BACKGROUND: The haplotype phasing problem tries to screen for phenotype associated genomic variations from millions of candidate data. Most of the current computer programs handle this problem with high requirements of computing power and memory. By replacing the computation-intensive step of constructing the maximum spanning tree with a heuristics of estimated initial haplotype, we released the WinHAP algorithm version 1.0, which outperforms the other algorithms in terms of both running speed and overall accuracy. RESULTS: This work further speeds up the WinHAP algorithm to version 2.0 (WinHAP2) by utilizing the divide-and-conquer strategy and the OpenMP parallel computing mode. WinHAP2 can phase 500 genotypes with 1,000,000 SNPs using just 12.8 MB in memory and 2.5 hours on a personal computer, whereas the other programs require unacceptable memory or running times. The parallel running mode further improves WinHAP2's running speed with several orders of magnitudes, compared with the other programs, including Beagle, SHAPEIT2 and 2SNP. CONCLUSIONS: WinHAP2 is an extremely fast haplotype phasing program which can handle a large-scale genotyping study with any number of SNPs in the current literature and at least in the near future.
format Online
Article
Text
id pubmed-4094983
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-40949832014-07-23 WinHAP2: an extremely fast haplotype phasing program for long genotype sequences Pan, Weihua Zhao, Yanan Xu, Yun Zhou, Fengfeng BMC Bioinformatics Software BACKGROUND: The haplotype phasing problem tries to screen for phenotype associated genomic variations from millions of candidate data. Most of the current computer programs handle this problem with high requirements of computing power and memory. By replacing the computation-intensive step of constructing the maximum spanning tree with a heuristics of estimated initial haplotype, we released the WinHAP algorithm version 1.0, which outperforms the other algorithms in terms of both running speed and overall accuracy. RESULTS: This work further speeds up the WinHAP algorithm to version 2.0 (WinHAP2) by utilizing the divide-and-conquer strategy and the OpenMP parallel computing mode. WinHAP2 can phase 500 genotypes with 1,000,000 SNPs using just 12.8 MB in memory and 2.5 hours on a personal computer, whereas the other programs require unacceptable memory or running times. The parallel running mode further improves WinHAP2's running speed with several orders of magnitudes, compared with the other programs, including Beagle, SHAPEIT2 and 2SNP. CONCLUSIONS: WinHAP2 is an extremely fast haplotype phasing program which can handle a large-scale genotyping study with any number of SNPs in the current literature and at least in the near future. BioMed Central 2014-05-30 /pmc/articles/PMC4094983/ /pubmed/24884701 http://dx.doi.org/10.1186/1471-2105-15-164 Text en Copyright © 2014 Pan et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Pan, Weihua
Zhao, Yanan
Xu, Yun
Zhou, Fengfeng
WinHAP2: an extremely fast haplotype phasing program for long genotype sequences
title WinHAP2: an extremely fast haplotype phasing program for long genotype sequences
title_full WinHAP2: an extremely fast haplotype phasing program for long genotype sequences
title_fullStr WinHAP2: an extremely fast haplotype phasing program for long genotype sequences
title_full_unstemmed WinHAP2: an extremely fast haplotype phasing program for long genotype sequences
title_short WinHAP2: an extremely fast haplotype phasing program for long genotype sequences
title_sort winhap2: an extremely fast haplotype phasing program for long genotype sequences
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4094983/
https://www.ncbi.nlm.nih.gov/pubmed/24884701
http://dx.doi.org/10.1186/1471-2105-15-164
work_keys_str_mv AT panweihua winhap2anextremelyfasthaplotypephasingprogramforlonggenotypesequences
AT zhaoyanan winhap2anextremelyfasthaplotypephasingprogramforlonggenotypesequences
AT xuyun winhap2anextremelyfasthaplotypephasingprogramforlonggenotypesequences
AT zhoufengfeng winhap2anextremelyfasthaplotypephasingprogramforlonggenotypesequences