Cargando…
WinHAP2: an extremely fast haplotype phasing program for long genotype sequences
BACKGROUND: The haplotype phasing problem tries to screen for phenotype associated genomic variations from millions of candidate data. Most of the current computer programs handle this problem with high requirements of computing power and memory. By replacing the computation-intensive step of constr...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4094983/ https://www.ncbi.nlm.nih.gov/pubmed/24884701 http://dx.doi.org/10.1186/1471-2105-15-164 |
_version_ | 1782325934089568256 |
---|---|
author | Pan, Weihua Zhao, Yanan Xu, Yun Zhou, Fengfeng |
author_facet | Pan, Weihua Zhao, Yanan Xu, Yun Zhou, Fengfeng |
author_sort | Pan, Weihua |
collection | PubMed |
description | BACKGROUND: The haplotype phasing problem tries to screen for phenotype associated genomic variations from millions of candidate data. Most of the current computer programs handle this problem with high requirements of computing power and memory. By replacing the computation-intensive step of constructing the maximum spanning tree with a heuristics of estimated initial haplotype, we released the WinHAP algorithm version 1.0, which outperforms the other algorithms in terms of both running speed and overall accuracy. RESULTS: This work further speeds up the WinHAP algorithm to version 2.0 (WinHAP2) by utilizing the divide-and-conquer strategy and the OpenMP parallel computing mode. WinHAP2 can phase 500 genotypes with 1,000,000 SNPs using just 12.8 MB in memory and 2.5 hours on a personal computer, whereas the other programs require unacceptable memory or running times. The parallel running mode further improves WinHAP2's running speed with several orders of magnitudes, compared with the other programs, including Beagle, SHAPEIT2 and 2SNP. CONCLUSIONS: WinHAP2 is an extremely fast haplotype phasing program which can handle a large-scale genotyping study with any number of SNPs in the current literature and at least in the near future. |
format | Online Article Text |
id | pubmed-4094983 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-40949832014-07-23 WinHAP2: an extremely fast haplotype phasing program for long genotype sequences Pan, Weihua Zhao, Yanan Xu, Yun Zhou, Fengfeng BMC Bioinformatics Software BACKGROUND: The haplotype phasing problem tries to screen for phenotype associated genomic variations from millions of candidate data. Most of the current computer programs handle this problem with high requirements of computing power and memory. By replacing the computation-intensive step of constructing the maximum spanning tree with a heuristics of estimated initial haplotype, we released the WinHAP algorithm version 1.0, which outperforms the other algorithms in terms of both running speed and overall accuracy. RESULTS: This work further speeds up the WinHAP algorithm to version 2.0 (WinHAP2) by utilizing the divide-and-conquer strategy and the OpenMP parallel computing mode. WinHAP2 can phase 500 genotypes with 1,000,000 SNPs using just 12.8 MB in memory and 2.5 hours on a personal computer, whereas the other programs require unacceptable memory or running times. The parallel running mode further improves WinHAP2's running speed with several orders of magnitudes, compared with the other programs, including Beagle, SHAPEIT2 and 2SNP. CONCLUSIONS: WinHAP2 is an extremely fast haplotype phasing program which can handle a large-scale genotyping study with any number of SNPs in the current literature and at least in the near future. BioMed Central 2014-05-30 /pmc/articles/PMC4094983/ /pubmed/24884701 http://dx.doi.org/10.1186/1471-2105-15-164 Text en Copyright © 2014 Pan et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Software Pan, Weihua Zhao, Yanan Xu, Yun Zhou, Fengfeng WinHAP2: an extremely fast haplotype phasing program for long genotype sequences |
title | WinHAP2: an extremely fast haplotype phasing program for long genotype sequences |
title_full | WinHAP2: an extremely fast haplotype phasing program for long genotype sequences |
title_fullStr | WinHAP2: an extremely fast haplotype phasing program for long genotype sequences |
title_full_unstemmed | WinHAP2: an extremely fast haplotype phasing program for long genotype sequences |
title_short | WinHAP2: an extremely fast haplotype phasing program for long genotype sequences |
title_sort | winhap2: an extremely fast haplotype phasing program for long genotype sequences |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4094983/ https://www.ncbi.nlm.nih.gov/pubmed/24884701 http://dx.doi.org/10.1186/1471-2105-15-164 |
work_keys_str_mv | AT panweihua winhap2anextremelyfasthaplotypephasingprogramforlonggenotypesequences AT zhaoyanan winhap2anextremelyfasthaplotypephasingprogramforlonggenotypesequences AT xuyun winhap2anextremelyfasthaplotypephasingprogramforlonggenotypesequences AT zhoufengfeng winhap2anextremelyfasthaplotypephasingprogramforlonggenotypesequences |