Cargando…

WinHAP: An Efficient Haplotype Phasing Algorithm Based on Scalable Sliding Windows

Haplotype phasing represents an essential step in studying the association of genomic polymorphisms with complex genetic diseases, and in determining targets for drug designing. In recent years, huge amounts of genotype data are produced from the rapidly evolving high-throughput sequencing technolog...

Descripción completa

Detalles Bibliográficos
Autores principales: Xu, Yun, Cheng, Wenhua, Nie, Pengyu, Zhou, Fengfeng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3419172/
https://www.ncbi.nlm.nih.gov/pubmed/22905221
http://dx.doi.org/10.1371/journal.pone.0043163
_version_ 1782240695369596928
author Xu, Yun
Cheng, Wenhua
Nie, Pengyu
Zhou, Fengfeng
author_facet Xu, Yun
Cheng, Wenhua
Nie, Pengyu
Zhou, Fengfeng
author_sort Xu, Yun
collection PubMed
description Haplotype phasing represents an essential step in studying the association of genomic polymorphisms with complex genetic diseases, and in determining targets for drug designing. In recent years, huge amounts of genotype data are produced from the rapidly evolving high-throughput sequencing technologies, and the data volume challenges the community with more efficient haplotype phasing algorithms, in the senses of both running time and overall accuracy. 2SNP is one of the fastest haplotype phasing algorithms with comparable low error rates with the other algorithms. The most time-consuming step of 2SNP is the construction of a maximum spanning tree (MST) among all the heterozygous SNP pairs. We simplified this step by replacing the MST with the initial haplotypes of adjacent heterozygous SNP pairs. The multi-SNP haplotypes were estimated within a sliding window along the chromosomes. The comparative studies on four different-scale genotype datasets suggest that our algorithm WinHAP outperforms 2SNP and most of the other haplotype phasing algorithms in terms of both running speeds and overall accuracies. To facilitate the WinHAP’s application in more practical biological datasets, we released the software for free at: http://staff.ustc.edu.cn/~xuyun/winhap/index.htm.
format Online
Article
Text
id pubmed-3419172
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-34191722012-08-19 WinHAP: An Efficient Haplotype Phasing Algorithm Based on Scalable Sliding Windows Xu, Yun Cheng, Wenhua Nie, Pengyu Zhou, Fengfeng PLoS One Research Article Haplotype phasing represents an essential step in studying the association of genomic polymorphisms with complex genetic diseases, and in determining targets for drug designing. In recent years, huge amounts of genotype data are produced from the rapidly evolving high-throughput sequencing technologies, and the data volume challenges the community with more efficient haplotype phasing algorithms, in the senses of both running time and overall accuracy. 2SNP is one of the fastest haplotype phasing algorithms with comparable low error rates with the other algorithms. The most time-consuming step of 2SNP is the construction of a maximum spanning tree (MST) among all the heterozygous SNP pairs. We simplified this step by replacing the MST with the initial haplotypes of adjacent heterozygous SNP pairs. The multi-SNP haplotypes were estimated within a sliding window along the chromosomes. The comparative studies on four different-scale genotype datasets suggest that our algorithm WinHAP outperforms 2SNP and most of the other haplotype phasing algorithms in terms of both running speeds and overall accuracies. To facilitate the WinHAP’s application in more practical biological datasets, we released the software for free at: http://staff.ustc.edu.cn/~xuyun/winhap/index.htm. Public Library of Science 2012-08-14 /pmc/articles/PMC3419172/ /pubmed/22905221 http://dx.doi.org/10.1371/journal.pone.0043163 Text en © 2012 Xu et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Xu, Yun
Cheng, Wenhua
Nie, Pengyu
Zhou, Fengfeng
WinHAP: An Efficient Haplotype Phasing Algorithm Based on Scalable Sliding Windows
title WinHAP: An Efficient Haplotype Phasing Algorithm Based on Scalable Sliding Windows
title_full WinHAP: An Efficient Haplotype Phasing Algorithm Based on Scalable Sliding Windows
title_fullStr WinHAP: An Efficient Haplotype Phasing Algorithm Based on Scalable Sliding Windows
title_full_unstemmed WinHAP: An Efficient Haplotype Phasing Algorithm Based on Scalable Sliding Windows
title_short WinHAP: An Efficient Haplotype Phasing Algorithm Based on Scalable Sliding Windows
title_sort winhap: an efficient haplotype phasing algorithm based on scalable sliding windows
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3419172/
https://www.ncbi.nlm.nih.gov/pubmed/22905221
http://dx.doi.org/10.1371/journal.pone.0043163
work_keys_str_mv AT xuyun winhapanefficienthaplotypephasingalgorithmbasedonscalableslidingwindows
AT chengwenhua winhapanefficienthaplotypephasingalgorithmbasedonscalableslidingwindows
AT niepengyu winhapanefficienthaplotypephasingalgorithmbasedonscalableslidingwindows
AT zhoufengfeng winhapanefficienthaplotypephasingalgorithmbasedonscalableslidingwindows