Cargando…
HAHap: a read-based haplotyping method using hierarchical assembly
BACKGROUND: The need for read-based phasing arises with advances in sequencing technologies. The minimum error correction (MEC) approach is the primary trend to resolve haplotypes by reducing conflicts in a single nucleotide polymorphism-fragment matrix. However, it is frequently observed that the s...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6214236/ https://www.ncbi.nlm.nih.gov/pubmed/30397550 http://dx.doi.org/10.7717/peerj.5852 |
_version_ | 1783367945715449856 |
---|---|
author | Lin, Yu-Yu Wu, Ping Chun Chen, Pei-Lung Oyang, Yen-Jen Chen, Chien-Yu |
author_facet | Lin, Yu-Yu Wu, Ping Chun Chen, Pei-Lung Oyang, Yen-Jen Chen, Chien-Yu |
author_sort | Lin, Yu-Yu |
collection | PubMed |
description | BACKGROUND: The need for read-based phasing arises with advances in sequencing technologies. The minimum error correction (MEC) approach is the primary trend to resolve haplotypes by reducing conflicts in a single nucleotide polymorphism-fragment matrix. However, it is frequently observed that the solution with the optimal MEC might not be the real haplotypes, due to the fact that MEC methods consider all positions together and sometimes the conflicts in noisy regions might mislead the selection of corrections. To tackle this problem, we present a hierarchical assembly-based method designed to progressively resolve local conflicts. RESULTS: This study presents HAHap, a new phasing algorithm based on hierarchical assembly. HAHap leverages high-confident variant pairs to build haplotypes progressively. The phasing results by HAHap on both real and simulated data, compared to other MEC-based methods, revealed better phasing error rates for constructing haplotypes using short reads from whole-genome sequencing. We compared the number of error corrections (ECs) on real data with other methods, and it reveals the ability of HAHap to predict haplotypes with a lower number of ECs. We also used simulated data to investigate the behavior of HAHap under different sequencing conditions, highlighting the applicability of HAHap in certain situations. |
format | Online Article Text |
id | pubmed-6214236 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-62142362018-11-05 HAHap: a read-based haplotyping method using hierarchical assembly Lin, Yu-Yu Wu, Ping Chun Chen, Pei-Lung Oyang, Yen-Jen Chen, Chien-Yu PeerJ Bioinformatics BACKGROUND: The need for read-based phasing arises with advances in sequencing technologies. The minimum error correction (MEC) approach is the primary trend to resolve haplotypes by reducing conflicts in a single nucleotide polymorphism-fragment matrix. However, it is frequently observed that the solution with the optimal MEC might not be the real haplotypes, due to the fact that MEC methods consider all positions together and sometimes the conflicts in noisy regions might mislead the selection of corrections. To tackle this problem, we present a hierarchical assembly-based method designed to progressively resolve local conflicts. RESULTS: This study presents HAHap, a new phasing algorithm based on hierarchical assembly. HAHap leverages high-confident variant pairs to build haplotypes progressively. The phasing results by HAHap on both real and simulated data, compared to other MEC-based methods, revealed better phasing error rates for constructing haplotypes using short reads from whole-genome sequencing. We compared the number of error corrections (ECs) on real data with other methods, and it reveals the ability of HAHap to predict haplotypes with a lower number of ECs. We also used simulated data to investigate the behavior of HAHap under different sequencing conditions, highlighting the applicability of HAHap in certain situations. PeerJ Inc. 2018-10-30 /pmc/articles/PMC6214236/ /pubmed/30397550 http://dx.doi.org/10.7717/peerj.5852 Text en © 2018 Lin et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited. |
spellingShingle | Bioinformatics Lin, Yu-Yu Wu, Ping Chun Chen, Pei-Lung Oyang, Yen-Jen Chen, Chien-Yu HAHap: a read-based haplotyping method using hierarchical assembly |
title | HAHap: a read-based haplotyping method using hierarchical assembly |
title_full | HAHap: a read-based haplotyping method using hierarchical assembly |
title_fullStr | HAHap: a read-based haplotyping method using hierarchical assembly |
title_full_unstemmed | HAHap: a read-based haplotyping method using hierarchical assembly |
title_short | HAHap: a read-based haplotyping method using hierarchical assembly |
title_sort | hahap: a read-based haplotyping method using hierarchical assembly |
topic | Bioinformatics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6214236/ https://www.ncbi.nlm.nih.gov/pubmed/30397550 http://dx.doi.org/10.7717/peerj.5852 |
work_keys_str_mv | AT linyuyu hahapareadbasedhaplotypingmethodusinghierarchicalassembly AT wupingchun hahapareadbasedhaplotypingmethodusinghierarchicalassembly AT chenpeilung hahapareadbasedhaplotypingmethodusinghierarchicalassembly AT oyangyenjen hahapareadbasedhaplotypingmethodusinghierarchicalassembly AT chenchienyu hahapareadbasedhaplotypingmethodusinghierarchicalassembly |