Cargando…

HAHap: a read-based haplotyping method using hierarchical assembly

BACKGROUND: The need for read-based phasing arises with advances in sequencing technologies. The minimum error correction (MEC) approach is the primary trend to resolve haplotypes by reducing conflicts in a single nucleotide polymorphism-fragment matrix. However, it is frequently observed that the s...

Descripción completa

Detalles Bibliográficos
Autores principales: Lin, Yu-Yu, Wu, Ping Chun, Chen, Pei-Lung, Oyang, Yen-Jen, Chen, Chien-Yu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6214236/
https://www.ncbi.nlm.nih.gov/pubmed/30397550
http://dx.doi.org/10.7717/peerj.5852
_version_ 1783367945715449856
author Lin, Yu-Yu
Wu, Ping Chun
Chen, Pei-Lung
Oyang, Yen-Jen
Chen, Chien-Yu
author_facet Lin, Yu-Yu
Wu, Ping Chun
Chen, Pei-Lung
Oyang, Yen-Jen
Chen, Chien-Yu
author_sort Lin, Yu-Yu
collection PubMed
description BACKGROUND: The need for read-based phasing arises with advances in sequencing technologies. The minimum error correction (MEC) approach is the primary trend to resolve haplotypes by reducing conflicts in a single nucleotide polymorphism-fragment matrix. However, it is frequently observed that the solution with the optimal MEC might not be the real haplotypes, due to the fact that MEC methods consider all positions together and sometimes the conflicts in noisy regions might mislead the selection of corrections. To tackle this problem, we present a hierarchical assembly-based method designed to progressively resolve local conflicts. RESULTS: This study presents HAHap, a new phasing algorithm based on hierarchical assembly. HAHap leverages high-confident variant pairs to build haplotypes progressively. The phasing results by HAHap on both real and simulated data, compared to other MEC-based methods, revealed better phasing error rates for constructing haplotypes using short reads from whole-genome sequencing. We compared the number of error corrections (ECs) on real data with other methods, and it reveals the ability of HAHap to predict haplotypes with a lower number of ECs. We also used simulated data to investigate the behavior of HAHap under different sequencing conditions, highlighting the applicability of HAHap in certain situations.
format Online
Article
Text
id pubmed-6214236
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-62142362018-11-05 HAHap: a read-based haplotyping method using hierarchical assembly Lin, Yu-Yu Wu, Ping Chun Chen, Pei-Lung Oyang, Yen-Jen Chen, Chien-Yu PeerJ Bioinformatics BACKGROUND: The need for read-based phasing arises with advances in sequencing technologies. The minimum error correction (MEC) approach is the primary trend to resolve haplotypes by reducing conflicts in a single nucleotide polymorphism-fragment matrix. However, it is frequently observed that the solution with the optimal MEC might not be the real haplotypes, due to the fact that MEC methods consider all positions together and sometimes the conflicts in noisy regions might mislead the selection of corrections. To tackle this problem, we present a hierarchical assembly-based method designed to progressively resolve local conflicts. RESULTS: This study presents HAHap, a new phasing algorithm based on hierarchical assembly. HAHap leverages high-confident variant pairs to build haplotypes progressively. The phasing results by HAHap on both real and simulated data, compared to other MEC-based methods, revealed better phasing error rates for constructing haplotypes using short reads from whole-genome sequencing. We compared the number of error corrections (ECs) on real data with other methods, and it reveals the ability of HAHap to predict haplotypes with a lower number of ECs. We also used simulated data to investigate the behavior of HAHap under different sequencing conditions, highlighting the applicability of HAHap in certain situations. PeerJ Inc. 2018-10-30 /pmc/articles/PMC6214236/ /pubmed/30397550 http://dx.doi.org/10.7717/peerj.5852 Text en © 2018 Lin et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Bioinformatics
Lin, Yu-Yu
Wu, Ping Chun
Chen, Pei-Lung
Oyang, Yen-Jen
Chen, Chien-Yu
HAHap: a read-based haplotyping method using hierarchical assembly
title HAHap: a read-based haplotyping method using hierarchical assembly
title_full HAHap: a read-based haplotyping method using hierarchical assembly
title_fullStr HAHap: a read-based haplotyping method using hierarchical assembly
title_full_unstemmed HAHap: a read-based haplotyping method using hierarchical assembly
title_short HAHap: a read-based haplotyping method using hierarchical assembly
title_sort hahap: a read-based haplotyping method using hierarchical assembly
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6214236/
https://www.ncbi.nlm.nih.gov/pubmed/30397550
http://dx.doi.org/10.7717/peerj.5852
work_keys_str_mv AT linyuyu hahapareadbasedhaplotypingmethodusinghierarchicalassembly
AT wupingchun hahapareadbasedhaplotypingmethodusinghierarchicalassembly
AT chenpeilung hahapareadbasedhaplotypingmethodusinghierarchicalassembly
AT oyangyenjen hahapareadbasedhaplotypingmethodusinghierarchicalassembly
AT chenchienyu hahapareadbasedhaplotypingmethodusinghierarchicalassembly