Cargando…

HaploJuice : accurate haplotype assembly from a pool of sequences with known relative concentrations

BACKGROUND: Pooling techniques, where multiple sub-samples are mixed in a single sample, are widely used to take full advantage of high-throughput DNA sequencing. Recently, Ranjard et al. (PLoS ONE 13:0195090, 2018) proposed a pooling strategy without the use of barcodes. Three sub-samples were mixe...

Descripción completa

Detalles Bibliográficos
Autores principales: Wong, Thomas K. F., Ranjard, Louis, Lin, Yu, Rodrigo, Allen G.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6198429/
https://www.ncbi.nlm.nih.gov/pubmed/30348075
http://dx.doi.org/10.1186/s12859-018-2424-7
_version_ 1783364964528947200
author Wong, Thomas K. F.
Ranjard, Louis
Lin, Yu
Rodrigo, Allen G.
author_facet Wong, Thomas K. F.
Ranjard, Louis
Lin, Yu
Rodrigo, Allen G.
author_sort Wong, Thomas K. F.
collection PubMed
description BACKGROUND: Pooling techniques, where multiple sub-samples are mixed in a single sample, are widely used to take full advantage of high-throughput DNA sequencing. Recently, Ranjard et al. (PLoS ONE 13:0195090, 2018) proposed a pooling strategy without the use of barcodes. Three sub-samples were mixed in different known proportions (i.e. 62.5%, 25% and 12.5%), and a method was developed to use these proportions to reconstruct the three haplotypes effectively. RESULTS: HaploJuice provides an alternative haplotype reconstruction algorithm for Ranjard et al.’s pooling strategy. HaploJuice significantly increases the accuracy by first identifying the empirical proportions of the three mixed sub-samples and then assembling the haplotypes using a dynamic programming approach. HaploJuice was evaluated against five different assembly algorithms, Hmmfreq (Ranjard et al., PLoS ONE 13:0195090, 2018), ShoRAH (Zagordi et al., BMC Bioinformatics 12:119, 2011), SAVAGE (Baaijens et al., Genome Res 27:835-848, 2017), PredictHaplo (Prabhakaran et al., IEEE/ACM Trans Comput Biol Bioinform 11:182-91, 2014) and QuRe (Prosperi and Salemi, Bioinformatics 28:132-3, 2012). Using simulated and real data sets, HaploJuice reconstructed the true sequences with the highest coverage and the lowest error rate. CONCLUSION: HaploJuice provides high accuracy in haplotype reconstruction, making Ranjard et al.’s pooling strategy more efficient, feasible, and applicable, with the benefit of reducing the sequencing cost.
format Online
Article
Text
id pubmed-6198429
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-61984292018-10-31 HaploJuice : accurate haplotype assembly from a pool of sequences with known relative concentrations Wong, Thomas K. F. Ranjard, Louis Lin, Yu Rodrigo, Allen G. BMC Bioinformatics Methodology Article BACKGROUND: Pooling techniques, where multiple sub-samples are mixed in a single sample, are widely used to take full advantage of high-throughput DNA sequencing. Recently, Ranjard et al. (PLoS ONE 13:0195090, 2018) proposed a pooling strategy without the use of barcodes. Three sub-samples were mixed in different known proportions (i.e. 62.5%, 25% and 12.5%), and a method was developed to use these proportions to reconstruct the three haplotypes effectively. RESULTS: HaploJuice provides an alternative haplotype reconstruction algorithm for Ranjard et al.’s pooling strategy. HaploJuice significantly increases the accuracy by first identifying the empirical proportions of the three mixed sub-samples and then assembling the haplotypes using a dynamic programming approach. HaploJuice was evaluated against five different assembly algorithms, Hmmfreq (Ranjard et al., PLoS ONE 13:0195090, 2018), ShoRAH (Zagordi et al., BMC Bioinformatics 12:119, 2011), SAVAGE (Baaijens et al., Genome Res 27:835-848, 2017), PredictHaplo (Prabhakaran et al., IEEE/ACM Trans Comput Biol Bioinform 11:182-91, 2014) and QuRe (Prosperi and Salemi, Bioinformatics 28:132-3, 2012). Using simulated and real data sets, HaploJuice reconstructed the true sequences with the highest coverage and the lowest error rate. CONCLUSION: HaploJuice provides high accuracy in haplotype reconstruction, making Ranjard et al.’s pooling strategy more efficient, feasible, and applicable, with the benefit of reducing the sequencing cost. BioMed Central 2018-10-22 /pmc/articles/PMC6198429/ /pubmed/30348075 http://dx.doi.org/10.1186/s12859-018-2424-7 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Wong, Thomas K. F.
Ranjard, Louis
Lin, Yu
Rodrigo, Allen G.
HaploJuice : accurate haplotype assembly from a pool of sequences with known relative concentrations
title HaploJuice : accurate haplotype assembly from a pool of sequences with known relative concentrations
title_full HaploJuice : accurate haplotype assembly from a pool of sequences with known relative concentrations
title_fullStr HaploJuice : accurate haplotype assembly from a pool of sequences with known relative concentrations
title_full_unstemmed HaploJuice : accurate haplotype assembly from a pool of sequences with known relative concentrations
title_short HaploJuice : accurate haplotype assembly from a pool of sequences with known relative concentrations
title_sort haplojuice : accurate haplotype assembly from a pool of sequences with known relative concentrations
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6198429/
https://www.ncbi.nlm.nih.gov/pubmed/30348075
http://dx.doi.org/10.1186/s12859-018-2424-7
work_keys_str_mv AT wongthomaskf haplojuiceaccuratehaplotypeassemblyfromapoolofsequenceswithknownrelativeconcentrations
AT ranjardlouis haplojuiceaccuratehaplotypeassemblyfromapoolofsequenceswithknownrelativeconcentrations
AT linyu haplojuiceaccuratehaplotypeassemblyfromapoolofsequenceswithknownrelativeconcentrations
AT rodrigoalleng haplojuiceaccuratehaplotypeassemblyfromapoolofsequenceswithknownrelativeconcentrations