Cargando…
HaploJuice : accurate haplotype assembly from a pool of sequences with known relative concentrations
BACKGROUND: Pooling techniques, where multiple sub-samples are mixed in a single sample, are widely used to take full advantage of high-throughput DNA sequencing. Recently, Ranjard et al. (PLoS ONE 13:0195090, 2018) proposed a pooling strategy without the use of barcodes. Three sub-samples were mixe...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6198429/ https://www.ncbi.nlm.nih.gov/pubmed/30348075 http://dx.doi.org/10.1186/s12859-018-2424-7 |
_version_ | 1783364964528947200 |
---|---|
author | Wong, Thomas K. F. Ranjard, Louis Lin, Yu Rodrigo, Allen G. |
author_facet | Wong, Thomas K. F. Ranjard, Louis Lin, Yu Rodrigo, Allen G. |
author_sort | Wong, Thomas K. F. |
collection | PubMed |
description | BACKGROUND: Pooling techniques, where multiple sub-samples are mixed in a single sample, are widely used to take full advantage of high-throughput DNA sequencing. Recently, Ranjard et al. (PLoS ONE 13:0195090, 2018) proposed a pooling strategy without the use of barcodes. Three sub-samples were mixed in different known proportions (i.e. 62.5%, 25% and 12.5%), and a method was developed to use these proportions to reconstruct the three haplotypes effectively. RESULTS: HaploJuice provides an alternative haplotype reconstruction algorithm for Ranjard et al.’s pooling strategy. HaploJuice significantly increases the accuracy by first identifying the empirical proportions of the three mixed sub-samples and then assembling the haplotypes using a dynamic programming approach. HaploJuice was evaluated against five different assembly algorithms, Hmmfreq (Ranjard et al., PLoS ONE 13:0195090, 2018), ShoRAH (Zagordi et al., BMC Bioinformatics 12:119, 2011), SAVAGE (Baaijens et al., Genome Res 27:835-848, 2017), PredictHaplo (Prabhakaran et al., IEEE/ACM Trans Comput Biol Bioinform 11:182-91, 2014) and QuRe (Prosperi and Salemi, Bioinformatics 28:132-3, 2012). Using simulated and real data sets, HaploJuice reconstructed the true sequences with the highest coverage and the lowest error rate. CONCLUSION: HaploJuice provides high accuracy in haplotype reconstruction, making Ranjard et al.’s pooling strategy more efficient, feasible, and applicable, with the benefit of reducing the sequencing cost. |
format | Online Article Text |
id | pubmed-6198429 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-61984292018-10-31 HaploJuice : accurate haplotype assembly from a pool of sequences with known relative concentrations Wong, Thomas K. F. Ranjard, Louis Lin, Yu Rodrigo, Allen G. BMC Bioinformatics Methodology Article BACKGROUND: Pooling techniques, where multiple sub-samples are mixed in a single sample, are widely used to take full advantage of high-throughput DNA sequencing. Recently, Ranjard et al. (PLoS ONE 13:0195090, 2018) proposed a pooling strategy without the use of barcodes. Three sub-samples were mixed in different known proportions (i.e. 62.5%, 25% and 12.5%), and a method was developed to use these proportions to reconstruct the three haplotypes effectively. RESULTS: HaploJuice provides an alternative haplotype reconstruction algorithm for Ranjard et al.’s pooling strategy. HaploJuice significantly increases the accuracy by first identifying the empirical proportions of the three mixed sub-samples and then assembling the haplotypes using a dynamic programming approach. HaploJuice was evaluated against five different assembly algorithms, Hmmfreq (Ranjard et al., PLoS ONE 13:0195090, 2018), ShoRAH (Zagordi et al., BMC Bioinformatics 12:119, 2011), SAVAGE (Baaijens et al., Genome Res 27:835-848, 2017), PredictHaplo (Prabhakaran et al., IEEE/ACM Trans Comput Biol Bioinform 11:182-91, 2014) and QuRe (Prosperi and Salemi, Bioinformatics 28:132-3, 2012). Using simulated and real data sets, HaploJuice reconstructed the true sequences with the highest coverage and the lowest error rate. CONCLUSION: HaploJuice provides high accuracy in haplotype reconstruction, making Ranjard et al.’s pooling strategy more efficient, feasible, and applicable, with the benefit of reducing the sequencing cost. BioMed Central 2018-10-22 /pmc/articles/PMC6198429/ /pubmed/30348075 http://dx.doi.org/10.1186/s12859-018-2424-7 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Wong, Thomas K. F. Ranjard, Louis Lin, Yu Rodrigo, Allen G. HaploJuice : accurate haplotype assembly from a pool of sequences with known relative concentrations |
title | HaploJuice : accurate haplotype assembly from a pool of sequences with known relative concentrations |
title_full | HaploJuice : accurate haplotype assembly from a pool of sequences with known relative concentrations |
title_fullStr | HaploJuice : accurate haplotype assembly from a pool of sequences with known relative concentrations |
title_full_unstemmed | HaploJuice : accurate haplotype assembly from a pool of sequences with known relative concentrations |
title_short | HaploJuice : accurate haplotype assembly from a pool of sequences with known relative concentrations |
title_sort | haplojuice : accurate haplotype assembly from a pool of sequences with known relative concentrations |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6198429/ https://www.ncbi.nlm.nih.gov/pubmed/30348075 http://dx.doi.org/10.1186/s12859-018-2424-7 |
work_keys_str_mv | AT wongthomaskf haplojuiceaccuratehaplotypeassemblyfromapoolofsequenceswithknownrelativeconcentrations AT ranjardlouis haplojuiceaccuratehaplotypeassemblyfromapoolofsequenceswithknownrelativeconcentrations AT linyu haplojuiceaccuratehaplotypeassemblyfromapoolofsequenceswithknownrelativeconcentrations AT rodrigoalleng haplojuiceaccuratehaplotypeassemblyfromapoolofsequenceswithknownrelativeconcentrations |