Cargando…

Frequency-based haplotype reconstruction from deep sequencing data of bacterial populations

Clonal populations accumulate mutations over time, resulting in different haplotypes. Deep sequencing of such a population in principle provides information to reconstruct these haplotypes and the frequency at which the haplotypes occur. However, this reconstruction is technically not trivial, espec...

Descripción completa

Detalles Bibliográficos
Autores principales: Pulido-Tamayo, Sergio, Sánchez-Rodríguez, Aminael, Swings, Toon, Van den Bergh, Bram, Dubey, Akanksha, Steenackers, Hans, Michiels, Jan, Fostier, Jan, Marchal, Kathleen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4652744/
https://www.ncbi.nlm.nih.gov/pubmed/25990729
http://dx.doi.org/10.1093/nar/gkv478
_version_ 1782401811439681536
author Pulido-Tamayo, Sergio
Sánchez-Rodríguez, Aminael
Swings, Toon
Van den Bergh, Bram
Dubey, Akanksha
Steenackers, Hans
Michiels, Jan
Fostier, Jan
Marchal, Kathleen
author_facet Pulido-Tamayo, Sergio
Sánchez-Rodríguez, Aminael
Swings, Toon
Van den Bergh, Bram
Dubey, Akanksha
Steenackers, Hans
Michiels, Jan
Fostier, Jan
Marchal, Kathleen
author_sort Pulido-Tamayo, Sergio
collection PubMed
description Clonal populations accumulate mutations over time, resulting in different haplotypes. Deep sequencing of such a population in principle provides information to reconstruct these haplotypes and the frequency at which the haplotypes occur. However, this reconstruction is technically not trivial, especially not in clonal systems with a relatively low mutation frequency. The low number of segregating sites in those systems adds ambiguity to the haplotype phasing and thus obviates the reconstruction of genome-wide haplotypes based on sequence overlap information. Therefore, we present EVORhA, a haplotype reconstruction method that complements phasing information in the non-empty read overlap with the frequency estimations of inferred local haplotypes. As was shown with simulated data, as soon as read lengths and/or mutation rates become restrictive for state-of-the-art methods, the use of this additional frequency information allows EVORhA to still reliably reconstruct genome-wide haplotypes. On real data, we show the applicability of the method in reconstructing the population composition of evolved bacterial populations and in decomposing mixed bacterial infections from clinical samples.
format Online
Article
Text
id pubmed-4652744
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-46527442015-11-25 Frequency-based haplotype reconstruction from deep sequencing data of bacterial populations Pulido-Tamayo, Sergio Sánchez-Rodríguez, Aminael Swings, Toon Van den Bergh, Bram Dubey, Akanksha Steenackers, Hans Michiels, Jan Fostier, Jan Marchal, Kathleen Nucleic Acids Res Methods Online Clonal populations accumulate mutations over time, resulting in different haplotypes. Deep sequencing of such a population in principle provides information to reconstruct these haplotypes and the frequency at which the haplotypes occur. However, this reconstruction is technically not trivial, especially not in clonal systems with a relatively low mutation frequency. The low number of segregating sites in those systems adds ambiguity to the haplotype phasing and thus obviates the reconstruction of genome-wide haplotypes based on sequence overlap information. Therefore, we present EVORhA, a haplotype reconstruction method that complements phasing information in the non-empty read overlap with the frequency estimations of inferred local haplotypes. As was shown with simulated data, as soon as read lengths and/or mutation rates become restrictive for state-of-the-art methods, the use of this additional frequency information allows EVORhA to still reliably reconstruct genome-wide haplotypes. On real data, we show the applicability of the method in reconstructing the population composition of evolved bacterial populations and in decomposing mixed bacterial infections from clinical samples. Oxford University Press 2015-09-18 2015-05-18 /pmc/articles/PMC4652744/ /pubmed/25990729 http://dx.doi.org/10.1093/nar/gkv478 Text en © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Pulido-Tamayo, Sergio
Sánchez-Rodríguez, Aminael
Swings, Toon
Van den Bergh, Bram
Dubey, Akanksha
Steenackers, Hans
Michiels, Jan
Fostier, Jan
Marchal, Kathleen
Frequency-based haplotype reconstruction from deep sequencing data of bacterial populations
title Frequency-based haplotype reconstruction from deep sequencing data of bacterial populations
title_full Frequency-based haplotype reconstruction from deep sequencing data of bacterial populations
title_fullStr Frequency-based haplotype reconstruction from deep sequencing data of bacterial populations
title_full_unstemmed Frequency-based haplotype reconstruction from deep sequencing data of bacterial populations
title_short Frequency-based haplotype reconstruction from deep sequencing data of bacterial populations
title_sort frequency-based haplotype reconstruction from deep sequencing data of bacterial populations
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4652744/
https://www.ncbi.nlm.nih.gov/pubmed/25990729
http://dx.doi.org/10.1093/nar/gkv478
work_keys_str_mv AT pulidotamayosergio frequencybasedhaplotypereconstructionfromdeepsequencingdataofbacterialpopulations
AT sanchezrodriguezaminael frequencybasedhaplotypereconstructionfromdeepsequencingdataofbacterialpopulations
AT swingstoon frequencybasedhaplotypereconstructionfromdeepsequencingdataofbacterialpopulations
AT vandenberghbram frequencybasedhaplotypereconstructionfromdeepsequencingdataofbacterialpopulations
AT dubeyakanksha frequencybasedhaplotypereconstructionfromdeepsequencingdataofbacterialpopulations
AT steenackershans frequencybasedhaplotypereconstructionfromdeepsequencingdataofbacterialpopulations
AT michielsjan frequencybasedhaplotypereconstructionfromdeepsequencingdataofbacterialpopulations
AT fostierjan frequencybasedhaplotypereconstructionfromdeepsequencingdataofbacterialpopulations
AT marchalkathleen frequencybasedhaplotypereconstructionfromdeepsequencingdataofbacterialpopulations