Cargando…

HapTree: A Novel Bayesian Framework for Single Individual Polyplotyping Using NGS Data

As the more recent next-generation sequencing (NGS) technologies provide longer read sequences, the use of sequencing datasets for complete haplotype phasing is fast becoming a reality, allowing haplotype reconstruction of a single sequenced genome. Nearly all previous haplotype reconstruction studi...

Descripción completa

Detalles Bibliográficos
Autores principales: Berger, Emily, Yorukoglu, Deniz, Peng, Jian, Berger, Bonnie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3967924/
https://www.ncbi.nlm.nih.gov/pubmed/24675685
http://dx.doi.org/10.1371/journal.pcbi.1003502
_version_ 1782309077399896064
author Berger, Emily
Yorukoglu, Deniz
Peng, Jian
Berger, Bonnie
author_facet Berger, Emily
Yorukoglu, Deniz
Peng, Jian
Berger, Bonnie
author_sort Berger, Emily
collection PubMed
description As the more recent next-generation sequencing (NGS) technologies provide longer read sequences, the use of sequencing datasets for complete haplotype phasing is fast becoming a reality, allowing haplotype reconstruction of a single sequenced genome. Nearly all previous haplotype reconstruction studies have focused on diploid genomes and are rarely scalable to genomes with higher ploidy. Yet computational investigations into polyploid genomes carry great importance, impacting plant, yeast and fish genomics, as well as the studies of the evolution of modern-day eukaryotes and (epi)genetic interactions between copies of genes. In this paper, we describe a novel maximum-likelihood estimation framework, HapTree, for polyploid haplotype assembly of an individual genome using NGS read datasets. We evaluate the performance of HapTree on simulated polyploid sequencing read data modeled after Illumina sequencing technologies. For triploid and higher ploidy genomes, we demonstrate that HapTree substantially improves haplotype assembly accuracy and efficiency over the state-of-the-art; moreover, HapTree is the first scalable polyplotyping method for higher ploidy. As a proof of concept, we also test our method on real sequencing data from NA12878 (1000 Genomes Project) and evaluate the quality of assembled haplotypes with respect to trio-based diplotype annotation as the ground truth. The results indicate that HapTree significantly improves the switch accuracy within phased haplotype blocks as compared to existing haplotype assembly methods, while producing comparable minimum error correction (MEC) values. A summary of this paper appears in the proceedings of the RECOMB 2014 conference, April 2–5.
format Online
Article
Text
id pubmed-3967924
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-39679242014-04-01 HapTree: A Novel Bayesian Framework for Single Individual Polyplotyping Using NGS Data Berger, Emily Yorukoglu, Deniz Peng, Jian Berger, Bonnie PLoS Comput Biol Research Article As the more recent next-generation sequencing (NGS) technologies provide longer read sequences, the use of sequencing datasets for complete haplotype phasing is fast becoming a reality, allowing haplotype reconstruction of a single sequenced genome. Nearly all previous haplotype reconstruction studies have focused on diploid genomes and are rarely scalable to genomes with higher ploidy. Yet computational investigations into polyploid genomes carry great importance, impacting plant, yeast and fish genomics, as well as the studies of the evolution of modern-day eukaryotes and (epi)genetic interactions between copies of genes. In this paper, we describe a novel maximum-likelihood estimation framework, HapTree, for polyploid haplotype assembly of an individual genome using NGS read datasets. We evaluate the performance of HapTree on simulated polyploid sequencing read data modeled after Illumina sequencing technologies. For triploid and higher ploidy genomes, we demonstrate that HapTree substantially improves haplotype assembly accuracy and efficiency over the state-of-the-art; moreover, HapTree is the first scalable polyplotyping method for higher ploidy. As a proof of concept, we also test our method on real sequencing data from NA12878 (1000 Genomes Project) and evaluate the quality of assembled haplotypes with respect to trio-based diplotype annotation as the ground truth. The results indicate that HapTree significantly improves the switch accuracy within phased haplotype blocks as compared to existing haplotype assembly methods, while producing comparable minimum error correction (MEC) values. A summary of this paper appears in the proceedings of the RECOMB 2014 conference, April 2–5. Public Library of Science 2014-03-27 /pmc/articles/PMC3967924/ /pubmed/24675685 http://dx.doi.org/10.1371/journal.pcbi.1003502 Text en © 2014 Berger et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Berger, Emily
Yorukoglu, Deniz
Peng, Jian
Berger, Bonnie
HapTree: A Novel Bayesian Framework for Single Individual Polyplotyping Using NGS Data
title HapTree: A Novel Bayesian Framework for Single Individual Polyplotyping Using NGS Data
title_full HapTree: A Novel Bayesian Framework for Single Individual Polyplotyping Using NGS Data
title_fullStr HapTree: A Novel Bayesian Framework for Single Individual Polyplotyping Using NGS Data
title_full_unstemmed HapTree: A Novel Bayesian Framework for Single Individual Polyplotyping Using NGS Data
title_short HapTree: A Novel Bayesian Framework for Single Individual Polyplotyping Using NGS Data
title_sort haptree: a novel bayesian framework for single individual polyplotyping using ngs data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3967924/
https://www.ncbi.nlm.nih.gov/pubmed/24675685
http://dx.doi.org/10.1371/journal.pcbi.1003502
work_keys_str_mv AT bergeremily haptreeanovelbayesianframeworkforsingleindividualpolyplotypingusingngsdata
AT yorukogludeniz haptreeanovelbayesianframeworkforsingleindividualpolyplotypingusingngsdata
AT pengjian haptreeanovelbayesianframeworkforsingleindividualpolyplotypingusingngsdata
AT bergerbonnie haptreeanovelbayesianframeworkforsingleindividualpolyplotypingusingngsdata