Cargando…

Probabilistic single-individual haplotyping

Motivation: Accurate haplotyping—determining from which parent particular portions of the genome are inherited—is still mostly an unresolved problem in genomics. This problem has only recently started to become tractable, thanks to the development of new long read sequencing technologies. Here, we i...

Descripción completa

Detalles Bibliográficos
Autor principal: Kuleshov, Volodymyr
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4147930/
https://www.ncbi.nlm.nih.gov/pubmed/25161223
http://dx.doi.org/10.1093/bioinformatics/btu484
_version_ 1782332540571353088
author Kuleshov, Volodymyr
author_facet Kuleshov, Volodymyr
author_sort Kuleshov, Volodymyr
collection PubMed
description Motivation: Accurate haplotyping—determining from which parent particular portions of the genome are inherited—is still mostly an unresolved problem in genomics. This problem has only recently started to become tractable, thanks to the development of new long read sequencing technologies. Here, we introduce ProbHap, a haplotyping algorithm targeted at such technologies. The main algorithmic idea of ProbHap is a new dynamic programming algorithm that exactly optimizes a likelihood function specified by a probabilistic graphical model and which generalizes a popular objective called the minimum error correction. In addition to being accurate, ProbHap also provides confidence scores at phased positions. Results: On a standard benchmark dataset, ProbHap makes 11% fewer errors than current state-of-the-art methods. This accuracy can be further increased by excluding low-confidence positions, at the cost of a small drop in haplotype completeness. Availability: Our source code is freely available at: https://github.com/kuleshov/ProbHap. Contact: kuleshov@stanford.edu
format Online
Article
Text
id pubmed-4147930
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-41479302014-09-02 Probabilistic single-individual haplotyping Kuleshov, Volodymyr Bioinformatics Eccb 2014 Proceedings Papers Committee Motivation: Accurate haplotyping—determining from which parent particular portions of the genome are inherited—is still mostly an unresolved problem in genomics. This problem has only recently started to become tractable, thanks to the development of new long read sequencing technologies. Here, we introduce ProbHap, a haplotyping algorithm targeted at such technologies. The main algorithmic idea of ProbHap is a new dynamic programming algorithm that exactly optimizes a likelihood function specified by a probabilistic graphical model and which generalizes a popular objective called the minimum error correction. In addition to being accurate, ProbHap also provides confidence scores at phased positions. Results: On a standard benchmark dataset, ProbHap makes 11% fewer errors than current state-of-the-art methods. This accuracy can be further increased by excluding low-confidence positions, at the cost of a small drop in haplotype completeness. Availability: Our source code is freely available at: https://github.com/kuleshov/ProbHap. Contact: kuleshov@stanford.edu Oxford University Press 2014-09-01 2014-08-22 /pmc/articles/PMC4147930/ /pubmed/25161223 http://dx.doi.org/10.1093/bioinformatics/btu484 Text en © The Author 2014. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Eccb 2014 Proceedings Papers Committee
Kuleshov, Volodymyr
Probabilistic single-individual haplotyping
title Probabilistic single-individual haplotyping
title_full Probabilistic single-individual haplotyping
title_fullStr Probabilistic single-individual haplotyping
title_full_unstemmed Probabilistic single-individual haplotyping
title_short Probabilistic single-individual haplotyping
title_sort probabilistic single-individual haplotyping
topic Eccb 2014 Proceedings Papers Committee
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4147930/
https://www.ncbi.nlm.nih.gov/pubmed/25161223
http://dx.doi.org/10.1093/bioinformatics/btu484
work_keys_str_mv AT kuleshovvolodymyr probabilisticsingleindividualhaplotyping