Cargando…
flopp: Extremely Fast Long-Read Polyploid Haplotype Phasing by Uniform Tree Partitioning
Resolving haplotypes in polyploid genomes using phase information from sequencing reads is an important and challenging problem. We introduce two new mathematical formulations of polyploid haplotype phasing: (1) the min-sum max tree partition problem, which is a more flexible graphical metric compar...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Mary Ann Liebert, Inc., publishers
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8892958/ https://www.ncbi.nlm.nih.gov/pubmed/35041529 http://dx.doi.org/10.1089/cmb.2021.0436 |
_version_ | 1784662284863275008 |
---|---|
author | Shaw, Jim Yu, Yun William |
author_facet | Shaw, Jim Yu, Yun William |
author_sort | Shaw, Jim |
collection | PubMed |
description | Resolving haplotypes in polyploid genomes using phase information from sequencing reads is an important and challenging problem. We introduce two new mathematical formulations of polyploid haplotype phasing: (1) the min-sum max tree partition problem, which is a more flexible graphical metric compared with the standard minimum error correction (MEC) model in the polyploid setting, and (2) the uniform probabilistic error minimization model, which is a probabilistic analogue of the MEC model. We incorporate both formulations into a long-read based polyploid haplotype phasing method called flopp. We show that flopp compares favorably with state-of-the-art algorithms—up to 30 times faster with 2 times fewer switch errors on 6 × ploidy simulated data. Further, we show using real nanopore data that flopp can quickly reveal reasonable haplotype structures from the autotetraploid Solanum tuberosum (potato). |
format | Online Article Text |
id | pubmed-8892958 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Mary Ann Liebert, Inc., publishers |
record_format | MEDLINE/PubMed |
spelling | pubmed-88929582022-03-03 flopp: Extremely Fast Long-Read Polyploid Haplotype Phasing by Uniform Tree Partitioning Shaw, Jim Yu, Yun William J Comput Biol Research Articles Resolving haplotypes in polyploid genomes using phase information from sequencing reads is an important and challenging problem. We introduce two new mathematical formulations of polyploid haplotype phasing: (1) the min-sum max tree partition problem, which is a more flexible graphical metric compared with the standard minimum error correction (MEC) model in the polyploid setting, and (2) the uniform probabilistic error minimization model, which is a probabilistic analogue of the MEC model. We incorporate both formulations into a long-read based polyploid haplotype phasing method called flopp. We show that flopp compares favorably with state-of-the-art algorithms—up to 30 times faster with 2 times fewer switch errors on 6 × ploidy simulated data. Further, we show using real nanopore data that flopp can quickly reveal reasonable haplotype structures from the autotetraploid Solanum tuberosum (potato). Mary Ann Liebert, Inc., publishers 2022-02-01 2022-02-16 /pmc/articles/PMC8892958/ /pubmed/35041529 http://dx.doi.org/10.1089/cmb.2021.0436 Text en © Jim Shaw and Yun William Yu, 2022. Published by Mary Ann Liebert. Inc. https://creativecommons.org/licenses/by/4.0/This Open Access article is distributed under the terms of the Creative Commons License (http://creativecommons.org/licenses/by/4.0 (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. |
spellingShingle | Research Articles Shaw, Jim Yu, Yun William flopp: Extremely Fast Long-Read Polyploid Haplotype Phasing by Uniform Tree Partitioning |
title | flopp: Extremely Fast Long-Read Polyploid Haplotype Phasing by Uniform Tree Partitioning |
title_full | flopp: Extremely Fast Long-Read Polyploid Haplotype Phasing by Uniform Tree Partitioning |
title_fullStr | flopp: Extremely Fast Long-Read Polyploid Haplotype Phasing by Uniform Tree Partitioning |
title_full_unstemmed | flopp: Extremely Fast Long-Read Polyploid Haplotype Phasing by Uniform Tree Partitioning |
title_short | flopp: Extremely Fast Long-Read Polyploid Haplotype Phasing by Uniform Tree Partitioning |
title_sort | flopp: extremely fast long-read polyploid haplotype phasing by uniform tree partitioning |
topic | Research Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8892958/ https://www.ncbi.nlm.nih.gov/pubmed/35041529 http://dx.doi.org/10.1089/cmb.2021.0436 |
work_keys_str_mv | AT shawjim floppextremelyfastlongreadpolyploidhaplotypephasingbyuniformtreepartitioning AT yuyunwilliam floppextremelyfastlongreadpolyploidhaplotypephasingbyuniformtreepartitioning |