Cargando…

Extending Maximal Perfect Haplotype Blocks to the Realm of Pangenomics

Recent work provides the first method to measure the relative fitness of genomic variants within a population that scales to large numbers of genomes. A key component of the computation involves finding conserved haplotype blocks, which can be done in linear time. Here, we extend the notion of conse...

Descripción completa

Detalles Bibliográficos
Autores principales: Williams, Lucia, Mumey, Brendan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7197059/
http://dx.doi.org/10.1007/978-3-030-42266-0_4
_version_ 1783528808919334912
author Williams, Lucia
Mumey, Brendan
author_facet Williams, Lucia
Mumey, Brendan
author_sort Williams, Lucia
collection PubMed
description Recent work provides the first method to measure the relative fitness of genomic variants within a population that scales to large numbers of genomes. A key component of the computation involves finding conserved haplotype blocks, which can be done in linear time. Here, we extend the notion of conserved haplotype blocks to pangenomes, which can store more complex variation than a single reference genome. We define a maximal perfect pangenome haplotype block and give a linear-time, suffix tree based approach to find all such blocks from a set of pangenome haplotypes. We demonstrate the method by applying it to a pangenome built from yeast strains.
format Online
Article
Text
id pubmed-7197059
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-71970592020-05-04 Extending Maximal Perfect Haplotype Blocks to the Realm of Pangenomics Williams, Lucia Mumey, Brendan Algorithms for Computational Biology Article Recent work provides the first method to measure the relative fitness of genomic variants within a population that scales to large numbers of genomes. A key component of the computation involves finding conserved haplotype blocks, which can be done in linear time. Here, we extend the notion of conserved haplotype blocks to pangenomes, which can store more complex variation than a single reference genome. We define a maximal perfect pangenome haplotype block and give a linear-time, suffix tree based approach to find all such blocks from a set of pangenome haplotypes. We demonstrate the method by applying it to a pangenome built from yeast strains. 2020-02-01 /pmc/articles/PMC7197059/ http://dx.doi.org/10.1007/978-3-030-42266-0_4 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Williams, Lucia
Mumey, Brendan
Extending Maximal Perfect Haplotype Blocks to the Realm of Pangenomics
title Extending Maximal Perfect Haplotype Blocks to the Realm of Pangenomics
title_full Extending Maximal Perfect Haplotype Blocks to the Realm of Pangenomics
title_fullStr Extending Maximal Perfect Haplotype Blocks to the Realm of Pangenomics
title_full_unstemmed Extending Maximal Perfect Haplotype Blocks to the Realm of Pangenomics
title_short Extending Maximal Perfect Haplotype Blocks to the Realm of Pangenomics
title_sort extending maximal perfect haplotype blocks to the realm of pangenomics
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7197059/
http://dx.doi.org/10.1007/978-3-030-42266-0_4
work_keys_str_mv AT williamslucia extendingmaximalperfecthaplotypeblockstotherealmofpangenomics
AT mumeybrendan extendingmaximalperfecthaplotypeblockstotherealmofpangenomics