Cargando…

Pandora: nucleotide-resolution bacterial pan-genomics with reference graphs

We present pandora, a novel pan-genome graph structure and algorithms for identifying variants across the full bacterial pan-genome. As much bacterial adaptability hinges on the accessory genome, methods which analyze SNPs in just the core genome have unsatisfactory limitations. Pandora approximates...

Descripción completa

Detalles Bibliográficos
Autores principales: Colquhoun, Rachel M., Hall, Michael B., Lima, Leandro, Roberts, Leah W., Malone, Kerri M., Hunt, Martin, Letcher, Brice, Hawkey, Jane, George, Sophie, Pankhurst, Louise, Iqbal, Zamin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8442373/
https://www.ncbi.nlm.nih.gov/pubmed/34521456
http://dx.doi.org/10.1186/s13059-021-02473-1
Descripción
Sumario:We present pandora, a novel pan-genome graph structure and algorithms for identifying variants across the full bacterial pan-genome. As much bacterial adaptability hinges on the accessory genome, methods which analyze SNPs in just the core genome have unsatisfactory limitations. Pandora approximates a sequenced genome as a recombinant of references, detects novel variation and pan-genotypes multiple samples. Using a reference graph of 578 Escherichia coli genomes, we compare 20 diverse isolates. Pandora recovers more rare SNPs than single-reference-based tools, is significantly better than picking the closest RefSeq reference, and provides a stable framework for analyzing diverse samples without reference bias. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-021-02473-1.