Cargando…

KAGE: fast alignment-free graph-based genotyping of SNPs and short indels

Genotyping is a core application of high-throughput sequencing. We present KAGE, a genotyper for SNPs and short indels that is inspired by recent developments within graph-based genome representations and alignment-free methods. KAGE uses a pan-genome representation of the population to efficiently...

Descripción completa

Detalles Bibliográficos
Autores principales: Grytten, Ivar, Dagestad Rand, Knut, Sandve, Geir Kjetil
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9531401/
https://www.ncbi.nlm.nih.gov/pubmed/36195962
http://dx.doi.org/10.1186/s13059-022-02771-2
Descripción
Sumario:Genotyping is a core application of high-throughput sequencing. We present KAGE, a genotyper for SNPs and short indels that is inspired by recent developments within graph-based genome representations and alignment-free methods. KAGE uses a pan-genome representation of the population to efficiently and accurately predict genotypes. Two novel ideas improve both the speed and accuracy: a Bayesian model incorporates genotypes from thousands of individuals to improve prediction accuracy, and a computationally efficient method leverages correlation between variants. We show that the accuracy of KAGE is at par with the best existing alignment-free genotypers, while being an order of magnitude faster. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-022-02771-2.