Cargando…

High-resolution sweep metagenomics using fast probabilistic inference

Determining the composition of bacterial communities beyond the level of a genus or species is challenging because of the considerable overlap between genomes representing close relatives. Here, we present the mSWEEP pipeline for identifying and estimating the relative sequence abundances of bacteri...

Descripción completa

Detalles Bibliográficos
Autores principales: Mäklin, Tommi, Kallonen, Teemu, David, Sophia, Boinett, Christine J., Pascoe, Ben, Méric, Guillaume, Aanensen, David M., Feil, Edward J., Baker, Stephen, Parkhill, Julian, Sheppard, Samuel K., Corander, Jukka, Honkela, Antti
Formato: Online Artículo Texto
Lenguaje:English
Publicado: F1000 Research Limited 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8543175/
https://www.ncbi.nlm.nih.gov/pubmed/34746439
http://dx.doi.org/10.12688/wellcomeopenres.15639.2
Descripción
Sumario:Determining the composition of bacterial communities beyond the level of a genus or species is challenging because of the considerable overlap between genomes representing close relatives. Here, we present the mSWEEP pipeline for identifying and estimating the relative sequence abundances of bacterial lineages from plate sweeps of enrichment cultures. mSWEEP leverages biologically grouped sequence assembly databases, applying probabilistic modelling, and provides controls for false positive results. Using sequencing data from major pathogens, we demonstrate significant improvements in lineage quantification and detection accuracy. Our pipeline facilitates investigating cultures comprising mixtures of bacteria, and opens up a new field of plate sweep metagenomics.