Identification of isolated or mixed strains from long reads: a challenge met on Streptococcus thermophilus using a MinION sequencer

This study aimed to provide efficient recognition of bacterial strains on personal computers from MinION (Nanopore) long read data. Thanks to the fall in sequencing costs, the identification of bacteria can now proceed by whole genome sequencing. MinION is a fast, but highly error-prone sequencing d...

Bibliographic Details
Main authors: Siekaniec, Grégoire; Roux, Emeline; Lemane, Téo; Guédon, Eric; Nicolas, Jacques
Format: Online Article Text
Language: English
Published: Microbiology Society 2021
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8743539/
https://www.ncbi.nlm.nih.gov/pubmed/34812718
http://dx.doi.org/10.1099/mgen.0.000654
author Siekaniec, Grégoire
Roux, Emeline
Lemane, Téo
Guédon, Eric
Nicolas, Jacques
author_sort Siekaniec, Grégoire
collection PubMed
description This study aimed to provide efficient recognition of bacterial strains on personal computers from MinION (Nanopore) long read data. Thanks to the fall in sequencing costs, the identification of bacteria can now proceed by whole genome sequencing. MinION is a fast, but highly error-prone sequencing device and it is a challenge to successfully identify the strain content of unknown simple or complex microbial samples. It is heavily constrained by memory management and fast access to the read and genome fragments. Our strategy involves three steps: indexing of known genomic sequences for a given or several bacterial species; a request process to assign a read to a strain by matching it to the closest reference genomes; and a final step looking for a minimum set of strains that best explains the observed reads. We have applied our method, called ORI, on 77 strains of Streptococcus thermophilus. We worked on several genomic distances and obtained a detailed classification of the strains, together with a criterion that allows merging of what we termed ‘sibling’ strains, only separated by a few mutations. Overall, isolated strains can be safely recognized from MinION data. For mixtures of several non-sibling strains, results depend on strain abundance.
format Online
Article
Text
id pubmed-8743539
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Microbiology Society
record_format MEDLINE/PubMed
spelling pubmed-8743539 2022-01-10
Microb Genom
Research Articles
Microbiology Society 2021-11-23
Text en
© 2021 The Authors
https://creativecommons.org/licenses/by-nc/4.0/
This is an open-access article distributed under the terms of the Creative Commons Attribution NonCommercial License. This article was made open access via a Publish and Read agreement between the Microbiology Society and the corresponding author’s institution.
title Identification of isolated or mixed strains from long reads: a challenge met on Streptococcus thermophilus using a MinION sequencer
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8743539/
https://www.ncbi.nlm.nih.gov/pubmed/34812718
http://dx.doi.org/10.1099/mgen.0.000654