Cargando…

Long-read sequencing for reliably calling the mompS allele in Legionella pneumophila sequence-based typing

Sequence-based typing (SBT) of Legionella pneumophila is a valuable tool in epidemiological studies and outbreak investigations of Legionnaires’ disease. In the L. pneumophila SBT scheme, mompS2 is one of seven genes that determine the sequence type (ST). The Legionella genome typically contains two...

Descripción completa

Detalles Bibliográficos
Autores principales: Krøvel, Anne Vatland, Hetland, Marit A. K., Bernhoff, Eva, Bjørheim, Anna Steensen, Soma, Markus André, Löhr, Iren H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10226664/
https://www.ncbi.nlm.nih.gov/pubmed/37256104
http://dx.doi.org/10.3389/fcimb.2023.1176182
_version_ 1785050619964293120
author Krøvel, Anne Vatland
Hetland, Marit A. K.
Bernhoff, Eva
Bjørheim, Anna Steensen
Soma, Markus André
Löhr, Iren H.
author_facet Krøvel, Anne Vatland
Hetland, Marit A. K.
Bernhoff, Eva
Bjørheim, Anna Steensen
Soma, Markus André
Löhr, Iren H.
author_sort Krøvel, Anne Vatland
collection PubMed
description Sequence-based typing (SBT) of Legionella pneumophila is a valuable tool in epidemiological studies and outbreak investigations of Legionnaires’ disease. In the L. pneumophila SBT scheme, mompS2 is one of seven genes that determine the sequence type (ST). The Legionella genome typically contains two copies of mompS (mompS1 and mompS2). When they are non-identical it can be challenging to determine the mompS2 allele, and subsequently the ST, from Illumina short-reads. In our collection of 233 L. pneumophila genomes, there were 62 STs, 18 of which carried non-identical mompS copies. Using short-reads, the mompS2 allele was misassembled or untypeable in several STs. Genomes belonging to ST154 and ST574, which carried mompS1 allele 7 and mompS2 allele 15, were assigned an incorrect mompS2 allele and/or mompS gene copy number when short-read assembled. For other isolates, mainly those carrying non-identical mompS copies, short-read assemblers occasionally failed to resolve the structure of the mompS-region, also resulting in untypeability from the short-read data. In this study, we wanted to understand the challenges we observed with calling the mompS2 allele from short-reads, assess if other short-read methods were able to resolve the mompS-region, and investigate the possibility of using long-reads to obtain the mompS alleles, and thereby perform L. pneumophila SBT from long-reads only. We found that the choice of short-read assembler had a major impact on resolving the mompS-region and thus SBT from short-reads, but no method consistently solved the mompS2 allele. By using Oxford Nanopore Technology (ONT) sequencing together with Trycycler and Medaka for long-read assembly and polishing we were able to resolve the mompS copies and correctly identify the mompS2 allele, in accordance with Sanger sequencing/EQA results for all tested isolates (n=35). The remaining six genes of the SBT profile could also be determined from the ONT-only reads. The STs called from ONT-only assemblies were also consistent with hybrid-assemblies of Illumina and ONT reads. We therefore propose ONT sequencing as an alternative method to perform L. pneumophila SBT to overcome the mompS challenge observed with short-reads. To facilitate this, we have developed ONTmompS (https://github.com/marithetland/ONTmompS), an in silico approach to determine L. pneumophila ST from long-read or hybrid assemblies.
format Online
Article
Text
id pubmed-10226664
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-102266642023-05-30 Long-read sequencing for reliably calling the mompS allele in Legionella pneumophila sequence-based typing Krøvel, Anne Vatland Hetland, Marit A. K. Bernhoff, Eva Bjørheim, Anna Steensen Soma, Markus André Löhr, Iren H. Front Cell Infect Microbiol Cellular and Infection Microbiology Sequence-based typing (SBT) of Legionella pneumophila is a valuable tool in epidemiological studies and outbreak investigations of Legionnaires’ disease. In the L. pneumophila SBT scheme, mompS2 is one of seven genes that determine the sequence type (ST). The Legionella genome typically contains two copies of mompS (mompS1 and mompS2). When they are non-identical it can be challenging to determine the mompS2 allele, and subsequently the ST, from Illumina short-reads. In our collection of 233 L. pneumophila genomes, there were 62 STs, 18 of which carried non-identical mompS copies. Using short-reads, the mompS2 allele was misassembled or untypeable in several STs. Genomes belonging to ST154 and ST574, which carried mompS1 allele 7 and mompS2 allele 15, were assigned an incorrect mompS2 allele and/or mompS gene copy number when short-read assembled. For other isolates, mainly those carrying non-identical mompS copies, short-read assemblers occasionally failed to resolve the structure of the mompS-region, also resulting in untypeability from the short-read data. In this study, we wanted to understand the challenges we observed with calling the mompS2 allele from short-reads, assess if other short-read methods were able to resolve the mompS-region, and investigate the possibility of using long-reads to obtain the mompS alleles, and thereby perform L. pneumophila SBT from long-reads only. We found that the choice of short-read assembler had a major impact on resolving the mompS-region and thus SBT from short-reads, but no method consistently solved the mompS2 allele. By using Oxford Nanopore Technology (ONT) sequencing together with Trycycler and Medaka for long-read assembly and polishing we were able to resolve the mompS copies and correctly identify the mompS2 allele, in accordance with Sanger sequencing/EQA results for all tested isolates (n=35). The remaining six genes of the SBT profile could also be determined from the ONT-only reads. The STs called from ONT-only assemblies were also consistent with hybrid-assemblies of Illumina and ONT reads. We therefore propose ONT sequencing as an alternative method to perform L. pneumophila SBT to overcome the mompS challenge observed with short-reads. To facilitate this, we have developed ONTmompS (https://github.com/marithetland/ONTmompS), an in silico approach to determine L. pneumophila ST from long-read or hybrid assemblies. Frontiers Media S.A. 2023-05-15 /pmc/articles/PMC10226664/ /pubmed/37256104 http://dx.doi.org/10.3389/fcimb.2023.1176182 Text en Copyright © 2023 Krøvel, Hetland, Bernhoff, Bjørheim, Soma and Löhr https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Cellular and Infection Microbiology
Krøvel, Anne Vatland
Hetland, Marit A. K.
Bernhoff, Eva
Bjørheim, Anna Steensen
Soma, Markus André
Löhr, Iren H.
Long-read sequencing for reliably calling the mompS allele in Legionella pneumophila sequence-based typing
title Long-read sequencing for reliably calling the mompS allele in Legionella pneumophila sequence-based typing
title_full Long-read sequencing for reliably calling the mompS allele in Legionella pneumophila sequence-based typing
title_fullStr Long-read sequencing for reliably calling the mompS allele in Legionella pneumophila sequence-based typing
title_full_unstemmed Long-read sequencing for reliably calling the mompS allele in Legionella pneumophila sequence-based typing
title_short Long-read sequencing for reliably calling the mompS allele in Legionella pneumophila sequence-based typing
title_sort long-read sequencing for reliably calling the momps allele in legionella pneumophila sequence-based typing
topic Cellular and Infection Microbiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10226664/
https://www.ncbi.nlm.nih.gov/pubmed/37256104
http://dx.doi.org/10.3389/fcimb.2023.1176182
work_keys_str_mv AT krøvelannevatland longreadsequencingforreliablycallingthemompsalleleinlegionellapneumophilasequencebasedtyping
AT hetlandmaritak longreadsequencingforreliablycallingthemompsalleleinlegionellapneumophilasequencebasedtyping
AT bernhoffeva longreadsequencingforreliablycallingthemompsalleleinlegionellapneumophilasequencebasedtyping
AT bjørheimannasteensen longreadsequencingforreliablycallingthemompsalleleinlegionellapneumophilasequencebasedtyping
AT somamarkusandre longreadsequencingforreliablycallingthemompsalleleinlegionellapneumophilasequencebasedtyping
AT lohrirenh longreadsequencingforreliablycallingthemompsalleleinlegionellapneumophilasequencebasedtyping