Cargando…

Evaluation of variant calling algorithms for wastewater-based epidemiology using mixed populations of SARS-CoV-2 variants in synthetic and wastewater samples

Wastewater-based epidemiology has been used extensively throughout the COVID-19 (coronavirus disease 19) pandemic to detect and monitor the spread and prevalence of SARS-CoV-2 (severe acute respiratory syndrome coronavirus 2) and its variants. It has proven an excellent, complementary tool to clinic...

Descripción completa

Detalles Bibliográficos
Autores principales: Bassano, Irene, Ramachandran, Vinoy K., Khalifa, Mohammad S., Lilley, Chris J., Brown, Mathew R., van Aerle, Ronny, Denise, Hubert, Rowe, William, George, Airey, Cairns, Edward, Wierzbicki, Claudia, Pickwell, Natalie D., Carlile, Matthew, Holmes, Nadine, Payne, Alexander, Loose, Matthew, Burke, Terry A., Paterson, Steve, Wade, Matthew J., Grimsley, Jasmine M. S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Microbiology Society 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10210938/
https://www.ncbi.nlm.nih.gov/pubmed/37074153
http://dx.doi.org/10.1099/mgen.0.000933
_version_ 1785047176145010688
author Bassano, Irene
Ramachandran, Vinoy K.
Khalifa, Mohammad S.
Lilley, Chris J.
Brown, Mathew R.
van Aerle, Ronny
Denise, Hubert
Rowe, William
George, Airey
Cairns, Edward
Wierzbicki, Claudia
Pickwell, Natalie D.
Carlile, Matthew
Holmes, Nadine
Payne, Alexander
Loose, Matthew
Burke, Terry A.
Paterson, Steve
Wade, Matthew J.
Grimsley, Jasmine M. S.
author_facet Bassano, Irene
Ramachandran, Vinoy K.
Khalifa, Mohammad S.
Lilley, Chris J.
Brown, Mathew R.
van Aerle, Ronny
Denise, Hubert
Rowe, William
George, Airey
Cairns, Edward
Wierzbicki, Claudia
Pickwell, Natalie D.
Carlile, Matthew
Holmes, Nadine
Payne, Alexander
Loose, Matthew
Burke, Terry A.
Paterson, Steve
Wade, Matthew J.
Grimsley, Jasmine M. S.
author_sort Bassano, Irene
collection PubMed
description Wastewater-based epidemiology has been used extensively throughout the COVID-19 (coronavirus disease 19) pandemic to detect and monitor the spread and prevalence of SARS-CoV-2 (severe acute respiratory syndrome coronavirus 2) and its variants. It has proven an excellent, complementary tool to clinical sequencing, supporting the insights gained and helping to make informed public-health decisions. Consequently, many groups globally have developed bioinformatics pipelines to analyse sequencing data from wastewater. Accurate calling of mutations is critical in this process and in the assignment of circulating variants; yet, to date, the performance of variant-calling algorithms in wastewater samples has not been investigated. To address this, we compared the performance of six variant callers (VarScan, iVar, GATK, FreeBayes, LoFreq and BCFtools), used widely in bioinformatics pipelines, on 19 synthetic samples with known ratios of three different SARS-CoV-2 variants of concern (VOCs) (Alpha, Beta and Delta), as well as 13 wastewater samples collected in London between the 15th and 18th December 2021. We used the fundamental parameters of recall (sensitivity) and precision (specificity) to confirm the presence of mutational profiles defining specific variants across the six variant callers. Our results show that BCFtools, FreeBayes and VarScan found the expected variants with higher precision and recall than GATK or iVar, although the latter identified more expected defining mutations than other callers. LoFreq gave the least reliable results due to the high number of false-positive mutations detected, resulting in lower precision. Similar results were obtained for both the synthetic and wastewater samples.
format Online
Article
Text
id pubmed-10210938
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Microbiology Society
record_format MEDLINE/PubMed
spelling pubmed-102109382023-05-26 Evaluation of variant calling algorithms for wastewater-based epidemiology using mixed populations of SARS-CoV-2 variants in synthetic and wastewater samples Bassano, Irene Ramachandran, Vinoy K. Khalifa, Mohammad S. Lilley, Chris J. Brown, Mathew R. van Aerle, Ronny Denise, Hubert Rowe, William George, Airey Cairns, Edward Wierzbicki, Claudia Pickwell, Natalie D. Carlile, Matthew Holmes, Nadine Payne, Alexander Loose, Matthew Burke, Terry A. Paterson, Steve Wade, Matthew J. Grimsley, Jasmine M. S. Microb Genom Research Articles Wastewater-based epidemiology has been used extensively throughout the COVID-19 (coronavirus disease 19) pandemic to detect and monitor the spread and prevalence of SARS-CoV-2 (severe acute respiratory syndrome coronavirus 2) and its variants. It has proven an excellent, complementary tool to clinical sequencing, supporting the insights gained and helping to make informed public-health decisions. Consequently, many groups globally have developed bioinformatics pipelines to analyse sequencing data from wastewater. Accurate calling of mutations is critical in this process and in the assignment of circulating variants; yet, to date, the performance of variant-calling algorithms in wastewater samples has not been investigated. To address this, we compared the performance of six variant callers (VarScan, iVar, GATK, FreeBayes, LoFreq and BCFtools), used widely in bioinformatics pipelines, on 19 synthetic samples with known ratios of three different SARS-CoV-2 variants of concern (VOCs) (Alpha, Beta and Delta), as well as 13 wastewater samples collected in London between the 15th and 18th December 2021. We used the fundamental parameters of recall (sensitivity) and precision (specificity) to confirm the presence of mutational profiles defining specific variants across the six variant callers. Our results show that BCFtools, FreeBayes and VarScan found the expected variants with higher precision and recall than GATK or iVar, although the latter identified more expected defining mutations than other callers. LoFreq gave the least reliable results due to the high number of false-positive mutations detected, resulting in lower precision. Similar results were obtained for both the synthetic and wastewater samples. Microbiology Society 2023-04-19 /pmc/articles/PMC10210938/ /pubmed/37074153 http://dx.doi.org/10.1099/mgen.0.000933 Text en © 2023 The Authors https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License. This article was made open access via a Publish and Read agreement between the Microbiology Society and the corresponding author’s institution.
spellingShingle Research Articles
Bassano, Irene
Ramachandran, Vinoy K.
Khalifa, Mohammad S.
Lilley, Chris J.
Brown, Mathew R.
van Aerle, Ronny
Denise, Hubert
Rowe, William
George, Airey
Cairns, Edward
Wierzbicki, Claudia
Pickwell, Natalie D.
Carlile, Matthew
Holmes, Nadine
Payne, Alexander
Loose, Matthew
Burke, Terry A.
Paterson, Steve
Wade, Matthew J.
Grimsley, Jasmine M. S.
Evaluation of variant calling algorithms for wastewater-based epidemiology using mixed populations of SARS-CoV-2 variants in synthetic and wastewater samples
title Evaluation of variant calling algorithms for wastewater-based epidemiology using mixed populations of SARS-CoV-2 variants in synthetic and wastewater samples
title_full Evaluation of variant calling algorithms for wastewater-based epidemiology using mixed populations of SARS-CoV-2 variants in synthetic and wastewater samples
title_fullStr Evaluation of variant calling algorithms for wastewater-based epidemiology using mixed populations of SARS-CoV-2 variants in synthetic and wastewater samples
title_full_unstemmed Evaluation of variant calling algorithms for wastewater-based epidemiology using mixed populations of SARS-CoV-2 variants in synthetic and wastewater samples
title_short Evaluation of variant calling algorithms for wastewater-based epidemiology using mixed populations of SARS-CoV-2 variants in synthetic and wastewater samples
title_sort evaluation of variant calling algorithms for wastewater-based epidemiology using mixed populations of sars-cov-2 variants in synthetic and wastewater samples
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10210938/
https://www.ncbi.nlm.nih.gov/pubmed/37074153
http://dx.doi.org/10.1099/mgen.0.000933
work_keys_str_mv AT bassanoirene evaluationofvariantcallingalgorithmsforwastewaterbasedepidemiologyusingmixedpopulationsofsarscov2variantsinsyntheticandwastewatersamples
AT ramachandranvinoyk evaluationofvariantcallingalgorithmsforwastewaterbasedepidemiologyusingmixedpopulationsofsarscov2variantsinsyntheticandwastewatersamples
AT khalifamohammads evaluationofvariantcallingalgorithmsforwastewaterbasedepidemiologyusingmixedpopulationsofsarscov2variantsinsyntheticandwastewatersamples
AT lilleychrisj evaluationofvariantcallingalgorithmsforwastewaterbasedepidemiologyusingmixedpopulationsofsarscov2variantsinsyntheticandwastewatersamples
AT brownmathewr evaluationofvariantcallingalgorithmsforwastewaterbasedepidemiologyusingmixedpopulationsofsarscov2variantsinsyntheticandwastewatersamples
AT vanaerleronny evaluationofvariantcallingalgorithmsforwastewaterbasedepidemiologyusingmixedpopulationsofsarscov2variantsinsyntheticandwastewatersamples
AT denisehubert evaluationofvariantcallingalgorithmsforwastewaterbasedepidemiologyusingmixedpopulationsofsarscov2variantsinsyntheticandwastewatersamples
AT rowewilliam evaluationofvariantcallingalgorithmsforwastewaterbasedepidemiologyusingmixedpopulationsofsarscov2variantsinsyntheticandwastewatersamples
AT georgeairey evaluationofvariantcallingalgorithmsforwastewaterbasedepidemiologyusingmixedpopulationsofsarscov2variantsinsyntheticandwastewatersamples
AT cairnsedward evaluationofvariantcallingalgorithmsforwastewaterbasedepidemiologyusingmixedpopulationsofsarscov2variantsinsyntheticandwastewatersamples
AT wierzbickiclaudia evaluationofvariantcallingalgorithmsforwastewaterbasedepidemiologyusingmixedpopulationsofsarscov2variantsinsyntheticandwastewatersamples
AT pickwellnatalied evaluationofvariantcallingalgorithmsforwastewaterbasedepidemiologyusingmixedpopulationsofsarscov2variantsinsyntheticandwastewatersamples
AT carlilematthew evaluationofvariantcallingalgorithmsforwastewaterbasedepidemiologyusingmixedpopulationsofsarscov2variantsinsyntheticandwastewatersamples
AT holmesnadine evaluationofvariantcallingalgorithmsforwastewaterbasedepidemiologyusingmixedpopulationsofsarscov2variantsinsyntheticandwastewatersamples
AT paynealexander evaluationofvariantcallingalgorithmsforwastewaterbasedepidemiologyusingmixedpopulationsofsarscov2variantsinsyntheticandwastewatersamples
AT loosematthew evaluationofvariantcallingalgorithmsforwastewaterbasedepidemiologyusingmixedpopulationsofsarscov2variantsinsyntheticandwastewatersamples
AT burketerrya evaluationofvariantcallingalgorithmsforwastewaterbasedepidemiologyusingmixedpopulationsofsarscov2variantsinsyntheticandwastewatersamples
AT patersonsteve evaluationofvariantcallingalgorithmsforwastewaterbasedepidemiologyusingmixedpopulationsofsarscov2variantsinsyntheticandwastewatersamples
AT wadematthewj evaluationofvariantcallingalgorithmsforwastewaterbasedepidemiologyusingmixedpopulationsofsarscov2variantsinsyntheticandwastewatersamples
AT grimsleyjasminems evaluationofvariantcallingalgorithmsforwastewaterbasedepidemiologyusingmixedpopulationsofsarscov2variantsinsyntheticandwastewatersamples