Cargando…

A theoretical and generalized approach for the assessment of the sample-specific limit of detection for clinical metagenomics

Metagenomics is a powerful tool to identify novel or unexpected pathogens, since it is generic and relatively unbiased. The limit of detection (LOD) is a critical parameter for the routine application of methods in the clinical diagnostic context. Although attempts for the determination of LODs for...

Descripción completa

Detalles Bibliográficos
Autores principales: Ebinger, Arnt, Fischer, Susanne, Höper, Dirk
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Research Network of Computational and Structural Biotechnology 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7822954/
https://www.ncbi.nlm.nih.gov/pubmed/33552445
http://dx.doi.org/10.1016/j.csbj.2020.12.040
_version_ 1783639744863797248
author Ebinger, Arnt
Fischer, Susanne
Höper, Dirk
author_facet Ebinger, Arnt
Fischer, Susanne
Höper, Dirk
author_sort Ebinger, Arnt
collection PubMed
description Metagenomics is a powerful tool to identify novel or unexpected pathogens, since it is generic and relatively unbiased. The limit of detection (LOD) is a critical parameter for the routine application of methods in the clinical diagnostic context. Although attempts for the determination of LODs for metagenomics next-generation sequencing (mNGS) have been made previously, these were only applicable for specific target species in defined samples matrices. Therefore, we developed and validated a generalized probability-based model to assess the sample-specific LOD of mNGS experiments (LOD(mNGS)). Initial rarefaction analyses with datasets of Borna disease virus 1 human encephalitis cases revealed a stochastic behavior of virus read detection. Based on this, we transformed the Bernoulli formula to predict the minimal necessary dataset size to detect one virus read with a probability of 99%. We validated the formula with 30 datasets from diseased individuals, resulting in an accuracy of 99.1% and an average of 4.5 ± 0.4 viral reads found in the calculated minimal dataset size. We demonstrated by modeling the virus genome size, virus-, and total RNA-concentration that the main determinant of mNGS sensitivity is the virus-sample background ratio. The predicted LOD(mNGS) for the respective pathogenic virus in the datasets were congruent with the virus-concentration determined by RT-qPCR. Theoretical assumptions were further confirmed by correlation analysis of mNGS and RT-qPCR data from the samples of the analyzed datasets. This approach should guide standardization of mNGS application, due to the generalized concept of LOD(mNGS).
format Online
Article
Text
id pubmed-7822954
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Research Network of Computational and Structural Biotechnology
record_format MEDLINE/PubMed
spelling pubmed-78229542021-02-04 A theoretical and generalized approach for the assessment of the sample-specific limit of detection for clinical metagenomics Ebinger, Arnt Fischer, Susanne Höper, Dirk Comput Struct Biotechnol J Research Article Metagenomics is a powerful tool to identify novel or unexpected pathogens, since it is generic and relatively unbiased. The limit of detection (LOD) is a critical parameter for the routine application of methods in the clinical diagnostic context. Although attempts for the determination of LODs for metagenomics next-generation sequencing (mNGS) have been made previously, these were only applicable for specific target species in defined samples matrices. Therefore, we developed and validated a generalized probability-based model to assess the sample-specific LOD of mNGS experiments (LOD(mNGS)). Initial rarefaction analyses with datasets of Borna disease virus 1 human encephalitis cases revealed a stochastic behavior of virus read detection. Based on this, we transformed the Bernoulli formula to predict the minimal necessary dataset size to detect one virus read with a probability of 99%. We validated the formula with 30 datasets from diseased individuals, resulting in an accuracy of 99.1% and an average of 4.5 ± 0.4 viral reads found in the calculated minimal dataset size. We demonstrated by modeling the virus genome size, virus-, and total RNA-concentration that the main determinant of mNGS sensitivity is the virus-sample background ratio. The predicted LOD(mNGS) for the respective pathogenic virus in the datasets were congruent with the virus-concentration determined by RT-qPCR. Theoretical assumptions were further confirmed by correlation analysis of mNGS and RT-qPCR data from the samples of the analyzed datasets. This approach should guide standardization of mNGS application, due to the generalized concept of LOD(mNGS). Research Network of Computational and Structural Biotechnology 2020-12-26 /pmc/articles/PMC7822954/ /pubmed/33552445 http://dx.doi.org/10.1016/j.csbj.2020.12.040 Text en © 2021 Friedrich-Loeffler-Institut http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Research Article
Ebinger, Arnt
Fischer, Susanne
Höper, Dirk
A theoretical and generalized approach for the assessment of the sample-specific limit of detection for clinical metagenomics
title A theoretical and generalized approach for the assessment of the sample-specific limit of detection for clinical metagenomics
title_full A theoretical and generalized approach for the assessment of the sample-specific limit of detection for clinical metagenomics
title_fullStr A theoretical and generalized approach for the assessment of the sample-specific limit of detection for clinical metagenomics
title_full_unstemmed A theoretical and generalized approach for the assessment of the sample-specific limit of detection for clinical metagenomics
title_short A theoretical and generalized approach for the assessment of the sample-specific limit of detection for clinical metagenomics
title_sort theoretical and generalized approach for the assessment of the sample-specific limit of detection for clinical metagenomics
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7822954/
https://www.ncbi.nlm.nih.gov/pubmed/33552445
http://dx.doi.org/10.1016/j.csbj.2020.12.040
work_keys_str_mv AT ebingerarnt atheoreticalandgeneralizedapproachfortheassessmentofthesamplespecificlimitofdetectionforclinicalmetagenomics
AT fischersusanne atheoreticalandgeneralizedapproachfortheassessmentofthesamplespecificlimitofdetectionforclinicalmetagenomics
AT hoperdirk atheoreticalandgeneralizedapproachfortheassessmentofthesamplespecificlimitofdetectionforclinicalmetagenomics
AT ebingerarnt theoreticalandgeneralizedapproachfortheassessmentofthesamplespecificlimitofdetectionforclinicalmetagenomics
AT fischersusanne theoreticalandgeneralizedapproachfortheassessmentofthesamplespecificlimitofdetectionforclinicalmetagenomics
AT hoperdirk theoreticalandgeneralizedapproachfortheassessmentofthesamplespecificlimitofdetectionforclinicalmetagenomics