Cargando…
A theoretical and generalized approach for the assessment of the sample-specific limit of detection for clinical metagenomics
Metagenomics is a powerful tool to identify novel or unexpected pathogens, since it is generic and relatively unbiased. The limit of detection (LOD) is a critical parameter for the routine application of methods in the clinical diagnostic context. Although attempts for the determination of LODs for...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Research Network of Computational and Structural Biotechnology
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7822954/ https://www.ncbi.nlm.nih.gov/pubmed/33552445 http://dx.doi.org/10.1016/j.csbj.2020.12.040 |
_version_ | 1783639744863797248 |
---|---|
author | Ebinger, Arnt Fischer, Susanne Höper, Dirk |
author_facet | Ebinger, Arnt Fischer, Susanne Höper, Dirk |
author_sort | Ebinger, Arnt |
collection | PubMed |
description | Metagenomics is a powerful tool to identify novel or unexpected pathogens, since it is generic and relatively unbiased. The limit of detection (LOD) is a critical parameter for the routine application of methods in the clinical diagnostic context. Although attempts for the determination of LODs for metagenomics next-generation sequencing (mNGS) have been made previously, these were only applicable for specific target species in defined samples matrices. Therefore, we developed and validated a generalized probability-based model to assess the sample-specific LOD of mNGS experiments (LOD(mNGS)). Initial rarefaction analyses with datasets of Borna disease virus 1 human encephalitis cases revealed a stochastic behavior of virus read detection. Based on this, we transformed the Bernoulli formula to predict the minimal necessary dataset size to detect one virus read with a probability of 99%. We validated the formula with 30 datasets from diseased individuals, resulting in an accuracy of 99.1% and an average of 4.5 ± 0.4 viral reads found in the calculated minimal dataset size. We demonstrated by modeling the virus genome size, virus-, and total RNA-concentration that the main determinant of mNGS sensitivity is the virus-sample background ratio. The predicted LOD(mNGS) for the respective pathogenic virus in the datasets were congruent with the virus-concentration determined by RT-qPCR. Theoretical assumptions were further confirmed by correlation analysis of mNGS and RT-qPCR data from the samples of the analyzed datasets. This approach should guide standardization of mNGS application, due to the generalized concept of LOD(mNGS). |
format | Online Article Text |
id | pubmed-7822954 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Research Network of Computational and Structural Biotechnology |
record_format | MEDLINE/PubMed |
spelling | pubmed-78229542021-02-04 A theoretical and generalized approach for the assessment of the sample-specific limit of detection for clinical metagenomics Ebinger, Arnt Fischer, Susanne Höper, Dirk Comput Struct Biotechnol J Research Article Metagenomics is a powerful tool to identify novel or unexpected pathogens, since it is generic and relatively unbiased. The limit of detection (LOD) is a critical parameter for the routine application of methods in the clinical diagnostic context. Although attempts for the determination of LODs for metagenomics next-generation sequencing (mNGS) have been made previously, these were only applicable for specific target species in defined samples matrices. Therefore, we developed and validated a generalized probability-based model to assess the sample-specific LOD of mNGS experiments (LOD(mNGS)). Initial rarefaction analyses with datasets of Borna disease virus 1 human encephalitis cases revealed a stochastic behavior of virus read detection. Based on this, we transformed the Bernoulli formula to predict the minimal necessary dataset size to detect one virus read with a probability of 99%. We validated the formula with 30 datasets from diseased individuals, resulting in an accuracy of 99.1% and an average of 4.5 ± 0.4 viral reads found in the calculated minimal dataset size. We demonstrated by modeling the virus genome size, virus-, and total RNA-concentration that the main determinant of mNGS sensitivity is the virus-sample background ratio. The predicted LOD(mNGS) for the respective pathogenic virus in the datasets were congruent with the virus-concentration determined by RT-qPCR. Theoretical assumptions were further confirmed by correlation analysis of mNGS and RT-qPCR data from the samples of the analyzed datasets. This approach should guide standardization of mNGS application, due to the generalized concept of LOD(mNGS). Research Network of Computational and Structural Biotechnology 2020-12-26 /pmc/articles/PMC7822954/ /pubmed/33552445 http://dx.doi.org/10.1016/j.csbj.2020.12.040 Text en © 2021 Friedrich-Loeffler-Institut http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Research Article Ebinger, Arnt Fischer, Susanne Höper, Dirk A theoretical and generalized approach for the assessment of the sample-specific limit of detection for clinical metagenomics |
title | A theoretical and generalized approach for the assessment of the sample-specific limit of detection for clinical metagenomics |
title_full | A theoretical and generalized approach for the assessment of the sample-specific limit of detection for clinical metagenomics |
title_fullStr | A theoretical and generalized approach for the assessment of the sample-specific limit of detection for clinical metagenomics |
title_full_unstemmed | A theoretical and generalized approach for the assessment of the sample-specific limit of detection for clinical metagenomics |
title_short | A theoretical and generalized approach for the assessment of the sample-specific limit of detection for clinical metagenomics |
title_sort | theoretical and generalized approach for the assessment of the sample-specific limit of detection for clinical metagenomics |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7822954/ https://www.ncbi.nlm.nih.gov/pubmed/33552445 http://dx.doi.org/10.1016/j.csbj.2020.12.040 |
work_keys_str_mv | AT ebingerarnt atheoreticalandgeneralizedapproachfortheassessmentofthesamplespecificlimitofdetectionforclinicalmetagenomics AT fischersusanne atheoreticalandgeneralizedapproachfortheassessmentofthesamplespecificlimitofdetectionforclinicalmetagenomics AT hoperdirk atheoreticalandgeneralizedapproachfortheassessmentofthesamplespecificlimitofdetectionforclinicalmetagenomics AT ebingerarnt theoreticalandgeneralizedapproachfortheassessmentofthesamplespecificlimitofdetectionforclinicalmetagenomics AT fischersusanne theoreticalandgeneralizedapproachfortheassessmentofthesamplespecificlimitofdetectionforclinicalmetagenomics AT hoperdirk theoreticalandgeneralizedapproachfortheassessmentofthesamplespecificlimitofdetectionforclinicalmetagenomics |