Cargando…

Bias-corrected maximum-likelihood estimation of multiplicity of infection and lineage frequencies

BACKGROUND: The UN’s Sustainable Development Goals are devoted to eradicate a range of infectious diseases to achieve global well-being. These efforts require monitoring disease transmission at a level that differentiates between pathogen variants at the genetic/molecular level. In fact, the advanta...

Descripción completa

Detalles Bibliográficos
Autores principales: Hashemi, Meraj, Schneider, Kristan A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8716058/
https://www.ncbi.nlm.nih.gov/pubmed/34965279
http://dx.doi.org/10.1371/journal.pone.0261889
_version_ 1784624242518654976
author Hashemi, Meraj
Schneider, Kristan A.
author_facet Hashemi, Meraj
Schneider, Kristan A.
author_sort Hashemi, Meraj
collection PubMed
description BACKGROUND: The UN’s Sustainable Development Goals are devoted to eradicate a range of infectious diseases to achieve global well-being. These efforts require monitoring disease transmission at a level that differentiates between pathogen variants at the genetic/molecular level. In fact, the advantages of genetic (molecular) measures like multiplicity of infection (MOI) over traditional metrics, e.g., R(0), are being increasingly recognized. MOI refers to the presence of multiple pathogen variants within an infection due to multiple infective contacts. Maximum-likelihood (ML) methods have been proposed to derive MOI and pathogen-lineage frequencies from molecular data. However, these methods are biased. METHODS AND FINDINGS: Based on a single molecular marker, we derive a bias-corrected ML estimator for MOI and pathogen-lineage frequencies. We further improve these estimators by heuristical adjustments that compensate shortcomings in the derivation of the bias correction, which implicitly assumes that data lies in the interior of the observational space. The finite sample properties of the different variants of the bias-corrected estimators are investigated by a systematic simulation study. In particular, we investigate the performance of the estimator in terms of bias, variance, and robustness against model violations. The corrections successfully remove bias except for extreme parameters that likely yield uninformative data, which cannot sustain accurate parameter estimation. Heuristic adjustments further improve the bias correction, particularly for small sample sizes. The bias corrections also reduce the estimators’ variances, which coincide with the Cramér-Rao lower bound. The estimators are reasonably robust against model violations. CONCLUSIONS: Applying bias corrections can substantially improve the quality of MOI estimates, particularly in areas of low as well as areas of high transmission—in both cases estimates tend to be biased. The bias-corrected estimators are (almost) unbiased and their variance coincides with the Cramér-Rao lower bound, suggesting that no further improvements are possible unless additional information is provided. Additional information can be obtained by combining data from several molecular markers, or by including information that allows stratifying the data into heterogeneous groups.
format Online
Article
Text
id pubmed-8716058
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-87160582021-12-30 Bias-corrected maximum-likelihood estimation of multiplicity of infection and lineage frequencies Hashemi, Meraj Schneider, Kristan A. PLoS One Research Article BACKGROUND: The UN’s Sustainable Development Goals are devoted to eradicate a range of infectious diseases to achieve global well-being. These efforts require monitoring disease transmission at a level that differentiates between pathogen variants at the genetic/molecular level. In fact, the advantages of genetic (molecular) measures like multiplicity of infection (MOI) over traditional metrics, e.g., R(0), are being increasingly recognized. MOI refers to the presence of multiple pathogen variants within an infection due to multiple infective contacts. Maximum-likelihood (ML) methods have been proposed to derive MOI and pathogen-lineage frequencies from molecular data. However, these methods are biased. METHODS AND FINDINGS: Based on a single molecular marker, we derive a bias-corrected ML estimator for MOI and pathogen-lineage frequencies. We further improve these estimators by heuristical adjustments that compensate shortcomings in the derivation of the bias correction, which implicitly assumes that data lies in the interior of the observational space. The finite sample properties of the different variants of the bias-corrected estimators are investigated by a systematic simulation study. In particular, we investigate the performance of the estimator in terms of bias, variance, and robustness against model violations. The corrections successfully remove bias except for extreme parameters that likely yield uninformative data, which cannot sustain accurate parameter estimation. Heuristic adjustments further improve the bias correction, particularly for small sample sizes. The bias corrections also reduce the estimators’ variances, which coincide with the Cramér-Rao lower bound. The estimators are reasonably robust against model violations. CONCLUSIONS: Applying bias corrections can substantially improve the quality of MOI estimates, particularly in areas of low as well as areas of high transmission—in both cases estimates tend to be biased. The bias-corrected estimators are (almost) unbiased and their variance coincides with the Cramér-Rao lower bound, suggesting that no further improvements are possible unless additional information is provided. Additional information can be obtained by combining data from several molecular markers, or by including information that allows stratifying the data into heterogeneous groups. Public Library of Science 2021-12-29 /pmc/articles/PMC8716058/ /pubmed/34965279 http://dx.doi.org/10.1371/journal.pone.0261889 Text en © 2021 Hashemi, Schneider https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Hashemi, Meraj
Schneider, Kristan A.
Bias-corrected maximum-likelihood estimation of multiplicity of infection and lineage frequencies
title Bias-corrected maximum-likelihood estimation of multiplicity of infection and lineage frequencies
title_full Bias-corrected maximum-likelihood estimation of multiplicity of infection and lineage frequencies
title_fullStr Bias-corrected maximum-likelihood estimation of multiplicity of infection and lineage frequencies
title_full_unstemmed Bias-corrected maximum-likelihood estimation of multiplicity of infection and lineage frequencies
title_short Bias-corrected maximum-likelihood estimation of multiplicity of infection and lineage frequencies
title_sort bias-corrected maximum-likelihood estimation of multiplicity of infection and lineage frequencies
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8716058/
https://www.ncbi.nlm.nih.gov/pubmed/34965279
http://dx.doi.org/10.1371/journal.pone.0261889
work_keys_str_mv AT hashemimeraj biascorrectedmaximumlikelihoodestimationofmultiplicityofinfectionandlineagefrequencies
AT schneiderkristana biascorrectedmaximumlikelihoodestimationofmultiplicityofinfectionandlineagefrequencies