Cargando…

Analytic optimization of Plasmodium falciparum marker gene haplotype recovery from amplicon deep sequencing of complex mixtures

Molecular epidemiologic studies of malaria parasites commonly employ amplicon deep sequencing (AmpSeq) of marker genes derived from dried blood spots (DBS) to answer public health questions related to topics such as transmission and drug resistance. As these methods are increasingly employed to info...

Descripción completa

Detalles Bibliográficos
Autores principales: Lapp, Zena, Freedman, Elizabeth, Huang, Kathie, Markwalter, Christine F, Obala, Andrew A, Prudhomme-O’Meara, Wendy, Taylor, Steve M
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10473802/
https://www.ncbi.nlm.nih.gov/pubmed/37662206
http://dx.doi.org/10.1101/2023.08.17.23294237
_version_ 1785100338208964608
author Lapp, Zena
Freedman, Elizabeth
Huang, Kathie
Markwalter, Christine F
Obala, Andrew A
Prudhomme-O’Meara, Wendy
Taylor, Steve M
author_facet Lapp, Zena
Freedman, Elizabeth
Huang, Kathie
Markwalter, Christine F
Obala, Andrew A
Prudhomme-O’Meara, Wendy
Taylor, Steve M
author_sort Lapp, Zena
collection PubMed
description Molecular epidemiologic studies of malaria parasites commonly employ amplicon deep sequencing (AmpSeq) of marker genes derived from dried blood spots (DBS) to answer public health questions related to topics such as transmission and drug resistance. As these methods are increasingly employed to inform direct public health action, it is important to rigorously evaluate the risk of false positive and false negative haplotypes derived from clinically-relevant sample types. We performed a control experiment evaluating haplotype recovery from AmpSeq of 5 marker genes (ama1, csp, msp7, sera2, and trap) from DBS containing mixtures of DNA from 1 to 10 known P. falciparum reference strains across 3 parasite densities in triplicate (n=270 samples). While false positive haplotypes were present across all parasite densities and mixtures, we optimized censoring criteria to remove 83% (148/179) of false positives while removing only 8% (67/859) of true positives. Post-censoring, the median pairwise Jaccard distance between replicates was 0.83. We failed to recover 35% (477/1365) of haplotypes expected to be present in the sample. Haplotypes were more likely to be missed in low-density samples with <1.5 genomes/μL (OR: 3.88, CI: 1.82–8.27, vs. high-density samples with ≥75 genomes/μL) and in samples with lower read depth (OR per 10,000 reads: 0.61, CI: 0.54–0.69). Furthermore, minority haplotypes within a sample were more likely to be missed than dominant haplotypes (OR per 0.01 increase in proportion: 0.96, CI: 0.96–0.97). Finally, in clinical samples the percent concordance across markers for multiplicity of infection ranged from 40%–80%. Taken together, our observations indicate that, with sufficient read depth, haplotypes can be successfully recovered from DBS while limiting the false positive rate.
format Online
Article
Text
id pubmed-10473802
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-104738022023-09-02 Analytic optimization of Plasmodium falciparum marker gene haplotype recovery from amplicon deep sequencing of complex mixtures Lapp, Zena Freedman, Elizabeth Huang, Kathie Markwalter, Christine F Obala, Andrew A Prudhomme-O’Meara, Wendy Taylor, Steve M medRxiv Article Molecular epidemiologic studies of malaria parasites commonly employ amplicon deep sequencing (AmpSeq) of marker genes derived from dried blood spots (DBS) to answer public health questions related to topics such as transmission and drug resistance. As these methods are increasingly employed to inform direct public health action, it is important to rigorously evaluate the risk of false positive and false negative haplotypes derived from clinically-relevant sample types. We performed a control experiment evaluating haplotype recovery from AmpSeq of 5 marker genes (ama1, csp, msp7, sera2, and trap) from DBS containing mixtures of DNA from 1 to 10 known P. falciparum reference strains across 3 parasite densities in triplicate (n=270 samples). While false positive haplotypes were present across all parasite densities and mixtures, we optimized censoring criteria to remove 83% (148/179) of false positives while removing only 8% (67/859) of true positives. Post-censoring, the median pairwise Jaccard distance between replicates was 0.83. We failed to recover 35% (477/1365) of haplotypes expected to be present in the sample. Haplotypes were more likely to be missed in low-density samples with <1.5 genomes/μL (OR: 3.88, CI: 1.82–8.27, vs. high-density samples with ≥75 genomes/μL) and in samples with lower read depth (OR per 10,000 reads: 0.61, CI: 0.54–0.69). Furthermore, minority haplotypes within a sample were more likely to be missed than dominant haplotypes (OR per 0.01 increase in proportion: 0.96, CI: 0.96–0.97). Finally, in clinical samples the percent concordance across markers for multiplicity of infection ranged from 40%–80%. Taken together, our observations indicate that, with sufficient read depth, haplotypes can be successfully recovered from DBS while limiting the false positive rate. Cold Spring Harbor Laboratory 2023-08-23 /pmc/articles/PMC10473802/ /pubmed/37662206 http://dx.doi.org/10.1101/2023.08.17.23294237 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use.
spellingShingle Article
Lapp, Zena
Freedman, Elizabeth
Huang, Kathie
Markwalter, Christine F
Obala, Andrew A
Prudhomme-O’Meara, Wendy
Taylor, Steve M
Analytic optimization of Plasmodium falciparum marker gene haplotype recovery from amplicon deep sequencing of complex mixtures
title Analytic optimization of Plasmodium falciparum marker gene haplotype recovery from amplicon deep sequencing of complex mixtures
title_full Analytic optimization of Plasmodium falciparum marker gene haplotype recovery from amplicon deep sequencing of complex mixtures
title_fullStr Analytic optimization of Plasmodium falciparum marker gene haplotype recovery from amplicon deep sequencing of complex mixtures
title_full_unstemmed Analytic optimization of Plasmodium falciparum marker gene haplotype recovery from amplicon deep sequencing of complex mixtures
title_short Analytic optimization of Plasmodium falciparum marker gene haplotype recovery from amplicon deep sequencing of complex mixtures
title_sort analytic optimization of plasmodium falciparum marker gene haplotype recovery from amplicon deep sequencing of complex mixtures
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10473802/
https://www.ncbi.nlm.nih.gov/pubmed/37662206
http://dx.doi.org/10.1101/2023.08.17.23294237
work_keys_str_mv AT lappzena analyticoptimizationofplasmodiumfalciparummarkergenehaplotyperecoveryfromamplicondeepsequencingofcomplexmixtures
AT freedmanelizabeth analyticoptimizationofplasmodiumfalciparummarkergenehaplotyperecoveryfromamplicondeepsequencingofcomplexmixtures
AT huangkathie analyticoptimizationofplasmodiumfalciparummarkergenehaplotyperecoveryfromamplicondeepsequencingofcomplexmixtures
AT markwalterchristinef analyticoptimizationofplasmodiumfalciparummarkergenehaplotyperecoveryfromamplicondeepsequencingofcomplexmixtures
AT obalaandrewa analyticoptimizationofplasmodiumfalciparummarkergenehaplotyperecoveryfromamplicondeepsequencingofcomplexmixtures
AT prudhommeomearawendy analyticoptimizationofplasmodiumfalciparummarkergenehaplotyperecoveryfromamplicondeepsequencingofcomplexmixtures
AT taylorstevem analyticoptimizationofplasmodiumfalciparummarkergenehaplotyperecoveryfromamplicondeepsequencingofcomplexmixtures