Cargando…
Analytic optimization of Plasmodium falciparum marker gene haplotype recovery from amplicon deep sequencing of complex mixtures
Molecular epidemiologic studies of malaria parasites commonly employ amplicon deep sequencing (AmpSeq) of marker genes derived from dried blood spots (DBS) to answer public health questions related to topics such as transmission and drug resistance. As these methods are increasingly employed to info...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10473802/ https://www.ncbi.nlm.nih.gov/pubmed/37662206 http://dx.doi.org/10.1101/2023.08.17.23294237 |
_version_ | 1785100338208964608 |
---|---|
author | Lapp, Zena Freedman, Elizabeth Huang, Kathie Markwalter, Christine F Obala, Andrew A Prudhomme-O’Meara, Wendy Taylor, Steve M |
author_facet | Lapp, Zena Freedman, Elizabeth Huang, Kathie Markwalter, Christine F Obala, Andrew A Prudhomme-O’Meara, Wendy Taylor, Steve M |
author_sort | Lapp, Zena |
collection | PubMed |
description | Molecular epidemiologic studies of malaria parasites commonly employ amplicon deep sequencing (AmpSeq) of marker genes derived from dried blood spots (DBS) to answer public health questions related to topics such as transmission and drug resistance. As these methods are increasingly employed to inform direct public health action, it is important to rigorously evaluate the risk of false positive and false negative haplotypes derived from clinically-relevant sample types. We performed a control experiment evaluating haplotype recovery from AmpSeq of 5 marker genes (ama1, csp, msp7, sera2, and trap) from DBS containing mixtures of DNA from 1 to 10 known P. falciparum reference strains across 3 parasite densities in triplicate (n=270 samples). While false positive haplotypes were present across all parasite densities and mixtures, we optimized censoring criteria to remove 83% (148/179) of false positives while removing only 8% (67/859) of true positives. Post-censoring, the median pairwise Jaccard distance between replicates was 0.83. We failed to recover 35% (477/1365) of haplotypes expected to be present in the sample. Haplotypes were more likely to be missed in low-density samples with <1.5 genomes/μL (OR: 3.88, CI: 1.82–8.27, vs. high-density samples with ≥75 genomes/μL) and in samples with lower read depth (OR per 10,000 reads: 0.61, CI: 0.54–0.69). Furthermore, minority haplotypes within a sample were more likely to be missed than dominant haplotypes (OR per 0.01 increase in proportion: 0.96, CI: 0.96–0.97). Finally, in clinical samples the percent concordance across markers for multiplicity of infection ranged from 40%–80%. Taken together, our observations indicate that, with sufficient read depth, haplotypes can be successfully recovered from DBS while limiting the false positive rate. |
format | Online Article Text |
id | pubmed-10473802 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Cold Spring Harbor Laboratory |
record_format | MEDLINE/PubMed |
spelling | pubmed-104738022023-09-02 Analytic optimization of Plasmodium falciparum marker gene haplotype recovery from amplicon deep sequencing of complex mixtures Lapp, Zena Freedman, Elizabeth Huang, Kathie Markwalter, Christine F Obala, Andrew A Prudhomme-O’Meara, Wendy Taylor, Steve M medRxiv Article Molecular epidemiologic studies of malaria parasites commonly employ amplicon deep sequencing (AmpSeq) of marker genes derived from dried blood spots (DBS) to answer public health questions related to topics such as transmission and drug resistance. As these methods are increasingly employed to inform direct public health action, it is important to rigorously evaluate the risk of false positive and false negative haplotypes derived from clinically-relevant sample types. We performed a control experiment evaluating haplotype recovery from AmpSeq of 5 marker genes (ama1, csp, msp7, sera2, and trap) from DBS containing mixtures of DNA from 1 to 10 known P. falciparum reference strains across 3 parasite densities in triplicate (n=270 samples). While false positive haplotypes were present across all parasite densities and mixtures, we optimized censoring criteria to remove 83% (148/179) of false positives while removing only 8% (67/859) of true positives. Post-censoring, the median pairwise Jaccard distance between replicates was 0.83. We failed to recover 35% (477/1365) of haplotypes expected to be present in the sample. Haplotypes were more likely to be missed in low-density samples with <1.5 genomes/μL (OR: 3.88, CI: 1.82–8.27, vs. high-density samples with ≥75 genomes/μL) and in samples with lower read depth (OR per 10,000 reads: 0.61, CI: 0.54–0.69). Furthermore, minority haplotypes within a sample were more likely to be missed than dominant haplotypes (OR per 0.01 increase in proportion: 0.96, CI: 0.96–0.97). Finally, in clinical samples the percent concordance across markers for multiplicity of infection ranged from 40%–80%. Taken together, our observations indicate that, with sufficient read depth, haplotypes can be successfully recovered from DBS while limiting the false positive rate. Cold Spring Harbor Laboratory 2023-08-23 /pmc/articles/PMC10473802/ /pubmed/37662206 http://dx.doi.org/10.1101/2023.08.17.23294237 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use. |
spellingShingle | Article Lapp, Zena Freedman, Elizabeth Huang, Kathie Markwalter, Christine F Obala, Andrew A Prudhomme-O’Meara, Wendy Taylor, Steve M Analytic optimization of Plasmodium falciparum marker gene haplotype recovery from amplicon deep sequencing of complex mixtures |
title | Analytic optimization of Plasmodium falciparum marker gene haplotype recovery from amplicon deep sequencing of complex mixtures |
title_full | Analytic optimization of Plasmodium falciparum marker gene haplotype recovery from amplicon deep sequencing of complex mixtures |
title_fullStr | Analytic optimization of Plasmodium falciparum marker gene haplotype recovery from amplicon deep sequencing of complex mixtures |
title_full_unstemmed | Analytic optimization of Plasmodium falciparum marker gene haplotype recovery from amplicon deep sequencing of complex mixtures |
title_short | Analytic optimization of Plasmodium falciparum marker gene haplotype recovery from amplicon deep sequencing of complex mixtures |
title_sort | analytic optimization of plasmodium falciparum marker gene haplotype recovery from amplicon deep sequencing of complex mixtures |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10473802/ https://www.ncbi.nlm.nih.gov/pubmed/37662206 http://dx.doi.org/10.1101/2023.08.17.23294237 |
work_keys_str_mv | AT lappzena analyticoptimizationofplasmodiumfalciparummarkergenehaplotyperecoveryfromamplicondeepsequencingofcomplexmixtures AT freedmanelizabeth analyticoptimizationofplasmodiumfalciparummarkergenehaplotyperecoveryfromamplicondeepsequencingofcomplexmixtures AT huangkathie analyticoptimizationofplasmodiumfalciparummarkergenehaplotyperecoveryfromamplicondeepsequencingofcomplexmixtures AT markwalterchristinef analyticoptimizationofplasmodiumfalciparummarkergenehaplotyperecoveryfromamplicondeepsequencingofcomplexmixtures AT obalaandrewa analyticoptimizationofplasmodiumfalciparummarkergenehaplotyperecoveryfromamplicondeepsequencingofcomplexmixtures AT prudhommeomearawendy analyticoptimizationofplasmodiumfalciparummarkergenehaplotyperecoveryfromamplicondeepsequencingofcomplexmixtures AT taylorstevem analyticoptimizationofplasmodiumfalciparummarkergenehaplotyperecoveryfromamplicondeepsequencingofcomplexmixtures |