Cargando…

Using cluster analysis to reconstruct dengue exposure patterns from cross-sectional serological studies in Singapore

BACKGROUND: Dengue is a mosquito-borne viral disease caused by one of four serotypes (DENV1-4). Infection provides long-term homologous immunity against reinfection with the same serotype. Plaque reduction neutralization test (PRNT) is the gold standard to assess serotype-specific antibody levels. W...

Descripción completa

Detalles Bibliográficos
Autores principales: Sangkaew, Sorawat, Tan, Li Kiang, Ng, Lee Ching, Ferguson, Neil M., Dorigatti, Ilaria
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6969465/
https://www.ncbi.nlm.nih.gov/pubmed/31952539
http://dx.doi.org/10.1186/s13071-020-3898-5
_version_ 1783489335620796416
author Sangkaew, Sorawat
Tan, Li Kiang
Ng, Lee Ching
Ferguson, Neil M.
Dorigatti, Ilaria
author_facet Sangkaew, Sorawat
Tan, Li Kiang
Ng, Lee Ching
Ferguson, Neil M.
Dorigatti, Ilaria
author_sort Sangkaew, Sorawat
collection PubMed
description BACKGROUND: Dengue is a mosquito-borne viral disease caused by one of four serotypes (DENV1-4). Infection provides long-term homologous immunity against reinfection with the same serotype. Plaque reduction neutralization test (PRNT) is the gold standard to assess serotype-specific antibody levels. We analysed serotype-specific antibody levels obtained by PRNT in two serological surveys conducted in Singapore in 2009 and 2013 using cluster analysis, a machine learning technique that was used to identify the most common histories of DENV exposure. METHODS: We explored the use of five distinct clustering methods (i.e. agglomerative hierarchical, divisive hierarchical, K-means, K-medoids and model-based clustering) with varying number (from 4 to 10) of clusters for each method. Weighted rank aggregation, an evaluating technique for a set of internal validity metrics, was adopted to determine the optimal algorithm, comprising the optimal clustering method and the optimal number of clusters. RESULTS: The K-means algorithm with six clusters was selected as the algorithm with the highest weighted rank aggregation. The six clusters were characterised by (i) dominant DENV2 PRNT titres; (ii) co-dominant DENV1 and DENV2 titres with average DENV2 titre > average DENV1 titre; (iii) co-dominant DENV1 and DENV2 titres with average DENV1 titre > average DENV2 titre; (iv) low PRNT titres against DENV1-4; (v) intermediate PRNT titres against DENV1-4; and (vi) dominant DENV1-3 titres. Analyses of the relative size and age-stratification of the clusters by year of sample collection and the application of cluster analysis to the 2009 and 2013 datasets considered separately revealed the epidemic circulation of DENV2 and DENV3 between 2009 and 2013. CONCLUSION: Cluster analysis is an unsupervised machine learning technique that can be applied to analyse PRNT antibody titres (without pre-established cut-off thresholds to indicate protection) to explore common patterns of DENV infection and infer the likely history of dengue exposure in a population.
format Online
Article
Text
id pubmed-6969465
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-69694652020-01-27 Using cluster analysis to reconstruct dengue exposure patterns from cross-sectional serological studies in Singapore Sangkaew, Sorawat Tan, Li Kiang Ng, Lee Ching Ferguson, Neil M. Dorigatti, Ilaria Parasit Vectors Research BACKGROUND: Dengue is a mosquito-borne viral disease caused by one of four serotypes (DENV1-4). Infection provides long-term homologous immunity against reinfection with the same serotype. Plaque reduction neutralization test (PRNT) is the gold standard to assess serotype-specific antibody levels. We analysed serotype-specific antibody levels obtained by PRNT in two serological surveys conducted in Singapore in 2009 and 2013 using cluster analysis, a machine learning technique that was used to identify the most common histories of DENV exposure. METHODS: We explored the use of five distinct clustering methods (i.e. agglomerative hierarchical, divisive hierarchical, K-means, K-medoids and model-based clustering) with varying number (from 4 to 10) of clusters for each method. Weighted rank aggregation, an evaluating technique for a set of internal validity metrics, was adopted to determine the optimal algorithm, comprising the optimal clustering method and the optimal number of clusters. RESULTS: The K-means algorithm with six clusters was selected as the algorithm with the highest weighted rank aggregation. The six clusters were characterised by (i) dominant DENV2 PRNT titres; (ii) co-dominant DENV1 and DENV2 titres with average DENV2 titre > average DENV1 titre; (iii) co-dominant DENV1 and DENV2 titres with average DENV1 titre > average DENV2 titre; (iv) low PRNT titres against DENV1-4; (v) intermediate PRNT titres against DENV1-4; and (vi) dominant DENV1-3 titres. Analyses of the relative size and age-stratification of the clusters by year of sample collection and the application of cluster analysis to the 2009 and 2013 datasets considered separately revealed the epidemic circulation of DENV2 and DENV3 between 2009 and 2013. CONCLUSION: Cluster analysis is an unsupervised machine learning technique that can be applied to analyse PRNT antibody titres (without pre-established cut-off thresholds to indicate protection) to explore common patterns of DENV infection and infer the likely history of dengue exposure in a population. BioMed Central 2020-01-17 /pmc/articles/PMC6969465/ /pubmed/31952539 http://dx.doi.org/10.1186/s13071-020-3898-5 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Sangkaew, Sorawat
Tan, Li Kiang
Ng, Lee Ching
Ferguson, Neil M.
Dorigatti, Ilaria
Using cluster analysis to reconstruct dengue exposure patterns from cross-sectional serological studies in Singapore
title Using cluster analysis to reconstruct dengue exposure patterns from cross-sectional serological studies in Singapore
title_full Using cluster analysis to reconstruct dengue exposure patterns from cross-sectional serological studies in Singapore
title_fullStr Using cluster analysis to reconstruct dengue exposure patterns from cross-sectional serological studies in Singapore
title_full_unstemmed Using cluster analysis to reconstruct dengue exposure patterns from cross-sectional serological studies in Singapore
title_short Using cluster analysis to reconstruct dengue exposure patterns from cross-sectional serological studies in Singapore
title_sort using cluster analysis to reconstruct dengue exposure patterns from cross-sectional serological studies in singapore
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6969465/
https://www.ncbi.nlm.nih.gov/pubmed/31952539
http://dx.doi.org/10.1186/s13071-020-3898-5
work_keys_str_mv AT sangkaewsorawat usingclusteranalysistoreconstructdengueexposurepatternsfromcrosssectionalserologicalstudiesinsingapore
AT tanlikiang usingclusteranalysistoreconstructdengueexposurepatternsfromcrosssectionalserologicalstudiesinsingapore
AT ngleeching usingclusteranalysistoreconstructdengueexposurepatternsfromcrosssectionalserologicalstudiesinsingapore
AT fergusonneilm usingclusteranalysistoreconstructdengueexposurepatternsfromcrosssectionalserologicalstudiesinsingapore
AT dorigattiilaria usingclusteranalysistoreconstructdengueexposurepatternsfromcrosssectionalserologicalstudiesinsingapore