Cargando…

Extraction of Disease Occurrence Patterns Using MiSTIC: Salmonellosis in Florida

OBJECTIVE: This work leverages spatio-temporal data mining (ST-DM), the MiSTIC (Mining Spatio-Temporally Invariant Cores)[1,6: a. Extent of spatial spread of disease core regions across populations-scale of disease prevalence. b. Possible causes of the observed patterns-for better prediction, detect...

Descripción completa

Detalles Bibliográficos
Autores principales: Raheja, Vipul, Rajan, K. S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: University of Illinois at Chicago Library 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3692842/
_version_ 1782274668679397376
author Raheja, Vipul
Rajan, K. S.
author_facet Raheja, Vipul
Rajan, K. S.
author_sort Raheja, Vipul
collection PubMed
description OBJECTIVE: This work leverages spatio-temporal data mining (ST-DM), the MiSTIC (Mining Spatio-Temporally Invariant Cores)[1,6: a. Extent of spatial spread of disease core regions across populations-scale of disease prevalence. b. Possible causes of the observed patterns-for better prediction, detection & management of infectious disease & its outbreaks. INTRODUCTION: Infectious diseases, though initially tend to be limited geographically to a reservoir; a subsequent spatial variation in disease prevalence (including spread & intensity) arises from the underlying differences in physical-biological conditions that support pathogen, its vectors & reservoirs. Different factors like spatial proximity, physical & social connectivity, & local environmental conditions which add to its susceptibility influence the occurrence[2]. In Disease management, analysis of historical data over various aspects of geography, epidemiology, social structures & network dynamics need to be accounted for. Large amounts of data raise issues of data processing, storage, pattern identification, etc. In addition, identifying the source of disease occurrence & its pattern can be of immense value. ST-DM of disease data can be an effective tool for endemic pre-paredness[3], as it extracts implicit knowledge, spatial & temporal relationships, or other patterns inherent in such databases. Here, Core Region is defined as a set of spatial entities(eg.counties) aggregated over time, which occur frequently at places having high values in a defined region (considering areas of influence around them)[1]. METHODS: Here, MiSTIC algorithm detects spatio-temporally invariant cores with respect to disease occurrence. It involves both a spatial analysis step to detect focal points & a spatio-temporal analysis over the time period of study to identify core regions, which are then classified as –CHD, CLD & CND. They refer to Cores with High, Low and No (mostly random) dominating points respectively based on frequency of occurrences of disease. The predominantly occurring focal points capture the localized behavior of the disease whereas the neighborhood constraints capture the nature (dynamic or non-dynamic) of the event. RESULTS: County-level annual data of Salmonellosis incidence from Florida Department of Health [3] covering a period of 50 years (1961–2010) is used. Two types of cores were identified based on type of neighborhood - Contiguous (CC) & within a defined Radius (CR). Table 1 shows the analysis of counties according to valid frequency criteria for both CC & CR (r=2) & their sub-classification. Salmonellosis etiology shows that it is caused by tainted food, hygiene, local environment etc. which are largely sanitation-related [4]. Taking the level of urbanization [5] as a proxy for sanitation, it can be seen from Fig. 1, 12 of 19 cores occur in rural counties. CONCLUSIONS: It is observed that CC is better indicator of cores than CR, implying that Salmonellosis manifests itself in a highly localized manner. Thus, use of MiSTIC is promising & provides a way for identifying disease “hot-spots”. It also provides valuable insight into the understanding of disease prevalence in different regions based on their history over space and time. [Table: see text] [Figure: see text]
format Online
Article
Text
id pubmed-3692842
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher University of Illinois at Chicago Library
record_format MEDLINE/PubMed
spelling pubmed-36928422013-06-26 Extraction of Disease Occurrence Patterns Using MiSTIC: Salmonellosis in Florida Raheja, Vipul Rajan, K. S. Online J Public Health Inform ISDS 2012 Conference Abstracts OBJECTIVE: This work leverages spatio-temporal data mining (ST-DM), the MiSTIC (Mining Spatio-Temporally Invariant Cores)[1,6: a. Extent of spatial spread of disease core regions across populations-scale of disease prevalence. b. Possible causes of the observed patterns-for better prediction, detection & management of infectious disease & its outbreaks. INTRODUCTION: Infectious diseases, though initially tend to be limited geographically to a reservoir; a subsequent spatial variation in disease prevalence (including spread & intensity) arises from the underlying differences in physical-biological conditions that support pathogen, its vectors & reservoirs. Different factors like spatial proximity, physical & social connectivity, & local environmental conditions which add to its susceptibility influence the occurrence[2]. In Disease management, analysis of historical data over various aspects of geography, epidemiology, social structures & network dynamics need to be accounted for. Large amounts of data raise issues of data processing, storage, pattern identification, etc. In addition, identifying the source of disease occurrence & its pattern can be of immense value. ST-DM of disease data can be an effective tool for endemic pre-paredness[3], as it extracts implicit knowledge, spatial & temporal relationships, or other patterns inherent in such databases. Here, Core Region is defined as a set of spatial entities(eg.counties) aggregated over time, which occur frequently at places having high values in a defined region (considering areas of influence around them)[1]. METHODS: Here, MiSTIC algorithm detects spatio-temporally invariant cores with respect to disease occurrence. It involves both a spatial analysis step to detect focal points & a spatio-temporal analysis over the time period of study to identify core regions, which are then classified as –CHD, CLD & CND. They refer to Cores with High, Low and No (mostly random) dominating points respectively based on frequency of occurrences of disease. The predominantly occurring focal points capture the localized behavior of the disease whereas the neighborhood constraints capture the nature (dynamic or non-dynamic) of the event. RESULTS: County-level annual data of Salmonellosis incidence from Florida Department of Health [3] covering a period of 50 years (1961–2010) is used. Two types of cores were identified based on type of neighborhood - Contiguous (CC) & within a defined Radius (CR). Table 1 shows the analysis of counties according to valid frequency criteria for both CC & CR (r=2) & their sub-classification. Salmonellosis etiology shows that it is caused by tainted food, hygiene, local environment etc. which are largely sanitation-related [4]. Taking the level of urbanization [5] as a proxy for sanitation, it can be seen from Fig. 1, 12 of 19 cores occur in rural counties. CONCLUSIONS: It is observed that CC is better indicator of cores than CR, implying that Salmonellosis manifests itself in a highly localized manner. Thus, use of MiSTIC is promising & provides a way for identifying disease “hot-spots”. It also provides valuable insight into the understanding of disease prevalence in different regions based on their history over space and time. [Table: see text] [Figure: see text] University of Illinois at Chicago Library 2013-04-04 /pmc/articles/PMC3692842/ Text en ©2013 the author(s) http://www.uic.edu/htbin/cgiwrap/bin/ojs/index.php/ojphi/about/submissions#copyrightNotice This is an Open Access article. Authors own copyright of their articles appearing in the Online Journal of Public Health Informatics. Readers may copy articles without permission of the copyright owner(s), as long as the author and OJPHI are acknowledged in the copy and the copy is used for educational, not-for-profit purposes.
spellingShingle ISDS 2012 Conference Abstracts
Raheja, Vipul
Rajan, K. S.
Extraction of Disease Occurrence Patterns Using MiSTIC: Salmonellosis in Florida
title Extraction of Disease Occurrence Patterns Using MiSTIC: Salmonellosis in Florida
title_full Extraction of Disease Occurrence Patterns Using MiSTIC: Salmonellosis in Florida
title_fullStr Extraction of Disease Occurrence Patterns Using MiSTIC: Salmonellosis in Florida
title_full_unstemmed Extraction of Disease Occurrence Patterns Using MiSTIC: Salmonellosis in Florida
title_short Extraction of Disease Occurrence Patterns Using MiSTIC: Salmonellosis in Florida
title_sort extraction of disease occurrence patterns using mistic: salmonellosis in florida
topic ISDS 2012 Conference Abstracts
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3692842/
work_keys_str_mv AT rahejavipul extractionofdiseaseoccurrencepatternsusingmisticsalmonellosisinflorida
AT rajanks extractionofdiseaseoccurrencepatternsusingmisticsalmonellosisinflorida