Cargando…
Extraction of Disease Occurrence Patterns Using MiSTIC: Salmonellosis in Florida
OBJECTIVE: This work leverages spatio-temporal data mining (ST-DM), the MiSTIC (Mining Spatio-Temporally Invariant Cores)[1,6: a. Extent of spatial spread of disease core regions across populations-scale of disease prevalence. b. Possible causes of the observed patterns-for better prediction, detect...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
University of Illinois at Chicago Library
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3692842/ |
_version_ | 1782274668679397376 |
---|---|
author | Raheja, Vipul Rajan, K. S. |
author_facet | Raheja, Vipul Rajan, K. S. |
author_sort | Raheja, Vipul |
collection | PubMed |
description | OBJECTIVE: This work leverages spatio-temporal data mining (ST-DM), the MiSTIC (Mining Spatio-Temporally Invariant Cores)[1,6: a. Extent of spatial spread of disease core regions across populations-scale of disease prevalence. b. Possible causes of the observed patterns-for better prediction, detection & management of infectious disease & its outbreaks. INTRODUCTION: Infectious diseases, though initially tend to be limited geographically to a reservoir; a subsequent spatial variation in disease prevalence (including spread & intensity) arises from the underlying differences in physical-biological conditions that support pathogen, its vectors & reservoirs. Different factors like spatial proximity, physical & social connectivity, & local environmental conditions which add to its susceptibility influence the occurrence[2]. In Disease management, analysis of historical data over various aspects of geography, epidemiology, social structures & network dynamics need to be accounted for. Large amounts of data raise issues of data processing, storage, pattern identification, etc. In addition, identifying the source of disease occurrence & its pattern can be of immense value. ST-DM of disease data can be an effective tool for endemic pre-paredness[3], as it extracts implicit knowledge, spatial & temporal relationships, or other patterns inherent in such databases. Here, Core Region is defined as a set of spatial entities(eg.counties) aggregated over time, which occur frequently at places having high values in a defined region (considering areas of influence around them)[1]. METHODS: Here, MiSTIC algorithm detects spatio-temporally invariant cores with respect to disease occurrence. It involves both a spatial analysis step to detect focal points & a spatio-temporal analysis over the time period of study to identify core regions, which are then classified as –CHD, CLD & CND. They refer to Cores with High, Low and No (mostly random) dominating points respectively based on frequency of occurrences of disease. The predominantly occurring focal points capture the localized behavior of the disease whereas the neighborhood constraints capture the nature (dynamic or non-dynamic) of the event. RESULTS: County-level annual data of Salmonellosis incidence from Florida Department of Health [3] covering a period of 50 years (1961–2010) is used. Two types of cores were identified based on type of neighborhood - Contiguous (CC) & within a defined Radius (CR). Table 1 shows the analysis of counties according to valid frequency criteria for both CC & CR (r=2) & their sub-classification. Salmonellosis etiology shows that it is caused by tainted food, hygiene, local environment etc. which are largely sanitation-related [4]. Taking the level of urbanization [5] as a proxy for sanitation, it can be seen from Fig. 1, 12 of 19 cores occur in rural counties. CONCLUSIONS: It is observed that CC is better indicator of cores than CR, implying that Salmonellosis manifests itself in a highly localized manner. Thus, use of MiSTIC is promising & provides a way for identifying disease “hot-spots”. It also provides valuable insight into the understanding of disease prevalence in different regions based on their history over space and time. [Table: see text] [Figure: see text] |
format | Online Article Text |
id | pubmed-3692842 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | University of Illinois at Chicago Library |
record_format | MEDLINE/PubMed |
spelling | pubmed-36928422013-06-26 Extraction of Disease Occurrence Patterns Using MiSTIC: Salmonellosis in Florida Raheja, Vipul Rajan, K. S. Online J Public Health Inform ISDS 2012 Conference Abstracts OBJECTIVE: This work leverages spatio-temporal data mining (ST-DM), the MiSTIC (Mining Spatio-Temporally Invariant Cores)[1,6: a. Extent of spatial spread of disease core regions across populations-scale of disease prevalence. b. Possible causes of the observed patterns-for better prediction, detection & management of infectious disease & its outbreaks. INTRODUCTION: Infectious diseases, though initially tend to be limited geographically to a reservoir; a subsequent spatial variation in disease prevalence (including spread & intensity) arises from the underlying differences in physical-biological conditions that support pathogen, its vectors & reservoirs. Different factors like spatial proximity, physical & social connectivity, & local environmental conditions which add to its susceptibility influence the occurrence[2]. In Disease management, analysis of historical data over various aspects of geography, epidemiology, social structures & network dynamics need to be accounted for. Large amounts of data raise issues of data processing, storage, pattern identification, etc. In addition, identifying the source of disease occurrence & its pattern can be of immense value. ST-DM of disease data can be an effective tool for endemic pre-paredness[3], as it extracts implicit knowledge, spatial & temporal relationships, or other patterns inherent in such databases. Here, Core Region is defined as a set of spatial entities(eg.counties) aggregated over time, which occur frequently at places having high values in a defined region (considering areas of influence around them)[1]. METHODS: Here, MiSTIC algorithm detects spatio-temporally invariant cores with respect to disease occurrence. It involves both a spatial analysis step to detect focal points & a spatio-temporal analysis over the time period of study to identify core regions, which are then classified as –CHD, CLD & CND. They refer to Cores with High, Low and No (mostly random) dominating points respectively based on frequency of occurrences of disease. The predominantly occurring focal points capture the localized behavior of the disease whereas the neighborhood constraints capture the nature (dynamic or non-dynamic) of the event. RESULTS: County-level annual data of Salmonellosis incidence from Florida Department of Health [3] covering a period of 50 years (1961–2010) is used. Two types of cores were identified based on type of neighborhood - Contiguous (CC) & within a defined Radius (CR). Table 1 shows the analysis of counties according to valid frequency criteria for both CC & CR (r=2) & their sub-classification. Salmonellosis etiology shows that it is caused by tainted food, hygiene, local environment etc. which are largely sanitation-related [4]. Taking the level of urbanization [5] as a proxy for sanitation, it can be seen from Fig. 1, 12 of 19 cores occur in rural counties. CONCLUSIONS: It is observed that CC is better indicator of cores than CR, implying that Salmonellosis manifests itself in a highly localized manner. Thus, use of MiSTIC is promising & provides a way for identifying disease “hot-spots”. It also provides valuable insight into the understanding of disease prevalence in different regions based on their history over space and time. [Table: see text] [Figure: see text] University of Illinois at Chicago Library 2013-04-04 /pmc/articles/PMC3692842/ Text en ©2013 the author(s) http://www.uic.edu/htbin/cgiwrap/bin/ojs/index.php/ojphi/about/submissions#copyrightNotice This is an Open Access article. Authors own copyright of their articles appearing in the Online Journal of Public Health Informatics. Readers may copy articles without permission of the copyright owner(s), as long as the author and OJPHI are acknowledged in the copy and the copy is used for educational, not-for-profit purposes. |
spellingShingle | ISDS 2012 Conference Abstracts Raheja, Vipul Rajan, K. S. Extraction of Disease Occurrence Patterns Using MiSTIC: Salmonellosis in Florida |
title | Extraction of Disease Occurrence Patterns Using MiSTIC: Salmonellosis in Florida |
title_full | Extraction of Disease Occurrence Patterns Using MiSTIC: Salmonellosis in Florida |
title_fullStr | Extraction of Disease Occurrence Patterns Using MiSTIC: Salmonellosis in Florida |
title_full_unstemmed | Extraction of Disease Occurrence Patterns Using MiSTIC: Salmonellosis in Florida |
title_short | Extraction of Disease Occurrence Patterns Using MiSTIC: Salmonellosis in Florida |
title_sort | extraction of disease occurrence patterns using mistic: salmonellosis in florida |
topic | ISDS 2012 Conference Abstracts |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3692842/ |
work_keys_str_mv | AT rahejavipul extractionofdiseaseoccurrencepatternsusingmisticsalmonellosisinflorida AT rajanks extractionofdiseaseoccurrencepatternsusingmisticsalmonellosisinflorida |