Cargando…
A Grid Based Approach to Share Public Health Surveillance Applications - The R Example
OBJECTIVE: This poster describes an approach which leverages grid technology for the epidemiological analysis of public health data. Through a virtual environment, users, particularly epidemiologists, and others unfamiliar with the application, can perform on-demand powerful statistical analyses. IN...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
University of Illinois at Chicago Library
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3692782/ |
_version_ | 1782274654033936384 |
---|---|
author | Davis, Kailah Facelli, Julio |
author_facet | Davis, Kailah Facelli, Julio |
author_sort | Davis, Kailah |
collection | PubMed |
description | OBJECTIVE: This poster describes an approach which leverages grid technology for the epidemiological analysis of public health data. Through a virtual environment, users, particularly epidemiologists, and others unfamiliar with the application, can perform on-demand powerful statistical analyses. INTRODUCTION: Currently, there’s little effective communication and collaboration among public health departments. The lack of collaboration has resulted in more than 300 separate biosurveillance systems (1), which are disease specific, not integrated or interoperable, and may be duplicative (1). Grid architecture is a promising methodology to aid in building a decentralized health surveillance infrastructure because it encourages an ecosystem development culture (2), which has the potential to increase collaboration and decrease duplications. METHODS: This project had two major steps: creation and validation of the grid service. For the first step [creation of the service], we first determined the parameter set required to execute R from the command line. We then used the caGrid Introduce toolkit (3) and Grid Rapid Application Virtualization Interface (gRAVI) (4) to wrap the R command line interface into a grid service. The service was then deployed to the caGrid training grid. After deployment, the service was invoked using the R grid service client which was automatically created by Introduce and gRAVI. Our second step was aimed at validating the service by using using the grid service client to illustrate the working principles of R in a grid environment. For this illustration, we selected the article by Hohle et al (5). In this article, the ‘surveillance’ package was developed to provide different algorithms for the detection of aberrations in routinely collected surveillance data. For validation purposes, only a subset of the analyses presented in the article, namely the Farrington and CUSUM algorithms, were reproduced. Using the grid web client, we uploaded the necessary data files for processing, as well as the Rscript which was used to replicate the results of (5). The application then ran the R script on the execution machine; this machine had all the necessary R packages needed for the specific scenario. RESULTS: The implementation of was validated by showing that the results of the original paper can be reproduced using gird based version of R. Figure 1 shows the plots related to the steps described above; the plots illustrating the Farrington and CUSUM algorithms are seen to be identical to that in (5). CONCLUSIONS: We demonstrated that it is possible to easily deploy applications for public health surveillance uses. We conclude that the techniques we used could be generalized to any application that has a command line interface. Future work will be aimed creating a workflow to access data services and grid-enabled text processing and analytic tools. We believe that by providing a set of examples to demonstrate the benefit of this technology to public health surveillance infrastructure may provide insight that may lead to a better, more collaborative system of tools that will become the future of public health surveillance. |
format | Online Article Text |
id | pubmed-3692782 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | University of Illinois at Chicago Library |
record_format | MEDLINE/PubMed |
spelling | pubmed-36927822013-06-26 A Grid Based Approach to Share Public Health Surveillance Applications - The R Example Davis, Kailah Facelli, Julio Online J Public Health Inform ISDS 2012 Conference Abstracts OBJECTIVE: This poster describes an approach which leverages grid technology for the epidemiological analysis of public health data. Through a virtual environment, users, particularly epidemiologists, and others unfamiliar with the application, can perform on-demand powerful statistical analyses. INTRODUCTION: Currently, there’s little effective communication and collaboration among public health departments. The lack of collaboration has resulted in more than 300 separate biosurveillance systems (1), which are disease specific, not integrated or interoperable, and may be duplicative (1). Grid architecture is a promising methodology to aid in building a decentralized health surveillance infrastructure because it encourages an ecosystem development culture (2), which has the potential to increase collaboration and decrease duplications. METHODS: This project had two major steps: creation and validation of the grid service. For the first step [creation of the service], we first determined the parameter set required to execute R from the command line. We then used the caGrid Introduce toolkit (3) and Grid Rapid Application Virtualization Interface (gRAVI) (4) to wrap the R command line interface into a grid service. The service was then deployed to the caGrid training grid. After deployment, the service was invoked using the R grid service client which was automatically created by Introduce and gRAVI. Our second step was aimed at validating the service by using using the grid service client to illustrate the working principles of R in a grid environment. For this illustration, we selected the article by Hohle et al (5). In this article, the ‘surveillance’ package was developed to provide different algorithms for the detection of aberrations in routinely collected surveillance data. For validation purposes, only a subset of the analyses presented in the article, namely the Farrington and CUSUM algorithms, were reproduced. Using the grid web client, we uploaded the necessary data files for processing, as well as the Rscript which was used to replicate the results of (5). The application then ran the R script on the execution machine; this machine had all the necessary R packages needed for the specific scenario. RESULTS: The implementation of was validated by showing that the results of the original paper can be reproduced using gird based version of R. Figure 1 shows the plots related to the steps described above; the plots illustrating the Farrington and CUSUM algorithms are seen to be identical to that in (5). CONCLUSIONS: We demonstrated that it is possible to easily deploy applications for public health surveillance uses. We conclude that the techniques we used could be generalized to any application that has a command line interface. Future work will be aimed creating a workflow to access data services and grid-enabled text processing and analytic tools. We believe that by providing a set of examples to demonstrate the benefit of this technology to public health surveillance infrastructure may provide insight that may lead to a better, more collaborative system of tools that will become the future of public health surveillance. University of Illinois at Chicago Library 2013-04-04 /pmc/articles/PMC3692782/ Text en ©2013 the author(s) http://www.uic.edu/htbin/cgiwrap/bin/ojs/index.php/ojphi/about/submissions#copyrightNotice This is an Open Access article. Authors own copyright of their articles appearing in the Online Journal of Public Health Informatics. Readers may copy articles without permission of the copyright owner(s), as long as the author and OJPHI are acknowledged in the copy and the copy is used for educational, not-for-profit purposes. |
spellingShingle | ISDS 2012 Conference Abstracts Davis, Kailah Facelli, Julio A Grid Based Approach to Share Public Health Surveillance Applications - The R Example |
title | A Grid Based Approach to Share Public Health Surveillance Applications - The R Example |
title_full | A Grid Based Approach to Share Public Health Surveillance Applications - The R Example |
title_fullStr | A Grid Based Approach to Share Public Health Surveillance Applications - The R Example |
title_full_unstemmed | A Grid Based Approach to Share Public Health Surveillance Applications - The R Example |
title_short | A Grid Based Approach to Share Public Health Surveillance Applications - The R Example |
title_sort | grid based approach to share public health surveillance applications - the r example |
topic | ISDS 2012 Conference Abstracts |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3692782/ |
work_keys_str_mv | AT daviskailah agridbasedapproachtosharepublichealthsurveillanceapplicationstherexample AT facellijulio agridbasedapproachtosharepublichealthsurveillanceapplicationstherexample AT daviskailah gridbasedapproachtosharepublichealthsurveillanceapplicationstherexample AT facellijulio gridbasedapproachtosharepublichealthsurveillanceapplicationstherexample |