
Big Data Warehouse for Healthcare-Sensitive Data Applications

Obesity is a major public health problem worldwide, and the prevalence of childhood obesity is of particular concern. Effective interventions for preventing and treating childhood obesity aim to change behaviour and exposure at the individual, community, and societal levels. However, monitoring and...


Bibliographic Details
Main authors: Shahid, Arsalan, Nguyen, Thien-An Ngoc, Kechadi, M-Tahar
Format: Online Article Text
Language: English
Published: MDPI 2021
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8037603/
https://www.ncbi.nlm.nih.gov/pubmed/33800574
http://dx.doi.org/10.3390/s21072353
_version_ 1783677182396071936
author Shahid, Arsalan
Nguyen, Thien-An Ngoc
Kechadi, M-Tahar
author_facet Shahid, Arsalan
Nguyen, Thien-An Ngoc
Kechadi, M-Tahar
author_sort Shahid, Arsalan
collection PubMed
description Obesity is a major public health problem worldwide, and the prevalence of childhood obesity is of particular concern. Effective interventions for preventing and treating childhood obesity aim to change behaviour and exposure at the individual, community, and societal levels. However, monitoring and evaluating such changes is very challenging. The EU Horizon 2020 project “Big Data against Childhood Obesity (BigO)” aims at gathering large-scale data from a large number of children using different sensor technologies to create comprehensive obesity prevalence models for data-driven predictions about specific policies on a community. It further provides real-time monitoring of the population responses, supported by meaningful real-time data analysis and visualisations. Since BigO involves monitoring and storing of personal data related to the behaviours of a potentially vulnerable population, the data representation, security, and access control are crucial. In this paper, we briefly present the BigO system architecture and focus on the necessary components of the system that deal with data access control, storage, anonymisation, and the corresponding interfaces with the rest of the system. We propose a three-layered data warehouse architecture: The back-end layer consists of a database management system for data collection, de-identification, and anonymisation of the original datasets. The role-based permissions and secured views are implemented in the access control layer. Lastly, the controller layer regulates the data access protocols for any data access and data analysis. We further present the data representation methods and the storage models considering the privacy and security mechanisms. The data privacy and security plans are devised based on the types of personal data collected, the types of users, data storage, data transmission, and data analysis. 
We discuss in detail the challenges of privacy protection in this large distributed data-driven application and implement novel privacy-aware data analysis protocols to ensure that the proposed models guarantee the privacy and security of datasets. Finally, we present the BigO system architecture and its implementation that integrates privacy-aware protocols.
format Online
Article
Text
id pubmed-8037603
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-8037603 2021-04-12 Big Data Warehouse for Healthcare-Sensitive Data Applications Shahid, Arsalan Nguyen, Thien-An Ngoc Kechadi, M-Tahar Sensors (Basel) Article Obesity is a major public health problem worldwide, and the prevalence of childhood obesity is of particular concern. Effective interventions for preventing and treating childhood obesity aim to change behaviour and exposure at the individual, community, and societal levels. However, monitoring and evaluating such changes is very challenging. The EU Horizon 2020 project “Big Data against Childhood Obesity (BigO)” aims at gathering large-scale data from a large number of children using different sensor technologies to create comprehensive obesity prevalence models for data-driven predictions about specific policies on a community. It further provides real-time monitoring of the population responses, supported by meaningful real-time data analysis and visualisations. Since BigO involves monitoring and storing of personal data related to the behaviours of a potentially vulnerable population, the data representation, security, and access control are crucial. In this paper, we briefly present the BigO system architecture and focus on the necessary components of the system that deal with data access control, storage, anonymisation, and the corresponding interfaces with the rest of the system. We propose a three-layered data warehouse architecture: The back-end layer consists of a database management system for data collection, de-identification, and anonymisation of the original datasets. The role-based permissions and secured views are implemented in the access control layer. Lastly, the controller layer regulates the data access protocols for any data access and data analysis. We further present the data representation methods and the storage models considering the privacy and security mechanisms. 
The data privacy and security plans are devised based on the types of personal data collected, the types of users, data storage, data transmission, and data analysis. We discuss in detail the challenges of privacy protection in this large distributed data-driven application and implement novel privacy-aware data analysis protocols to ensure that the proposed models guarantee the privacy and security of datasets. Finally, we present the BigO system architecture and its implementation that integrates privacy-aware protocols. MDPI 2021-03-28 /pmc/articles/PMC8037603/ /pubmed/33800574 http://dx.doi.org/10.3390/s21072353 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/ Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Shahid, Arsalan
Nguyen, Thien-An Ngoc
Kechadi, M-Tahar
Big Data Warehouse for Healthcare-Sensitive Data Applications
title Big Data Warehouse for Healthcare-Sensitive Data Applications
title_full Big Data Warehouse for Healthcare-Sensitive Data Applications
title_fullStr Big Data Warehouse for Healthcare-Sensitive Data Applications
title_full_unstemmed Big Data Warehouse for Healthcare-Sensitive Data Applications
title_short Big Data Warehouse for Healthcare-Sensitive Data Applications
title_sort big data warehouse for healthcare-sensitive data applications
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8037603/
https://www.ncbi.nlm.nih.gov/pubmed/33800574
http://dx.doi.org/10.3390/s21072353
work_keys_str_mv AT shahidarsalan bigdatawarehouseforhealthcaresensitivedataapplications
AT nguyenthienanngoc bigdatawarehouseforhealthcaresensitivedataapplications
AT kechadimtahar bigdatawarehouseforhealthcaresensitivedataapplications
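
Illustrative note: the three-layered warehouse summarised in the description field (back-end, access control, controller) might be sketched roughly as follows. This is only a minimal sketch under stated assumptions, not the BigO implementation: the field names, the roles, the salted-hash pseudonym, and the classes `BackEnd`, `AccessControl`, and `Controller` are all hypothetical, and a salted hash is a simple stand-in for the de-identification and anonymisation steps the record mentions, not a full anonymisation method.

```python
import hashlib
from dataclasses import dataclass, field

# Hypothetical field names and roles; none of these come from the BigO paper.
DIRECT_IDENTIFIERS = {"name", "email"}
ROLE_VIEWS = {
    "clinician": {"pseudonym", "age", "steps"},
    "researcher": {"age", "steps"},
}


def pseudonymise(user_id: str, salt: str) -> str:
    """Replace an identifier with a truncated salted SHA-256 digest (illustrative)."""
    return hashlib.sha256((salt + user_id).encode("utf-8")).hexdigest()[:12]


@dataclass
class BackEnd:
    """Back-end layer: collects records and stores only de-identified copies."""
    salt: str
    rows: list = field(default_factory=list)

    def ingest(self, record: dict) -> None:
        # Drop direct identifiers and the raw user id, keep a pseudonym instead.
        clean = {k: v for k, v in record.items()
                 if k not in DIRECT_IDENTIFIERS and k != "user_id"}
        clean["pseudonym"] = pseudonymise(record["user_id"], self.salt)
        self.rows.append(clean)


class AccessControl:
    """Access control layer: role-based permissions exposed as restricted views."""

    def __init__(self, backend: BackEnd) -> None:
        self.backend = backend

    def view(self, role: str) -> list:
        if role not in ROLE_VIEWS:
            raise PermissionError(f"unknown role: {role}")
        allowed = ROLE_VIEWS[role]
        # Project each stored row onto the columns this role may see.
        return [{k: row[k] for k in allowed if k in row}
                for row in self.backend.rows]


class Controller:
    """Controller layer: single entry point for data access and analysis."""

    def __init__(self, access: AccessControl) -> None:
        self.access = access

    def mean(self, role: str, column: str) -> float:
        values = [r[column] for r in self.access.view(role) if column in r]
        if not values:
            raise PermissionError(f"{role} may not read {column}")
        return sum(values) / len(values)


backend = BackEnd(salt="demo-salt")
backend.ingest({"user_id": "u1", "name": "A", "email": "a@example.org",
                "age": 10, "steps": 4000})
backend.ingest({"user_id": "u2", "name": "B", "email": "b@example.org",
                "age": 12, "steps": 6000})
controller = Controller(AccessControl(backend))
print(controller.mean("researcher", "steps"))
```

In this sketch, analysis code only ever calls the controller, so identifiers removed in the back-end and columns hidden by the role view never reach it.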